ENST00000251020.9 | SALL1-201 | 5134 | 1324aa | ENSP00000251020.4 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS10747 | Q9NSC2-1 | NM_002968.3 | The Matched Annotation from NCBI and EMBL-EBI is a collaboration between Ensembl/GENCODE and RefSeq. The MANE Select is a default transcript per human gene that is representative of biology, well-supported, expressed and highly-conserved. This transcript set matches GRCh38 and is 100% identical between RefSeq and Ensembl/GENCODE for 5' UTR, CDS, splicing and 3'UTR.MANE Select, A single transcript chosen for a gene which is the most conserved, most highly expressed, has the longest coding sequence and is represented in other key resources, such as NCBI and UniProt. This is defined in detail on http://www.ensembl.org/info/genome/genebuild/canonical.htmlEnsembl Canonical, GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P2: Where the APPRIS core modules are unable to choose a clear principal variant (approximately 25% of human protein coding genes), the database chooses two or more of the CDS variants as "candidates" to be the principal variant. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene. APPRIS P2, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript. TSL:1, |
ENST00000884629.1 | SALL1-208 | 5254 | 1324aa | ENSP00000554688.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS10747 | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P2: Where the APPRIS core modules are unable to choose a clear principal variant (approximately 25% of human protein coding genes), the database chooses two or more of the CDS variants as "candidates" to be the principal variant. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene. APPRIS P2, |
ENST00000911277.1 | SALL1-209 | 5246 | 1324aa | ENSP00000581336.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS10747 | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P2: Where the APPRIS core modules are unable to choose a clear principal variant (approximately 25% of human protein coding genes), the database chooses two or more of the CDS variants as "candidates" to be the principal variant. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene. APPRIS P2, |
ENST00000685868.1 | SALL1-206 | 5223 | 1324aa | ENSP00000509873.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS10747 | Q9NSC2-1 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P2: Where the APPRIS core modules are unable to choose a clear principal variant (approximately 25% of human protein coding genes), the database chooses two or more of the CDS variants as "candidates" to be the principal variant. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene. APPRIS P2, |
ENST00000440970.6 | SALL1-202 | 5122 | 1324aa | ENSP00000407914.2 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS10747 | Q9NSC2-1 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P2: Where the APPRIS core modules are unable to choose a clear principal variant (approximately 25% of human protein coding genes), the database chooses two or more of the CDS variants as "candidates" to be the principal variant. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene. APPRIS P2, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript. TSL:5, |
ENST00000570206.2 | SALL1-205 | 4116 | 1227aa | ENSP00000456777.2 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS45483 | Q9NSC2-2 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene. APPRIS ALT2, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript. TSL:5, |
ENST00000690502.1 | SALL1-207 | 4035 | 1196aa | ENSP00000510560.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | | A0A8I5KRF8 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, |
ENST00000566102.1 | SALL1-204 | 565 | 34aa | ENSP00000455582.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | | H3BQ32 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript. TSL:1, |
ENST00000562674.1 | SALL1-203 | 407 | No protein | - | Alternatively spliced transcript of a protein coding gene for which we cannot define a CDS.Protein coding CDS not defined | | - | - | TSL 3: A transcript where the only support is from a single EST The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript. TSL:3, |