thymidine phosphorylase [Source:HGNC Symbol;Acc:HGNC:3148]
ECGF1, MNGIE
Chromosome 22: 50,525,752-50,530,326 reverse strand.
GRCh38:CM000684.2
This gene has 38 transcripts (splice variants), 172 orthologues and is associated with 2 phenotypes.
| Transcript ID | Name | bp | Protein | Translation ID | Biotype | CCDS | UniProt Match | RefSeq Match | Flags | 
|---|---|---|---|---|---|---|---|---|---|
| ENST00000252029.8 | TYMP-201 | 1586 | 482aa | ENSP00000252029.3 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS14096 | P19971-1 | NM_001953.5 | The Matched Annotation from NCBI and EMBL-EBI is a collaboration between Ensembl/GENCODE and RefSeq. The MANE Select is a default transcript per human gene that is representative of biology, well-supported, expressed and highly-conserved. This transcript set matches GRCh38 and is 100% identical between RefSeq and Ensembl/GENCODE for 5' UTR, CDS, splicing and 3'UTR.MANE Select, A single transcript chosen for a gene which is the most conserved, most highly expressed, has the longest coding sequence and is represented in other key resources, such as NCBI and UniProt. This is defined in detail on http://www.ensembl.org/info/genome/genebuild/canonical.htmlEnsembl Canonical, GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P3: Where the APPRIS core modules are unable to choose a clear principal variant and there more than one of the variants have distinct CCDS identifiers, APPRIS selects the variant with lowest CCDS identifier as the principal variant. The lower the CCDS identifier, the earlier it was annotated. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS P3, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:1, | 
| ENST00000970788.1 | TYMP-235 | 1941 | 539aa | ENSP00000640847.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893016.1 | TYMP-219 | 1926 | 488aa | ENSP00000563075.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893015.1 | TYMP-218 | 1855 | 467aa | ENSP00000563074.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000893017.1 | TYMP-220 | 1780 | 467aa | ENSP00000563076.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000893030.1 | TYMP-233 | 1779 | 486aa | ENSP00000563089.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893025.1 | TYMP-228 | 1767 | 481aa | ENSP00000563084.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000970790.1 | TYMP-237 | 1764 | 539aa | ENSP00000640849.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000487577.5 | TYMP-208 | 1751 | 482aa | ENSP00000498844.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS14096 | P19971-1 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P3: Where the APPRIS core modules are unable to choose a clear principal variant and there more than one of the variants have distinct CCDS identifiers, APPRIS selects the variant with lowest CCDS identifier as the principal variant. The lower the CCDS identifier, the earlier it was annotated. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS P3, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:1, | 
| ENST00000970789.1 | TYMP-236 | 1663 | 482aa | ENSP00000640848.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS14096 | E5KRG5 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P3: Where the APPRIS core modules are unable to choose a clear principal variant and there more than one of the variants have distinct CCDS identifiers, APPRIS selects the variant with lowest CCDS identifier as the principal variant. The lower the CCDS identifier, the earlier it was annotated. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS P3, | 
| ENST00000893028.1 | TYMP-231 | 1648 | 503aa | ENSP00000563087.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893024.1 | TYMP-227 | 1646 | 498aa | ENSP00000563083.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893026.1 | TYMP-229 | 1634 | 498aa | ENSP00000563085.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893021.1 | TYMP-224 | 1627 | 487aa | ENSP00000563080.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS58811 | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | 
| ENST00000893018.1 | TYMP-221 | 1625 | 481aa | ENSP00000563077.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000395680.6 | TYMP-203 | 1621 | 482aa | ENSP00000379037.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS14096 | P19971-1 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P3: Where the APPRIS core modules are unable to choose a clear principal variant and there more than one of the variants have distinct CCDS identifiers, APPRIS selects the variant with lowest CCDS identifier as the principal variant. The lower the CCDS identifier, the earlier it was annotated. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS P3, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:1, | 
| ENST00000395678.7 | TYMP-202 | 1614 | 482aa | ENSP00000379036.3 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS14096 | P19971-1 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P3: Where the APPRIS core modules are unable to choose a clear principal variant and there more than one of the variants have distinct CCDS identifiers, APPRIS selects the variant with lowest CCDS identifier as the principal variant. The lower the CCDS identifier, the earlier it was annotated. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS P3, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:1, | 
| ENST00000893022.1 | TYMP-225 | 1610 | 487aa | ENSP00000563081.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000395681.6 | TYMP-204 | 1601 | 487aa | ENSP00000379038.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS58811 | P19971-2 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:1, | 
| ENST00000893023.1 | TYMP-226 | 1592 | 483aa | ENSP00000563082.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000893027.1 | TYMP-230 | 1583 | 481aa | ENSP00000563086.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS ALT2: For genes in which the APPRIS core modules are unable to choose a clear principal isoform, the ALT1 is the candidate transcript(s) models that appear to be conserved in fewer than three tested species. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS ALT2, | |
| ENST00000893031.1 | TYMP-234 | 1565 | 467aa | ENSP00000563090.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000970791.1 | TYMP-238 | 1564 | 467aa | ENSP00000640850.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000893029.1 | TYMP-232 | 1550 | 467aa | ENSP00000563088.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000893019.1 | TYMP-222 | 1529 | 449aa | ENSP00000563078.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000893020.1 | TYMP-223 | 1527 | 454aa | ENSP00000563079.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000425169.1 | TYMP-205 | 1432 | 445aa | ENSP00000395875.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | C9JGI3 | - | TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incomplete, | |
| ENST00000650719.1 | TYMP-209 | 1101 | 325aa | ENSP00000498276.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A494BZZ4 | - | 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incomplete, | |
| ENST00000652401.1 | TYMP-217 | 866 | 289aa | ENSP00000498619.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A494C0L6 | - | 5' and 3' truncations in transcript evidence prevent annotation of the start and the end of the CDS.CDS 5' and 3' incomplete, | |
| ENST00000651401.1 | TYMP-212 | 616 | 169aa | ENSP00000499115.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A494C1N7 | - | 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incomplete, | |
| ENST00000651196.1 | TYMP-211 | 559 | 137aa | ENSP00000499096.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A494C1L9 | - | 3' truncation in transcript evidence prevents annotation of the end of the CDS.CDS 3' incomplete, | |
| ENST00000651490.1 | TYMP-213 | 259 | 80aa | ENSP00000498433.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A494C0A4 | - | 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incomplete, | |
| ENST00000652352.1 | TYMP-216 | 525 | 107aa | ENSP00000498579.1 | Nonsense mediated decay | A0A494C0L3 | - | 5' truncation in transcript evidence prevents annotation of the start of the CDS.CDS 5' incomplete, | |
| ENST00000487162.1 | TYMP-207 | 2049 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | TSL 2: A transcript where the best supporting mRNA is flagged as suspect or the support is from multiple ESTs The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:2, | |
| ENST00000476284.1 | TYMP-206 | 1572 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000651906.1 | TYMP-214 | 1156 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | - | |
| ENST00000652237.1 | TYMP-215 | 1144 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | - | |
| ENST00000651095.1 | TYMP-210 | 736 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | - | 


