cathepsin D [Source:HGNC Symbol;Acc:HGNC:2529]
CLN10, CPSD
Chromosome 11: 1,752,752-1,764,573 reverse strand.
GRCh38:CM000673.2
This gene has 25 transcripts (splice variants), 213 orthologues, 9 paralogues and is associated with 3 phenotypes.
| Transcript ID | Name | bp | Protein | Translation ID | Biotype | CCDS | UniProt Match | RefSeq Match | Flags | 
|---|---|---|---|---|---|---|---|---|---|
| ENST00000236671.7 | CTSD-201 | 2055 | 412aa | ENSP00000236671.2 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | CCDS7725 | P07339 V9HWI3 | NM_001909.5 | The Matched Annotation from NCBI and EMBL-EBI is a collaboration between Ensembl/GENCODE and RefSeq. The MANE Select is a default transcript per human gene that is representative of biology, well-supported, expressed and highly-conserved. This transcript set matches GRCh38 and is 100% identical between RefSeq and Ensembl/GENCODE for 5' UTR, CDS, splicing and 3'UTR.MANE Select, A single transcript chosen for a gene which is the most conserved, most highly expressed, has the longest coding sequence and is represented in other key resources, such as NCBI and UniProt. This is defined in detail on http://www.ensembl.org/info/genome/genebuild/canonical.htmlEnsembl Canonical, GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 1: A transcript where all splice junctions are supported by at least one non-suspect mRNA. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:1, | 
| ENST00000367196.4 | CTSD-202 | 2190 | 377aa | ENSP00000356164.4 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | F8W787 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000962446.1 | CTSD-224 | 2170 | 467aa | ENSP00000632505.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000916370.1 | CTSD-221 | 2104 | 410aa | ENSP00000586429.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000962444.1 | CTSD-222 | 2061 | 408aa | ENSP00000632503.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, APPRIS P1: Transcript(s) expected to code for the main functional isoform based solely on the core modules in the APPRIS. APPRIS is a system to annotate alternatively spliced transcripts based on a range of computational methods to identify the most functionally important transcript(s) of a gene.APPRIS P1, | |
| ENST00000962445.1 | CTSD-223 | 2053 | 406aa | ENSP00000632504.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000962447.1 | CTSD-225 | 2047 | 427aa | ENSP00000632506.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000637815.2 | CTSD-212 | 2019 | 406aa | ENSP00000490344.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A1B0GV23 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000907824.1 | CTSD-218 | 2004 | 395aa | ENSP00000577883.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000637915.1 | CTSD-213 | 1982 | 409aa | ENSP00000490471.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A1B0GVD5 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000429746.2 | CTSD-203 | 1973 | 377aa | ENSP00000402586.2 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | F8W787 | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 3: A transcript where the only support is from a single EST The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:3, | |
| ENST00000907825.1 | CTSD-219 | 1905 | 371aa | ENSP00000577884.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000636571.1 | CTSD-207 | 1790 | 405aa | ENSP00000490770.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A1B0GW44 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000907826.1 | CTSD-220 | 1708 | 313aa | ENSP00000577885.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000637387.1 | CTSD-211 | 1597 | 405aa | ENSP00000490598.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A1B0GVP3 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000438213.6 | CTSD-205 | 1544 | 451aa | ENSP00000415036.2 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | C9JH19 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 2: A transcript where the best supporting mRNA is flagged as suspect or the support is from multiple ESTs The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:2, | |
| ENST00000636843.1 | CTSD-208 | 1512 | 410aa | ENSP00000490897.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | A0A1B0GWE8 | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000907823.1 | CTSD-217 | 1323 | 168aa | ENSP00000577882.1 | Gene/transcipt that contains an open reading frame (ORF).Protein coding | - | - | GENCODE Primary represents a minimal set that contains MANE Select, MANE Plus Clinical and Ensembl Canonical transcripts and transcripts containing any conserved exons and common alternative splicing events (including exons skips) that are absent from the MANE and Ensembl Canonical transcripts for protein-coding genes. Other biotypes will have the GENCODE Primary flag added to the Ensembl Canonical transcript and for lncRNA genes only this will be the transcripts with the longest genomic span.GENCODE Primary, A subset of the GENCODE transcript set, containing only 5' and 3' complete transcripts at protein-coding genes.GENCODE Basic, | |
| ENST00000678991.1 | CTSD-216 | 2198 | 93aa | ENSP00000503019.1 | Nonsense mediated decay | A0A7I2V2N3 | - | - | |
| ENST00000433655.6 | CTSD-204 | 2035 | 276aa | ENSP00000404902.1 | Nonsense mediated decay | F8WD96 | - | TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000637937.1 | CTSD-214 | 1262 | No protein | - | Alternatively spliced transcript of a protein coding gene for which we cannot define a CDS.Protein coding CDS not defined | - | - | TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000637158.1 | CTSD-209 | 1200 | No protein | - | Alternatively spliced transcript of a protein coding gene for which we cannot define a CDS.Protein coding CDS not defined | - | - | TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000677300.1 | CTSD-215 | 1081 | No protein | - | Alternatively spliced transcript of a protein coding gene for which we cannot define a CDS.Protein coding CDS not defined | - | - | - | |
| ENST00000637381.2 | CTSD-210 | 4408 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | TSL 5: A transcript where no single transcript supports the model structure. The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:5, | |
| ENST00000497544.3 | CTSD-206 | 798 | No protein | - | An alternatively spliced transcript believed to contain intronic sequence relative to other, coding, transcripts of the same gene.Retained intron | - | - | TSL 2: A transcript where the best supporting mRNA is flagged as suspect or the support is from multiple ESTs The Transcript Support Level (TSL) is a method to highlight the well-supported and poorly-supported transcript models for users, based on the type and quality of the alignments used to annotate the transcript.TSL:2, | 


