Caenorhabditis elegans (Nematode, N2) assembly and gene annotation

Name: Ensembl Caenorhabditis elegans (Nematode, N2) Gene Set
Creator: WormBase
License: https://www.apache.org/licenses/LICENSE-2.0
Keywords: genebuild, transcripts, transcription, alignment, loci

Caenorhabditis elegans is a free-living, transparent nematode, about 1 mm in length, that lives in temperate soil environments. In 1974, Sydney Brenner began research into the molecular and developmental biology of C. elegans, which has since been extensively used as a model organism, being the first multicellular species to have its whole genome sequenced.

Assembly

Genome sequence and annotation have been imported from the WS269 release of WormBase (which includes the WBcel235 version of the C.elegans reference genome). Included are annotated operons, genes, transcripts and translations, as well as RNAi and BLAST/BLAT homology data. In addition Affymetrix and Agilent cross references for C.elegans expression arrays have been added.

We include C.elegans in Ensembl (and also Ensembl Genomes) to allow people to access the data through the Ensembl user interface (both for visualisation and data mining) and to provide cross-species integration through our comparative genomics resources (such as homologous gene links and protein family pages).

Other assemblies

WS190 (Ensembl release 54)

Gene annotation

This species and other invertebrates are also available from our sister site, Ensembl Metazoa

More information

General information about this species can be found in Wikipedia.

Statistics

Summary

Assembly	WBcel235, INSDC Assembly GCA_000002985.3, Dec 2012
Base Pairs	100,286,401
Golden Path Length	100,286,401
Assembly provider	WormBase
Annotation provider	WormBase
Annotation method	Import
Genebuild started	Jan 2022
Genebuild released	Oct 2014
Genebuild last updated/patched	Oct 2014
Database version	115.282

Gene counts

Gene/transcipt that contains an open reading frame (ORF).Coding genes	19,985
Non coding genes	24,813
Small non coding genes	24,519
Long non coding genes	294
A gene that has homology to known protein-coding genes but contain a frameshift and/or stop codon(s) which disrupts the ORF. Thought to have arisen through duplication followed by loss of function.Pseudogenes	2,128
A transcript is the operational unit of a gene. In a genomic context, transcripts consist of one or more exons, with adjoining exons being separated by introns. The exons/introns are transcribed and then the introns spliced out. Transcripts may or may not encode a proteinGene transcripts	60,000

Caenorhabditis elegans (Nematode, N2) assembly and gene annotation

Assembly

Other assemblies

Gene annotation

More information

Statistics

Summary

Gene counts

About Us

Get help

Our sister sites

Follow us

Favourite species

All species

Caenorhabditis elegans (Nematode, N2) assembly and gene annotation

Assembly

Other assemblies

Gene annotation

More information

Statistics

Summary

Gene counts

About Us

Get help

Our sister sites

Follow us