The Sscrofa10.2 assembly of the pig genome was produced in August 2011 by the Swine Genome Sequencing Consortium (SGSC). It consists of 20 chromosomes (1-18, X and Y) and 4562 unplaced scaffolds. This genome assembly has GCA_000003025.4 as its GenBank assembly accession.
Sscrofa10.2 was annotated using the standard Ensembl automatic gene annotation system, incorporating RNA-Seq data provided by the SGSC. The annotation process is described in the document below. The Ensembl annotations were then merged with Vega annotations at the transcript level. Transcripts were merged if they shared the same internal exon-intron boundaries (i.e. had an identical splicing pattern), with slight differences in the terminal exons allowed. Importantly, all Vega source transcripts were included in the final merged gene set. The Vega annotations comprised manual annotation of 2,000 genes, both from Havana and from the Immune Response Annotation Group (IRAG) community annotation initiative, which was performed under the guidance of the Havana group.
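The transcript-level merge rule described above can be illustrated with a minimal Python sketch. It is not the Ensembl/Vega merge code. The function names `intron_chain` and `can_merge`, the exon representation, and the `max_terminal_diff` tolerance are hypothetical and chosen only for illustration. The source says only that "slight differences" in the terminal exons were allowed, so the 100 bp default is an assumption, as is the decision to leave single-exon transcripts unmerged.

```python
from typing import List, Tuple

# An exon as (start, end) in genomic coordinates; all exons of the two
# transcripts being compared are assumed to lie on the same chromosome and strand.
Exon = Tuple[int, int]


def intron_chain(exons: List[Exon]) -> List[Tuple[int, int]]:
    """Return the internal exon-intron boundaries of a transcript.

    Each entry pairs the end of one exon with the start of the next exon,
    in genomic order. A single-exon transcript yields an empty list.
    """
    ordered = sorted(exons)
    return [(ordered[i][1], ordered[i + 1][0]) for i in range(len(ordered) - 1)]


def can_merge(a: List[Exon], b: List[Exon], max_terminal_diff: int = 100) -> bool:
    """Decide whether two transcripts share a splicing pattern.

    max_terminal_diff is a hypothetical tolerance (in bp) for the outer ends
    of the first and last exons; the real threshold is not given in the text.
    """
    chain_a, chain_b = intron_chain(a), intron_chain(b)
    # Assumption: transcripts without introns are never merged in this sketch.
    if not chain_a or chain_a != chain_b:
        return False
    a_sorted, b_sorted = sorted(a), sorted(b)
    start_diff = abs(a_sorted[0][0] - b_sorted[0][0])
    end_diff = abs(a_sorted[-1][1] - b_sorted[-1][1])
    return start_diff <= max_terminal_diff and end_diff <= max_terminal_diff


# Made-up coordinates: same internal boundaries, slightly different outer ends.
ensembl_tx = [(100, 200), (300, 400), (500, 650)]
vega_tx = [(120, 200), (300, 400), (500, 600)]
# Made-up coordinates: the second exon starts at a different position.
other_tx = [(100, 200), (310, 400), (500, 650)]

print(can_merge(ensembl_tx, vega_tx))   # True
print(can_merge(ensembl_tx, other_tx))  # False
```

Comparing the intron chain rather than whole exons is what lets the terminal exons vary while every internal boundary must still match exactly.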
General information about this species can be found in Wikipedia.
|Assembly||Sscrofa10.2, INSDC Assembly GCA_000003025.4, Aug 2011|
|Golden Path Length||2,808,525,991|
|Genebuild method||Full genebuild|
|Genebuild started||Sep 2011|
|Genebuild released||May 2012|
|Genebuild last updated/patched||Feb 2014|
|Coding genes||21,630 (incl 10 readthrough)|
|Non coding genes||3,124|
|Small non coding genes||2,804|
|Long non coding genes||135 (incl 1 readthrough)|
|Misc non coding genes||185|
|Genscan gene predictions||52,372|
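
The gene counts in the table can be cross-checked with a short Python sketch. It assumes, although the page does not say so explicitly, that the "Non coding genes" figure is the sum of the small, long and miscellaneous non-coding categories. The dictionary simply restates the table values, and the variable names are illustrative.

```python
# Counts copied from the statistics table above.
counts = {
    "Coding genes": 21630,
    "Non coding genes": 3124,
    "Small non coding genes": 2804,
    "Long non coding genes": 135,
    "Misc non coding genes": 185,
}

# Assumption: the non-coding total is the sum of these three subcategories.
subcategories = (
    "Small non coding genes",
    "Long non coding genes",
    "Misc non coding genes",
)
non_coding_sum = sum(counts[name] for name in subcategories)

print(non_coding_sum)                                # 3124
print(non_coding_sum == counts["Non coding genes"])  # True
```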