Assembly
The Gallus_gallus-5.0 assembly of the chicken genome was released in December 2015 by the International Chicken Genome Consortium. It consists of 34 chromosomes, 1 linkage group and 15,411 unplaced scaffolds.
The genome assembly represented here corresponds to GenBank Assembly ID GCA_000002315.3.
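The accession above follows the usual INSDC/NCBI assembly accession layout: a prefix (`GCA` for GenBank, `GCF` for RefSeq), a nine-digit identifier and a version suffix. As a minimal sketch, the following splits such an accession into its parts. The `AssemblyAccession` type and `parse_accession` helper are names invented for this illustration and are not part of any Ensembl tool.

```python
import re
from typing import NamedTuple


class AssemblyAccession(NamedTuple):
    """Hypothetical container for the parts of an assembly accession."""
    prefix: str   # "GCA" (GenBank) or "GCF" (RefSeq)
    number: str   # nine-digit identifier, kept as text to preserve leading zeros
    version: int  # assembly version, the part after the dot


# Assumed layout: PREFIX_#########.VERSION
ACCESSION_RE = re.compile(r"^(GCA|GCF)_(\d{9})\.(\d+)$")


def parse_accession(text: str) -> AssemblyAccession:
    """Split an accession such as 'GCA_000002315.3' into its components."""
    match = ACCESSION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a recognised assembly accession: {text!r}")
    return AssemblyAccession(match.group(1), match.group(2), int(match.group(3)))


acc = parse_accession("GCA_000002315.3")
print(acc.prefix, acc.number, acc.version)
```

Keeping the numeric part as a string avoids losing the leading zeros, and the integer version makes it straightforward to compare releases of the same assembly.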
Gene annotation
Gallus_gallus-5.0 was annotated using a standard Ensembl genebuild pipeline, incorporating RNA-Seq data (PRJEB12891) and PacBio long-read data (PRJEB13248, PRJEB13246) provided by the Roslin Institute. The annotation process is described in the document below.
PacBio long read data set
Two tissue samples, embryo and brain, were sequenced using PacBio long-read sequencing technology. Both sets were used to add UTRs to gene models and as input to our lincRNA discovery pipeline. The embryo set was sequenced using 5' and 3' capping, so all of its sequences were treated as full-length cDNAs and incorporated into the gene models.
RNA-Seq data set
In addition to the main set, we predicted gene models for each tissue type using the RNA-Seq pipeline. We ran BLASTp on these models against UniProt proteins from vertebrate species with protein existence levels 1 and 2 to confirm the open reading frame (ORF). The best BLAST hit is displayed as transcript supporting evidence. The data were also used to add UTRs to gene models.
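The restriction to protein existence levels 1 and 2 can be illustrated with a small filter over UniProt-style FASTA headers, which carry the level as a `PE=` field. This is a sketch only, not the code used in the Ensembl pipeline. The helper names and the three example records (accessions, names and sequences) are made up for illustration.

```python
import re
from typing import Iterable, Iterator, Tuple

# UniProt FASTA headers declare the protein existence level as "PE=<digit>".
PE_PATTERN = re.compile(r"\bPE=(\d)\b")

Record = Tuple[str, str]  # (header without '>', sequence)


def parse_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_existence(records: Iterable[Record],
                        allowed: Tuple[int, ...] = (1, 2)) -> Iterator[Record]:
    """Keep only records whose header declares an allowed PE level."""
    for header, seq in records:
        match = PE_PATTERN.search(header)
        if match and int(match.group(1)) in allowed:
            yield header, seq


# Made-up example records; accessions and sequences are not real entries.
EXAMPLE_FASTA = [
    ">sp|X00001|EXA1_CHICK Example protein one OS=Gallus gallus OX=9031 GN=EX1 PE=1 SV=1",
    "MKTAYIAKQR",
    ">tr|X00002|X00002_CHICK Example protein two OS=Gallus gallus OX=9031 GN=EX2 PE=2 SV=1",
    "MSLNEQWERT",
    ">tr|X00003|X00003_CHICK Example protein three OS=Gallus gallus OX=9031 GN=EX3 PE=4 SV=1",
    "MPLKVVAGHH",
]

kept = list(filter_by_existence(parse_fasta(EXAMPLE_FASTA)))
for header, _ in kept:
    print(header.split()[0])
```

The filtered set would then serve as the BLASTp target database. Restricting it to vertebrate species would need an additional check on the `OS=` or `OX=` field, which is left out here.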
The tissue-specific sets of transcript models built using our RNA-Seq pipeline are as follows:
| Tissue | Number of gene models |
|---|---|
| Breast muscle | 6132 |
| Bursa | 8388 |
| Caecal tonsil | 10421 |
| Cerebellum | 8567 |
| Duodenum | 8808 |
| Gizzard fat | 8892 |
| Harderian gland | 6544 |
| Heart muscle | 7746 |
| Ileum | 8309 |
| Kidney | 7611 |
| Left optic lobe | 7616 |
| Liver | 8751 |
| Lung | 6774 |
| Ovary | 8112 |
| Pancreas | 8271 |
| Proventriculus | 8316 |
| Skin | 1751 |
| Spleen | 8232 |
| Thymus | 7332 |
| Thyroid | 7766 |
| Trachea | 8987 |
| Merged | 14200 |
More information
General information about this species can be found in Wikipedia.
Statistics
Summary
| Property | Value |
|---|---|
| Assembly | Gallus_gallus-5.0, INSDC Assembly GCA_000002315.3, Dec 2015 |
| Database version | 89.5 |
| Base Pairs | 1,285,637,921 |
| Golden Path Length | 1,230,258,557 |
| Genebuild by | Ensembl |
| Genebuild method | Full genebuild |
| Genebuild started | Jun 2016 |
| Genebuild released | Oct 2016 |
| Genebuild last updated/patched | Dec 2016 |
Gene counts
| Category | Count |
|---|---|
| Coding genes | 18,346 |
| Non coding genes | 6,492 |
| Small non coding genes | 1,705 |
| Long non coding genes | 4,643 |
| Misc non coding genes | 144 |
| Pseudogenes | 43 |
| Gene transcripts | 38,118 |
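The counts above can be cross-checked with a few lines of arithmetic: the three non-coding subcategories should add up to the non-coding total, and the transcript count gives a rough average number of transcripts per gene. The sketch below uses only the figures from the table. Whether the transcript total also covers pseudogene transcripts is not stated here, so the ratio is computed over coding plus non-coding genes as an assumption.

```python
# Figures copied from the gene counts table.
coding = 18_346
non_coding = 6_492
small_nc = 1_705
long_nc = 4_643
misc_nc = 144
pseudogenes = 43
transcripts = 38_118

# The non-coding subcategories should sum to the non-coding total.
subtotal = small_nc + long_nc + misc_nc
assert subtotal == non_coding, (subtotal, non_coding)

# Assumption: pseudogenes are excluded from the denominator.
total_genes = coding + non_coding
transcripts_per_gene = transcripts / total_genes

print(f"total genes (coding + non-coding): {total_genes:,}")
print(f"average transcripts per gene: {transcripts_per_gene:.2f}")
```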
Other
| Feature | Count |
|---|---|
| Genscan gene predictions | 50,996 |
| Short Variants | 20,915,434 |
