Ensembl Macaque

 

Explore the Macaca mulatta genome

Search Ensembl Macaca mulatta

Search:

e.g. chromosome 19 or 1:10000..200000 or Q59FM4.1

Karyotype

Click on a chromosome for a closer view

chromosome X chromosome 20 chromosome 19 chromosome 18 chromosome 17 chromosome 16 chromosome 15 chromosome 14 chromosome 13 chromosome 12 chromosome 11 chromosome 10 chromosome 9 chromosome 8 chromosome 7 chromosome 6 chromosome 5 chromosome 4 chromosome 3 chromosome 2 chromosome 1 Macaque karyotype selector

Jump directly to sequence position

Region:
From (bp):
To (bp):

About the Rhesus macaque genome

Assembly

Rhesus macaque Mmul_1 is a preliminary assembly of the rhesus monkey, Macaca mulatta using whole genome shotgun (WGS) reads from small and medium insert clones. Several WGS libraries, with inserts of 2-4 kb and 10 kb, fosmids with ~35kb inserts, and BACs with 180kb inserts were used to produce the data.

The release was produced by the Macaque Genome Sequencing Consortium, led by the Baylor College of Human Medicine, melding three separate complementary assemblies (created using the Atlas, Celera and PCAP systems). This involved iteratively splitting likely chimeric scaffolds and joining together existing scaffolds where possible. Chimeric scaffolds (<100 total) were identified by breaks in synteny with the human genome, which were confirmed to be artefacts by the other assemblies.

This is a draft sequence and may contain errors so users should exercise caution. Typical errors in draft genome sequences include misassemblies of repeated sequences, collapses of repeated regions, and unmerged overlaps (e.g. due to polymorphisms) creating artificial duplications. However base accuracy in contigs (contiguous blocks of sequence) is usually very high with most errors near the ends of contigs. [More about the assembly].

Annotation

The gene set for macaque was built using the ensembl pipeline. The species-specific resources for macaque are relatively limited, so we decided to take a combined approach utilizing macaque's great similarity to human to aid our annotation efforts. The gene structures are mainly based on alignments to human and macaque protein data. Both macaque and human cDNAs were used to add UTR structures, and finally gene predictions based on Uniprot proteins and human cdnas were used to fill gaps in the annotation.

What's New in Ensembl 49

Macaca mulatta News

There is no Macaca mulatta-specific news this release.

General News

  • Release schedule

    Ensembl release 50 will occur in July (rather than in April as originally scheduled).
    Read more...

  • API changes - regulatory features
    Regulatory feature support has been moved from the core API to the functional genomics API. More information about using the new code...
  • Removal of viral genes
    We have removed a total of about 1200 viral genes from the following species.
    Read more...
  • Mart updates
    RGD and SGD Symbol+ID combinations have been introduced, similar to HGNC, MGI and ZFIN. The issue with subsets of homologs being returned has been resolved.
  • Compara updates

    Multiple alignments

    We are now using the new enredo-pecan-ortheus (EPO) pipeline for 7-way alignments (placental mammals).
    Read more...

More news...

Statistics

Assembly: MMUL 1.0, Feb 2006
Genebuild: Ensembl, Jan 2006
Database version: 49.10h
Known protein-coding genes: 874
Projected protein-coding genes: 17,340
Novel protein-coding genes: 3,691
Pseudogenes: 1,762
RNA genes: 4,003
Genscan gene predictions: 125,893
Gene exons: 246,943
Gene transcripts: 38,146
Base Pairs: 3,093,871,206
Golden Path Length: 3,097,179,960
Most common InterPro domains: Top 40 Top 500

How the statistics are calculated


 

© 2008 WTSI / EBI. Ensembl is available to download for public use - please see the code licence for details.

                
Ensembl release 49 - Mar 2008
HOME · BLAST · BIOMART · SITEMAP HELP