Dog
"Tasha" - Canis lupus familiaris
(NHGRI Press Photos)

The May 2005 dog (Canis familiaris) whole genome shotgun (WGS) assembly v2.0 was sequenced and assembled by the Broad Institute of MIT/Harvard and Agencourt Bioscience (now part of Beckman Coulter Genomics). For more information about this assembly, see CanFam2.0 in the NCBI Assembly database.

Sample Position Queries

A genome position can be specified by the accession number of a sequenced genomic clone, an mRNA or EST, a chromosomal coordinate range, or keywords from the GenBank description of an mRNA. The following list provides examples of various types of position queries for the dog genome. See the User's Guide for more information.

Request:   Genome Browser Response:

chr16   Displays all of chromosome 16
chr16:1-5000000   Displays first 5 million bases of chr 16
chr16:1000000+2000   Displays a region of chr 16 that spans 2000 bases, starting with position 1000000
AY572227   Displays location of mRNA with GenBank accession AY572227
BM538765   Displays location of EST with GenBank accession BM538765
chr6_8.29   Displays location of Genscan gene prediction with identifier chr6_8.29

pseudogene mRNA   Lists transcribed pseudogenes but not cDNAs
zinc finger   Lists zinc finger mRNAs
breast cancer   Lists mRNAs associated with breast cancer susceptibility proteins
evans   Lists mRNAs deposited by scientist named Evans
Murphy,K.E.   Lists mRNAs deposited by co-author K.E. Murphy

Use this last format (Murphy,K.E.) for author queries. Although GenBank requires the search format Murphy KE, internally it stores author names in the format Murphy,K.E.
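The coordinate and author formats above can be illustrated with a short Python sketch. This is not the Genome Browser's own parser: the names `Region`, `parse_position`, and `genbank_author_to_browser` are hypothetical, and the code assumes that coordinates are 1-based and inclusive, so that `chr16:1000000+2000` covers positions 1000000 through 1001999.

```python
import re
from typing import NamedTuple, Optional


class Region(NamedTuple):
    """A parsed coordinate query (hypothetical helper type)."""
    chrom: str
    start: Optional[int]  # assumed 1-based, inclusive; None = whole chromosome
    end: Optional[int]    # assumed 1-based, inclusive; None = whole chromosome


# Patterns for the three coordinate forms shown in the table.
# The chromosome name excludes "_" and "." so that Genscan identifiers
# such as chr6_8.29 are not mistaken for chromosomes.
_CHROM = r"(chr[0-9A-Za-z]+)"
_RANGE = re.compile(_CHROM + r":(\d+)-(\d+)")
_START_LEN = re.compile(_CHROM + r":(\d+)\+(\d+)")
_WHOLE = re.compile(_CHROM)


def parse_position(query: str) -> Optional[Region]:
    """Return a Region for coordinate queries, or None for anything else
    (accessions, gene prediction identifiers, keywords, author names)."""
    q = query.strip()
    m = _RANGE.fullmatch(q)
    if m:
        return Region(m.group(1), int(m.group(2)), int(m.group(3)))
    m = _START_LEN.fullmatch(q)
    if m:
        start, length = int(m.group(2)), int(m.group(3))
        # A span of `length` bases starting at `start` ends at start+length-1.
        return Region(m.group(1), start, start + length - 1)
    m = _WHOLE.fullmatch(q)
    if m:
        return Region(m.group(1), None, None)
    return None


def genbank_author_to_browser(name: str) -> str:
    """Convert the GenBank search form 'Murphy KE' to 'Murphy,K.E.'."""
    surname, initials = name.rsplit(" ", 1)
    return surname + "," + "".join(c + "." for c in initials)


print(parse_position("chr16:1000000+2000"))
print(genbank_author_to_browser("Murphy KE"))
```

Queries that are not coordinates, such as AY572227 or chr6_8.29, return `None` here; in the browser they are resolved by accession, identifier, or keyword lookup instead.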

Assembly Details

The dog genome contains approximately 2.5 billion base pairs. This sequence is based on 7.6X coverage of the dog genome, assuming a WGS assembly size of 2.4 Gb. The boxer breed was selected for the initial sequencing effort based on the lower variation rate in its genome relative to other breeds. For more information about this assembly, see the Broad Institute Dog Genome Sequencing Project web page.
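As a rough reading of the coverage figure, the sketch below assumes the common definition of fold coverage as total sequenced bases divided by assembly size. Under that assumption, 7.6X coverage of a 2.4 Gb assembly corresponds to roughly 18 Gb of read sequence. The function name is hypothetical, and the sequencing project may have computed its figure differently.

```python
def implied_read_bases_gb(coverage_fold: float, assembly_size_gb: float) -> float:
    """Total sequenced bases (in Gb) implied by a fold-coverage figure,
    assuming coverage = total read bases / assembly size."""
    return coverage_fold * assembly_size_gb


total_gb = implied_read_bases_gb(7.6, 2.4)
print(f"{total_gb:.2f} Gb of sequenced bases")  # prints "18.24 Gb of sequenced bases"
```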

The canFam2 sequence and annotation data can be downloaded from the UCSC Genome Browser FTP server or downloads page.

Many thanks to the Broad Institute of MIT/Harvard, NHGRI, and the many institutions that contributed to the sequencing and mapping effort for this release. The canFam2 annotation tracks were generated by UCSC and collaborators worldwide. See the credits page for a detailed list of the organizations and individuals who contributed to this release.


GenBank Pipeline Details

For the purposes of the GenBank alignment pipeline, this assembly is considered to be: well-ordered.