The mosaic structure of variation in the laboratory mouse genome

Most inbred laboratory mouse strains are known to have originated from a mixed but limited founder population in a few laboratories. However, the effect of this breeding history on patterns of genetic variation among these strains and the implications for their use are not well understood. Here we present an analysis of the fine structure of variation in the mouse genome, using single nucleotide polymorphisms (SNPs). When the recently assembled genome sequence from the C57BL/6J strain is aligned with sample sequence from other strains, we observe long segments of either extremely high (∼40 SNPs per 10 kb) or extremely low (∼0.5 SNPs per 10 kb) polymorphism rates. In all strain-to-strain comparisons examined, only one-third of the genome falls into long regions (averaging >1 Mb) of a high SNP rate, consistent with estimated divergence rates between Mus musculus domesticus and either M. m. musculus or M. m. castaneus. These data suggest that the genomes of these inbred strains are mosaics with the vast majority of segments derived from domesticus and musculus sources. These observations have important implications for the design and interpretation of positional cloning experiments.

[1]  Robert W. Williams,et al.  Genetic dissection of complex and quantitative traits: from fantasy to reality via a community effort , 2002, Mammalian Genome.

[2]  F. Bonhomme,et al.  The polyphyletic origin of laboratory inbred mice and their rate of evolution , 1987 .

[3]  Eric S. Lander,et al.  A comprehensive genetic map of the mouse genome , 1996, Nature.

[4]  Janan T. Eppig,et al.  A mouse phenome project , 2000, Mammalian Genome.

[5]  B. Kwon,et al.  Conserved cysteine to serine mutation in tyrosinase is responsible for the classical albino mutation in laboratory mice. , 1990, Nucleic acids research.

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[8]  Janan T. Eppig,et al.  Genealogies of mouse inbred strains , 2000, Nature Genetics.

[9]  E. M. Prager,et al.  Genetic variation and phylogeography of central Asian and other house mice, including a major new mitochondrial lineage in Yemen. , 1998, Genetics.

[10]  Eric S. Lander,et al.  Large-scale discovery and genotyping of single-nucleotide polymorphisms in the mouse , 2000, Nature Genetics.

[11]  Eric S. Lander,et al.  A comprehensive genetic map of the mouse genome , 1996, Nature.

[12]  C. V. Jongeneel,et al.  Numerous potentially functional but non-genic conserved sequences on human chromosome 21 , 2002, Nature.

[13]  D. Lowy,et al.  Bovine papillomavirus E2 trans-activating gene product binds to specific sites in papillomavirus DNA , 1987, Nature.

[14]  Muriel T. Davisson,et al.  Genetic variation among 129 substrains and its importance for targeted mutagenesis in mice , 1997, Nature Genetics.

[15]  H. Himmelbauer,et al.  Panel of microsatellite markers for whole-genome scans and radiation hybrid mapping and a mouse family tree. , 1999, Genome research.

[16]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[17]  B. Berger,et al.  ARACHNE: a whole-genome shotgun assembler. , 2002, Genome research.

[18]  J. Mullikin,et al.  SSAHA: a fast search method for large DNA databases. , 2001, Genome research.

[19]  A. Wilson,et al.  Evidence from mtDNA sequences that common laboratory strains of inbred mice are descended from a single female , 1982, Nature.

[20]  O. Gotoh,et al.  RELATIONSHIP BETWEEN LABORATORY MICE AND THE SUBSPECIES MUS MUSCULUS DOMESTICUS BASED ON RESTRICTION ENDONUCLEASE CLEAVAGE PATTERNS OF MITOCHONDRIAL DNA , 1980 .

[21]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[22]  D. Hatat,et al.  Most classical Mus musculus domesticus laboratory mouse strains carry a Mus musculus musculus Y chromosome , 1985, Nature.