Commercially Available Outbred Mice for Genome-Wide Association Studies

Genome-wide association studies using commercially available outbred mice can detect genes involved in phenotypes of biomedical interest. Useful populations need high-frequency alleles to ensure high power to detect quantitative trait loci (QTLs), low linkage disequilibrium between markers to obtain accurate mapping resolution, and an absence of population structure to prevent false positive associations. We surveyed 66 colonies for inbreeding, genetic diversity, and linkage disequilibrium, and we demonstrate that some have haplotype blocks of less than 100 Kb, enabling gene-level mapping resolution. The same alleles contribute to variation in different colonies, so that when mapping progress stalls in one, another can be used in its stead. Colonies are genetically diverse: 45% of the total genetic variation is attributable to differences between colonies. However, quantitative differences in allele frequencies, rather than the existence of private alleles, are responsible for these population differences. The colonies derive from a limited pool of ancestral haplotypes resembling those found in inbred strains: over 95% of sequence variants segregating in outbred populations are found in inbred strains. Consequently it is possible to impute the sequence of any mouse from a dense SNP map combined with inbred strain sequence data, which opens up the possibility of cataloguing and testing all variants for association, a situation that has so far eluded studies in completely outbred populations. We demonstrate the colonies' potential by identifying a deletion in the promoter of H2-Ea as the molecular change that strongly contributes to setting the ratio of CD4+ and CD8+ lymphocytes.

[1]  Carol J. Bult,et al.  Mouse Phenome Database (MPD) , 2011, Nucleic Acids Res..

[2]  Nicole Soranzo,et al.  Quantitative trait loci for CD4:CD8 lymphocyte ratio are associated with risk of type 1 diabetes and HIV-1 immune control. , 2010, American journal of human genetics.

[3]  David B. Goldstein,et al.  Rare Variants Create Synthetic Genome-Wide Associations , 2010, PLoS biology.

[4]  K. Holsinger,et al.  Genetics in geographically structured populations: defining, estimating and interpreting FST , 2009, Nature Reviews Genetics.

[5]  Alkes L. Price,et al.  Reconstructing Indian Population History , 2009, Nature.

[6]  Yueming Ding,et al.  A customized and versatile high-density genotyping array for the mouse , 2009, Nature Methods.

[7]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[8]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[9]  Eleazar Eskin,et al.  High-Resolution Mapping of Gene Expression Using Association in an Outbred Mouse Stock , 2008, PLoS genetics.

[10]  Qizhai Li,et al.  Improved correction for population stratification in genome‐wide association studies by identifying hidden population structures , 2008, Genetic epidemiology.

[11]  Robert D Schnabel,et al.  SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries , 2008, Nature Methods.

[12]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[13]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[14]  Eleazar Eskin,et al.  A sequence-based variation map of 8.27 million SNPs in inbred mouse strains , 2007, Nature.

[15]  Wei Wang,et al.  The polymorphism architecture of mouse genetic resources elucidated using genome-wide resequencing data: implications for QTL discovery and systems genetics , 2007, Mammalian Genome.

[16]  Matthew D Dean,et al.  Linkage Disequilibrium in Wild Mice , 2007, PLoS genetics.

[17]  Eric Rivals,et al.  Species-wide distribution of highly polymorphic minisatellite markers suggests past and present genetic exchanges among house mouse subspecies , 2007, Genome Biology.

[18]  Martin S. Taylor,et al.  A High-Resolution Single Nucleotide Polymorphism Genetic Map of the Mouse Genome , 2006, PLoS biology.

[19]  Martin S. Taylor,et al.  Genome-wide genetic association of complex traits in heterogeneous stock mice , 2006, Nature Genetics.

[20]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[21]  William Valdar,et al.  Simulating the Collaborative Cross: Power of Quantitative Trait Loci Detection and Mapping Resolution in Large Sets of Recombinant Inbred Strains of Mice , 2006, Genetics.

[22]  Dong Xie,et al.  An integrated system for genetic analysis , 2006, BMC Bioinformatics.

[23]  William Valdar,et al.  A protocol for high-throughput phenotyping, suitable for quantitative trait analysis in mice , 2006, Mammalian Genome.

[24]  E. Fisher,et al.  The origins and uses of mouse outbred stocks , 2005, Nature Genetics.

[25]  G. Abecasis,et al.  A note on exact tests of Hardy-Weinberg equilibrium. , 2005, American journal of human genetics.

[26]  N. Risch,et al.  Estimation of individual admixture: Analytical and study design considerations , 2005, Genetic epidemiology.

[27]  Mark Daly,et al.  Haploview: analysis and visualization of LD and haplotype maps , 2005, Bioinform..

[28]  Nengjun Yi,et al.  The Collaborative Cross, a community resource for the genetic analysis of complex traits , 2004, Nature Genetics.

[29]  Andrew P Morris,et al.  Genetic dissection of a behavioral quantitative trait locus shows that Rgs2 modulates anxiety in mice , 2004, Nature Genetics.

[30]  Janice M. Fullerton,et al.  Unexpected complexity in the haplotypes of commonly used inbred strains of laboratory mice. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[31]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[32]  A. C. Collins,et al.  A method for fine mapping quantitative trait loci in outbred animal stocks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Festing Mf,et al.  Warning: the use of heterogeneous mice may seriously damage your research. , 1999 .

[34]  M. Festing Warning: the use of heterogeneous mice may seriously damage your research. , 1999, Neurobiology of aging.

[35]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[36]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[37]  W. White,et al.  The Development and Maintenance of the Crl:CH!!J(SD)IGS BR Rat Breeding System , 1998 .

[38]  M. Beaumont,et al.  Evaluating loci for use in the genetic analysis of population structure , 1996, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[39]  C. Benoist,et al.  MHC-linked protection from diabetes dissociated from clonal deletion of T cells. , 1990, Science.

[40]  C. Benoist,et al.  Inbred and wild mice carry identical deletions in their E alpha MHC genes. , 1985, The EMBO journal.

[41]  V. E. Williams,et al.  Several mechanisms can account for defective E alpha gene expression in different mouse haplotypes. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Stephen J. O'Brien,et al.  Genetic variance of laboratory outbred Swiss mice , 1980, Nature.

[43]  Michael F. W. Festing,et al.  International Index of Laboratory Animals , 1975, The Lancet.

[44]  Lynch Cj The so-called Swiss mouse. , 1969 .

[45]  C. Lynch The so-called Swiss mouse. , 1969, Laboratory animal care.