Increased power of mixed models facilitates association mapping of 10 loci for metabolic traits in an isolated population.

The potential benefits of using population isolates in genetic mapping, such as reduced genetic, phenotypic and environmental heterogeneity, are offset by the challenges posed by the large amounts of direct and cryptic relatedness in these populations confounding basic assumptions of independence. We have evaluated four representative specialized methods for association testing in the presence of relatedness; (i) within-family (ii) within- and between-family and (iii) mixed-models methods, using simulated traits for 2906 subjects with known genome-wide genotype data from an extremely isolated population, the Island of Kosrae, Federated States of Micronesia. We report that mixed models optimally extract association information from such samples, demonstrating 88% power to rank the true variant as among the top 10 genome-wide with 56% achieving genome-wide significance, a >80% improvement over the other methods, and demonstrate that population isolates have similar power to non-isolate populations for observing variants of known effects. We then used the mixed-model method to reanalyze data for 17 published phenotypes relating to metabolic traits and electrocardiographic measures, along with another 8 previously unreported. We replicate nine genome-wide significant associations with known loci of plasma cholesterol, high-density lipoprotein, low-density lipoprotein, triglycerides, thyroid stimulating hormone, homocysteine, C-reactive protein and uric acid, with only one detected in the previous analysis of the same traits. Further, we leveraged shared identity-by-descent genetic segments in the region of the uric acid locus to fine-map the signal, refining the known locus by a factor of 4. Finally, we report a novel associations for height (rs17629022, P< 2.1 × 10⁻⁸).

[1]  L. Cardon,et al.  Linkage disequilibrium in young genetically isolated Dutch population , 2004, European Journal of Human Genetics.

[2]  Arthur S Slutsky,et al.  Genetics of asthma: University of Toronto , 1995 .

[3]  K. Frazer,et al.  Human genetic variation and its contribution to complex traits , 2009, Nature Reviews Genetics.

[4]  D. Strachan,et al.  LDL-cholesterol concentrations: a genome-wide association study , 2008, The Lancet.

[5]  J. Haines,et al.  Maternal lineages and Alzheimer disease risk in the Old Order Amish , 2005, Human Genetics.

[6]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[7]  Christian Gieger,et al.  Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts , 2009, Nature Genetics.

[8]  A. Kong,et al.  The role of linkage studies for common diseases. , 2001, Current opinion in genetics & development.

[9]  M. Jarvelin,et al.  A Common Variant in the FTO Gene Is Associated with Body Mass Index and Predisposes to Childhood and Adult Obesity , 2007, Science.

[10]  R. Collins,et al.  Newly identified loci that influence lipid concentrations and risk of coronary artery disease , 2008, Nature Genetics.

[11]  Shahrul Mt-Isa,et al.  Genetic Loci associated with C-reactive protein levels and risk of coronary heart disease. , 2009, JAMA.

[12]  Subhajyoti De,et al.  Common variants near MC4R are associated with fat mass, weight and risk of obesity , 2008, Nature Genetics.

[13]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[14]  M. Daly,et al.  Use of a genetic isolate to identify rare disease variants: C7 on 5p associated with MS. , 2009, Human molecular genetics.

[15]  G. Abecasis,et al.  A general test of association for quantitative traits in nuclear families. , 2000, American journal of human genetics.

[16]  Benjamin M. Neale,et al.  Genome-Wide Association Studies in an Isolated Founder Population from the Pacific Island of Kosrae , 2009, PLoS genetics.

[17]  M. Muenke,et al.  Genetics of population isolates , 2002, Clinical genetics.

[18]  R. Collins,et al.  Novel Associations of CPS1, MUT, NOX4, and DPEP1 With Plasma Homocysteine in a Healthy Population: A Genome-Wide Evaluation of 13 974 Participants in the Women’s Genome Health Study , 2009, Circulation. Cardiovascular genetics.

[19]  W. Bodmer,et al.  Common and rare variants in multifactorial susceptibility to common diseases , 2008, Nature Genetics.

[20]  J K Hewitt,et al.  Combined linkage and association sib-pair analysis for quantitative traits. , 1999, American journal of human genetics.

[21]  Jasper Rine,et al.  The prevalence of folate-remedial MTHFR enzyme variants in humans , 2008, Proceedings of the National Academy of Sciences.

[22]  A. Hofman,et al.  Association of three genetic loci with uric acid concentration and risk of gout: a genome-wide association study , 2008, The Lancet.

[23]  M S McPeek,et al.  Estimation of variance components of quantitative traits in inbred populations. , 2000, American journal of human genetics.

[24]  Christoph Lange,et al.  A Family-Based Association Test for Repeatedly Measured Quantitative Traits Adjusting for Unknown Environmental and/or Polygenic Effects , 2004, Statistical applications in genetics and molecular biology.

[25]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[26]  Zhaoxia Yu,et al.  Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies. , 2009, American journal of human genetics.

[27]  M. Daly,et al.  Genome-wide association study of electrocardiographic conduction measures in an isolated founder population: Kosrae. , 2009, Heart rhythm.

[28]  Scott T. Weiss,et al.  On the Analysis of Genome-Wide Association Studies in Family-Based Designs: A Universal, Robust Analysis Approach and an Application to Four Genome-Wide Association Studies , 2009, PLoS genetics.

[29]  N. Freimer,et al.  An approach to investigating linkage for bipolar disorder using large Costa Rican pedigrees. , 1996, American journal of medical genetics.

[30]  Jonathan C. Cohen,et al.  A spectrum of PCSK9 alleles contributes to plasma levels of low-density lipoprotein cholesterol. , 2006, American journal of human genetics.

[31]  Bruce Winney,et al.  Multiple rare variants in different genes account for multifactorial inherited susceptibility to colorectal adenomas. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[32]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[33]  A. Kong,et al.  The genealogic approach to human genetics of disease. , 2001, Cancer journal.

[34]  C. Gieger,et al.  Identification of ten loci associated with height highlights new biological pathways in human growth , 2008, Nature Genetics.

[35]  David M. Evans,et al.  Genome-wide association analysis identifies 20 loci that influence adult height , 2008, Nature Genetics.

[36]  E. Petretto,et al.  Not all isolates are equal: linkage disequilibrium analysis on Xq13.3 reveals different patterns in Sardinian sub-populations , 2002, Human Genetics.

[37]  L. Almasy,et al.  Multipoint quantitative-trait linkage analysis in general pedigrees. , 1998, American journal of human genetics.

[38]  Jonathan C. Cohen,et al.  Multiple Rare Alleles Contribute to Low Plasma Levels of HDL Cholesterol , 2004, Science.

[39]  M S McPeek,et al.  The genetic dissection of complex traits in a founder population. , 2001, American journal of human genetics.

[40]  R. Collins,et al.  Common variants at 30 loci contribute to polygenic dyslipidemia , 2009, Nature Genetics.

[41]  C. Haley,et al.  Genomewide Rapid Association Using Mixed Model and Regression: A Fast and Simple Method For Genomewide Pedigree-Based Quantitative Trait Loci Association Analysis , 2007, Genetics.

[42]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[43]  E. Génin,et al.  Complex trait mapping in isolated populations: Are specific statistical methods required? , 2005, European Journal of Human Genetics.

[44]  A S Slutsky,et al.  Asthma on Tristan da Cunha: looking for the genetic link. The University of Toronto Genetics of Asthma Research Group. , 1996, American journal of respiratory and critical care medicine.

[45]  Christian Gieger,et al.  Meta-Analysis of 28,141 Individuals Identifies Common Variants within Five New Loci That Influence Uric Acid Concentrations , 2009, PLoS genetics.

[46]  J. O’Connell,et al.  A Null Mutation in Human APOC3 Confers a Favorable Plasma Lipid Profile and Apparent Cardioprotection , 2008, Science.

[47]  J. Hirschhorn,et al.  Progress in Genome-Wide Association Studies of Human Height , 2009, Hormone Research in Paediatrics.

[48]  A. Taniguchi,et al.  Control of renal uric acid excretion and gout , 2008, Current opinion in rheumatology.

[49]  G. Abecasis,et al.  Association Analysis in a Variance Components Framework , 2001, Genetic epidemiology.

[50]  Kenneth Lange,et al.  Use of population isolates for mapping complex traits , 2000, Nature Reviews Genetics.

[51]  Sanjiv J. Shah,et al.  Whole-genome association study identifies STK39 as a hypertension susceptibility gene , 2009, Proceedings of the National Academy of Sciences.

[52]  Yurii S. Aulchenko,et al.  A Genomic Background Based Method for Association Analysis in Related Individuals , 2007, PloS one.

[53]  M. McPeek,et al.  Are common disease susceptibility alleles the same in outbred and founder populations? , 2004, European Journal of Human Genetics.

[54]  C. Hoggart,et al.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population , 2008, Nature Genetics.

[55]  J. Gulcher,et al.  Localization of a gene for peripheral arterial occlusive disease to chromosome 1p31. , 2002, American journal of human genetics.

[56]  P. Heutink,et al.  Gene finding in genetically isolated populations. , 2002, Human molecular genetics.

[57]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[58]  Y. Kanai,et al.  Human organic anion transporter 4 is a renal apical organic anion/dicarboxylate exchanger in the proximal tubules. , 2004, Journal of pharmacological sciences.

[59]  Eric Boerwinkle,et al.  Population-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDL , 2007, Nature Genetics.

[60]  Alexander Gusev,et al.  Whole population, genome-wide mapping of hidden relatedness. , 2009, Genome research.

[61]  Alexander Gusev,et al.  Systematic haplotype analysis resolves a complex plasma plant sterol locus on the Micronesian Island of Kosrae , 2009, Proceedings of the National Academy of Sciences.

[62]  Itsik Pe'er,et al.  Evaluating potential for whole-genome studies in Kosrae, an isolated population in Micronesia , 2006, Nature Genetics.

[63]  N. Cook,et al.  Loci related to metabolic-syndrome pathways including LEPR,HNF1A, IL6R, and GCKR associate with plasma C-reactive protein: the Women's Genome Health Study. , 2008, American journal of human genetics.

[64]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[65]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[66]  B. Han,et al.  Identification of 15 loci influencing height in a Korean population , 2010, Journal of Human Genetics.

[67]  Kari Stefansson,et al.  Common variants on 9q22.33 and 14q13.3 predispose to thyroid cancer in European populations , 2009, Nature Genetics.

[68]  Dolores Corella,et al.  Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans , 2008, Nature Genetics.

[69]  J. Terwilliger,et al.  A susceptibility locus for human systemic lupus erythematosus (hSLE1) on chromosome 2q. , 2000, Journal of autoimmunity.

[70]  B. Browning,et al.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. , 2009, American journal of human genetics.

[71]  T Varilo,et al.  Molecular genetics of the Finnish disease heritage. , 1999, Human molecular genetics.

[72]  Bjarni V. Halldórsson,et al.  Many sequence variants affecting diversity of adult human height , 2008, Nature Genetics.