A mixed-model approach for genome-wide association studies of correlated traits in structured populations

Genome-wide association studies (GWAS) are a standard approach for studying the genetics of natural variation. A major concern in GWAS is the need to account for the complicated dependence structure of the data, both between loci as well as between individuals. Mixed models have emerged as a general and flexible approach for correcting for population structure in GWAS. Here, we extend this linear mixed-model approach to carry out GWAS of correlated phenotypes, deriving a fully parameterized multi-trait mixed model (MTMM) that considers both the within-trait and between-trait variance components simultaneously for multiple traits. We apply this to data from a human cohort for correlated blood lipid traits from the Northern Finland Birth Cohort 1966 and show greatly increased power to detect pleiotropic loci that affect more than one blood lipid trait. We also apply this approach to an Arabidopsis thaliana data set for flowering measurements in two different locations, identifying loci whose effect depends on the environment.

[1]  Robin Thompson,et al.  ASREML user guide release 1.0 , 2002 .

[2]  R. L. Quaas,et al.  Multiple Trait Evaluation Using Relatives' Records , 1976 .

[3]  William Valdar,et al.  Genetic and Environmental Effects on Complex Traits in Mice , 2006, Genetics.

[4]  C. R. Henderson Applications of linear models in animal breeding , 1984 .

[5]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[6]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[7]  Alkes L. Price,et al.  New approaches to population stratification in genome-wide association studies , 2010, Nature Reviews Genetics.

[8]  Joy Bergelson,et al.  Association mapping of local climate-sensitive quantitative trait loci in Arabidopsis thaliana , 2010, Proceedings of the National Academy of Sciences.

[9]  C. Haley,et al.  Multitrait least squares for quantitative trait loci detection. , 2000, Genetics.

[10]  Z B Zeng,et al.  Multiple trait analysis of genetic mapping for quantitative trait loci. , 1995, Genetics.

[11]  Simon C. Potter,et al.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis , 2011, Nature.

[12]  Tanya M. Teslovich,et al.  Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids , 2010, Nature.

[13]  Hong-Wen Deng,et al.  Univariate/Multivariate Genome-Wide Association Scans Using Data from Families and Unrelated Samples , 2009, PloS one.

[14]  P. Visscher,et al.  Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs , 2012, Nature Genetics.

[15]  Manuel A. R. Ferreira,et al.  A multivariate test of association , 2009, Bioinform..

[16]  Alkes L. Price,et al.  Single-Tissue and Cross-Tissue Heritability of Gene Expression Via Identity-by-Descent in Related or Unrelated Individuals , 2011, PLoS genetics.

[17]  Carole Ober,et al.  Gene-environment interactions in human disease: nuisance or opportunity? , 2011, Trends in genetics : TIG.

[18]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[19]  Peter J. Bradbury,et al.  The Genetic Architecture of Maize Flowering Time , 2009, Science.

[20]  Rongcheng Lin,et al.  Arabidopsis FHY3/FAR1 Gene Family and Distinct Roles of Its Members in Light Control of Arabidopsis Development1 , 2004, Plant Physiology.

[21]  J. Cheverud Genetics and analysis of quantitative traits , 1999 .

[22]  P. Visscher,et al.  Estimating missing heritability for disease from genome-wide association studies. , 2011, American journal of human genetics.

[23]  L. Kruglyak,et al.  Gene–Environment Interaction in Yeast Gene Expression , 2008, PLoS biology.

[24]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[25]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[26]  C. Hoggart,et al.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population , 2008, Nature Genetics.

[27]  Stephan Ripke,et al.  Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs , 2012, Nature Genetics.

[28]  M. Nordborg,et al.  Conditions Under Which Genome-Wide Association Studies Will be Positively Misleading , 2010, Genetics.

[29]  Ian J. Deary,et al.  Genetic contributions to stability and change in intelligence from childhood to old age , 2012, Nature.

[30]  Peter J. Bradbury,et al.  Genome-wide association study of leaf architecture in the maize nested association mapping population , 2011, Nature Genetics.

[31]  Josée Dupuis,et al.  Meta‐analysis of gene‐environment interaction: joint estimation of SNP and SNP × environment regression coefficients , 2011, Genetic epidemiology.

[32]  Beate Ritz,et al.  Genome-Wide Gene-Environment Study Identifies Glutamate Receptor Gene GRIN2A as a Parkinson's Disease Modifier Gene via Interaction with Coffee , 2011, PLoS genetics.

[33]  A. Auton,et al.  Genome-wide patterns of genetic variation in worldwide Arabidopsis thaliana accessions from the RegMap panel , 2011, Nature Genetics.

[34]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[35]  Qifa Zhang,et al.  Genome-wide association studies of 14 agronomic traits in rice landraces , 2010, Nature Genetics.

[36]  Keyan Zhao,et al.  An Arabidopsis Example of Association Mapping in Structured Samples , 2006, PLoS genetics.

[37]  B. Hayes,et al.  Genome-wide association mapping in Norwegian Red cattle identifies quantitative trait loci for fertility and milk production on BTA12. , 2011, Animal genetics.

[38]  E. Xing,et al.  Statistical Estimation of Correlated Genome Associations to a Quantitative Trait Network , 2009, PLoS genetics.

[39]  Russell D. Wolfinger,et al.  Geographical Genomics of Human Leukocyte Gene Expression Variation in Southern Morocco , 2009, Nature Genetics.

[40]  Bjarni J. Vilhjálmsson,et al.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines , 2010 .

[41]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[42]  D. Thomas,et al.  Gene–environment-wide association studies: emerging approaches , 2010, Nature Reviews Genetics.

[43]  H. Piepho,et al.  Multi-trait association mapping in sugar beet (Beta vulgaris L.) , 2008, Theoretical and Applied Genetics.

[44]  R. D'Agostino,et al.  A genome-wide association study for blood lipid phenotypes in the Framingham Heart Study , 2007, BMC Medical Genetics.

[45]  O. Kempthorne The correlation between relatives on the supposition of mendelian inheritance , 1968 .