Genome‐Wide Association Mapping With Longitudinal Data

Many genome‐wide association studies have been performed on population cohorts that contain phenotype measurements at multiple time points. However, standard association methodologies only consider one time point. In this paper, we propose a mixed‐model‐based approach for performing association mapping which utilizes multiple phenotype measurements for each individual. We introduce an analytical approach to calculate statistical power and show that this model leads to increased power when compared to traditional approaches. Moreover, we show that by using this model we are able to differentiate the genetic, environmental, and residual error contributions to the phenotype. Using predictions of these components, we show how the proportion of the phenotype due to environment and genetics can be predicted and show that the ranking of individuals based on these predictions is very accurate. The software implementing this method may be found at http://genetics.cs.ucla.edu/longGWAS/. Genet. Epidemiol. 36:463‐471, 2012. © 2012 Wiley Periodicals, Inc.

[1]  Christian Gieger,et al.  Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts , 2009, Nature Genetics.

[2]  D. Harville Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems , 1977 .

[3]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[4]  R. D'Agostino,et al.  A genome-wide association study for blood lipid phenotypes in the Framingham Heart Study , 2007, BMC Medical Genetics.

[5]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[6]  C. R. Henderson SIRE EVALUATION AND GENETIC TRENDS , 1973 .

[7]  C. Hoggart,et al.  Genome-wide association analysis of metabolic traits in a birth cohort from a founder population , 2008, Nature Genetics.

[8]  Raphael Mrode,et al.  Linear models for the prediction of animal breeding values , 1996 .

[9]  J. Pritchard,et al.  Confounding from Cryptic Relatedness in Case-Control Association Studies , 2005, PLoS genetics.

[10]  J Blangero,et al.  Power of variance component linkage analysis to detect quantitative trait loci. , 1999, Annals of human genetics.

[11]  Iuliana Ionita-Laza,et al.  Genomewide weighted hypothesis testing in family-based association studies, with an application to a 100K scan. , 2007, American journal of human genetics.

[12]  R. Lyngsoe G. Hellenthal,et al.  Genome-wide association analysis , 2007 .

[13]  Ying Liu,et al.  FaST linear mixed models for genome-wide association studies , 2011, Nature Methods.

[14]  A. Rao,et al.  Estimation of Genetic Parameters: principles , 2003 .

[15]  David Heckerman,et al.  Correction for hidden confounders in the genetic analysis of gene expression , 2010, Proceedings of the National Academy of Sciences.

[16]  K. Roeder,et al.  Unbiased methods for population‐based association studies , 2001, Genetic epidemiology.

[17]  G. Siest,et al.  Extension of variance components approach to incorporate temporal trends and longitudinal pedigree data analysis , 2002, Genetic epidemiology.

[18]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[19]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[20]  D. Schaid Mathematical and Statistical Methods for Genetic Analysis , 1999 .

[21]  D. Balding A tutorial on statistical methods for population association studies , 2006, Nature Reviews Genetics.

[22]  Eleazar Eskin,et al.  A high-resolution association mapping panel for the dissection of complex traits in mice. , 2010, Genome research.

[23]  Yusuke Nakamura,et al.  Genome-wide association study of hematological and biochemical traits in a Japanese population , 2010, Nature Genetics.

[24]  Piter Bijma,et al.  Estimating Relatedness Between Individuals in General Populations With a Focus on Their Use in Conservation Programs , 2006, Genetics.

[25]  Mark Von Tress,et al.  Generalized, Linear, and Mixed Models , 2003, Technometrics.

[26]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[27]  Jason H. Moore,et al.  Missing heritability and strategies for finding the underlying causes of complex disease , 2010, Nature Reviews Genetics.

[28]  R. Jennrich,et al.  Unbalanced repeated-measures models with structured covariance matrices. , 1986, Biometrics.

[29]  C Cannings,et al.  Mathematical and Statistical Methods for Genetic Analysis (2nd ed) , 2004, Heredity.

[30]  S. Gabriel,et al.  Efficiency and power in genetic association studies , 2005, Nature Genetics.

[31]  Heping Zhang,et al.  Statistical Inference in Mixed Models and Analysis of Twin and Family Data , 2011, Biometrics.

[32]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.