Increasing the power of genome wide association studies in natural populations using repeated measures – evaluation and implementation

Genomewide association studies (GWAS) enable detailed dissections of the genetic basis for organisms' ability to adapt to a changing environment. In long‐term studies of natural populations, individuals are often marked at one point in their life and then repeatedly recaptured. It is therefore essential that a method for GWAS includes the process of repeated sampling. In a GWAS, the effects of thousands of single‐nucleotide polymorphisms (SNPs) need to be fitted and any model development is constrained by the computational requirements. A method is therefore required that can fit a highly hierarchical model and at the same time is computationally fast enough to be useful. Our method fits fixed SNP effects in a linear mixed model that can include both random polygenic effects and permanent environmental effects. In this way, the model can correct for population structure and model repeated measures. The covariance structure of the linear mixed model is first estimated and subsequently used in a generalized least squares setting to fit the SNP effects. The method was evaluated in a simulation study based on observed genotypes from a long‐term study of collared flycatchers in Sweden. The method we present here was successful in estimating permanent environmental effects from simulated repeated measures data. Additionally, we found that especially for variable phenotypes having large variation between years, the repeated measurements model has a substantial increase in power compared to a model using average phenotypes as a response. The method is available in the R package RepeatABEL. It increases the power in GWAS having repeated measures, especially for long‐term studies of natural populations, and the R implementation is expected to facilitate modelling of longitudinal data for studies of both animal and human populations.

[1]  Julien G. A. Martin,et al.  Heritability and genetic correlations of personality traits in a wild population of yellow‐bellied marmots (Marmota flaviventris) , 2015, Journal of evolutionary biology.

[2]  A. Qvarnström,et al.  Climate adaptation and speciation: particular focus on reproductive barriers in Ficedula flycatchers , 2015, Evolutionary applications.

[3]  H. Ellegren,et al.  Genome-wide association mapping in a wild avian population identifies a link between genetic and phenotypic variation in a life-history trait , 2015, Proceedings of the Royal Society B: Biological Sciences.

[4]  Thomas Lumley,et al.  Generalized estimating equations for genome‐wide association studies using longitudinal phenotype data , 2015, Statistics in medicine.

[5]  D. Coltman,et al.  Comparing measures of breeding inequality and opportunity for selection with sexual selection on a quantitative character in bighorn rams , 2015, Journal of evolutionary biology.

[6]  Joseph Beyene,et al.  Longitudinal Data Analysis in Genome‐Wide Association Studies , 2014, Genetic epidemiology.

[7]  Pall I. Olason,et al.  Estimation of linkage disequilibrium and interspecific gene flow in Ficedula flycatchers by a newly developed 50k single-nucleotide polymorphism array , 2014, Molecular ecology resources.

[8]  D. de Koning,et al.  Why Breeding Values Estimated Using Familial Data Should Not Be Used for Genome-Wide Association Studies , 2013, G3: Genes, Genomes, Genetics.

[9]  T. Clutton‐Brock,et al.  Life history trade-offs at a single locus maintain sexually selected genetic variation , 2013, Nature.

[10]  J. Slate,et al.  Genomic dissection of variation in clutch size and egg mass in a wild great tit (Parus major) population , 2013, Molecular ecology.

[11]  L. Kruuk,et al.  Reproductive senescence in female Soay sheep: variation across traits and contributions of individual ageing and selective disappearance , 2013 .

[12]  Albert Hofman,et al.  Fast linear mixed model computations for genome‐wide association studies with longitudinal data , 2013, Statistics in medicine.

[13]  Pall I. Olason,et al.  The genomic landscape of species divergence in Ficedula flycatchers , 2012, Nature.

[14]  Eleazar Eskin,et al.  Genome-wide association studies in mice , 2012, Nature Reviews Genetics.

[15]  T. Clutton‐Brock,et al.  Re-mating across years and intralineage polygyny are associated with greater than expected levels of inbreeding in wild red deer , 2012, Journal of evolutionary biology.

[16]  G. S. van Doorn,et al.  Environment-dependent selection on mate choice in a natural population of birds. , 2012, Ecology letters.

[17]  J. Galindo,et al.  Applications of next generation sequencing in molecular ecology of non-model organisms , 2011, Heredity.

[18]  J. Slate,et al.  Genome‐wide association mapping identifies the genetic basis of discrete and quantitative variation in sexual weaponry in a wild sheep population , 2011, Molecular ecology.

[19]  Marcel E. Visser,et al.  Speeding Up Microevolution: The Effects of Increasing Temperature on Selection and Genetic Variance in a Wild Bird Population , 2011, PLoS biology.

[20]  T. Clutton‐Brock,et al.  Individuals and populations: the role of long-term, individual-based studies of animals in ecology and evolutionary biology. , 2010, Trends in ecology & evolution.

[21]  H. Ellegren,et al.  Speciation in Ficedula flycatchers , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[22]  J. Slate,et al.  Genome mapping in intensively studied wild vertebrate populations. , 2010, Trends in genetics : TIG.

[23]  Zachary A. Szpiech,et al.  Genome-wide association studies in diverse populations , 2010, Nature Reviews Genetics.

[24]  Jarrod D. Hadfield,et al.  The Misuse of BLUP in Ecology and Evolution , 2009, The American Naturalist.

[25]  William Valdar,et al.  Mapping in Structured Populations by Resample Model Averaging , 2009, Genetics.

[26]  A. Qvarnström,et al.  Life-history divergence facilitates regional coexistence of competing Ficedula flycatchers. , 2009, Ecology.

[27]  P. VanRaden,et al.  Efficient methods to compute genomic predictions. , 2008, Journal of dairy science.

[28]  L. Gustafsson,et al.  Sex Chromosome-Linked Species Recognition and Evolution of Reproductive Isolation in Flycatchers , 2007, Science.

[29]  C. Haley,et al.  Genomewide Rapid Association Using Mixed Model and Regression: A Fast and Simple Method For Genomewide Pedigree-Based Quantitative Trait Loci Association Analysis , 2007, Genetics.

[30]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[31]  L. Gustafsson,et al.  Testing the genetics underlying the co-evolution of mate choice and ornament in the wild , 2006, Nature.

[32]  E. Postma Implications of the difference between true and predicted breeding values for the study of natural selection and micro‐evolution , 2006, Journal of evolutionary biology.

[33]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[34]  Lars Gustafsson,et al.  Large–scale geographical variation confirms that climate change causes birds to lay earlier , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[35]  Atle Mysterud,et al.  The role of males in the dynamics of ungulate populations , 2002 .

[36]  L. Kruuk,et al.  Cryptic evolution in a wild bird population , 2001, Nature.

[37]  Tomas Pärt,et al.  Adaptive plasticity in mate preference linked to differences in reproductive effort , 2000, Nature.

[38]  Glenn-Peter Stre,et al.  A sexually selected character displacement in flycatchers reinforces premating isolation , 1997, Nature.

[39]  T. Pärt Male philopatry confers a mating advantage in the migratory collared flycatcher, Ficedula albicollis , 1994, Animal Behaviour.

[40]  L. Gustafsson,et al.  Acceleration of senescence in the collared flycatcher Ficedula albicollis by reproductive costs , 1990, Nature.

[41]  L. Gustafsson,et al.  BREEDING DISPERSAL IN THE COLLARED FLYCATCHER (FICEDULA ALBICOLLIS): POSSIBLE CAUSES AND REPRODUCTIVE CONSEQUENCES , 1989 .

[42]  H. Ellegren Genome sequencing and population genomics in non-model organisms. , 2014, Trends in ecology & evolution.

[43]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[44]  Moudud Alam,et al.  hglm: A Package for Fitting Hierarchical Generalized Linear Models , 2010, R J..

[45]  Yurii S. Aulchenko,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm108 Genetics and population analysis GenABEL: an R library for genome-wide association analysis , 2022 .

[46]  Robin Thompson,et al.  ASREML user guide release 1.0 , 2002 .