Detecting correlation between allele frequencies and environmental variables as a signature of selection. A fast computational approach for genome-wide studies

Abstract Genomic regions (or loci) displaying outstanding correlation with some environmental variables are likely to be under selection and this is the rationale of recent methods of identifying selected loci and retrieving functional information about them. To be efficient, such methods need to be able to disentangle the potential effect of environmental variables from the confounding effect of population history. For the routine analysis of genome-wide datasets, one also needs fast inference and model selection algorithms. We propose a method based on an explicit spatial model which is an instance of spatial generalized linear mixed model (SGLMM). For inference, we make use of the INLA–SPDE theoretical and computational framework developed by Rue et al. (2009) and Lindgren et al. (2011) . The method we propose allows one to quantify the correlation between genotypes and environmental variables. It works for the most common types of genetic markers, obtained either at the individual or at the population level. Analyzing the simulated data produced under a geostatistical model then under an explicit model of selection, we show that the method is efficient. We also re-analyze a dataset relative to nineteen pine weevils ( Hylobius abietis ) populations across Europe. The method proposed appears also as a statistically sound alternative to the Mantel tests for testing the association between the genetic and environmental variables.

[1]  P Taberlet,et al.  Genetic structure of the forest pest Hylobius abietis on conifer plantations at different spatial scales in Europe , 2006, Heredity.

[2]  Zhe Jiang,et al.  Spatial Statistics , 2013 .

[3]  Nourollah Ahmadi,et al.  Detecting selection along environmental gradients: analysis of eight methods and their effectiveness for outbreeding and selfing populations , 2013, Molecular ecology.

[4]  Gilles Guillot,et al.  Dismantling the Mantel tests , 2011, 1112.0651.

[5]  Steven G. Schroeder,et al.  A Large-Scale Screen for Artificial Selection in Maize Identifies Candidate Agronomic Loci for Domestication and Crop Improvementw⃞ , 2005, The Plant Cell Online.

[6]  David B. Witonsky,et al.  Using Environmental Correlations to Identify Loci Underlying Local Adaptation , 2010, Genetics.

[7]  J. Chilès,et al.  Geostatistics: Modeling Spatial Uncertainty , 1999 .

[8]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[9]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach , 2011 .

[10]  M. Blaxter,et al.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing , 2011, Nature Reviews Genetics.

[11]  P Taberlet,et al.  A spatial analysis method (SAM) to detect candidate loci for selection: towards a landscape genomics approach to adaptation , 2007, Molecular ecology.

[12]  Mathieu Gautier,et al.  The Genome Response to Artificial Selection: A Case Study in Dairy Cattle , 2009, PloS one.

[13]  R. Nielsen Molecular signatures of natural selection. , 2005, Annual review of genetics.

[14]  Isabelle Olivieri,et al.  Monitoring adaptive genetic responses to environmental change , 2012, Molecular ecology.

[15]  Stéphane Dupas,et al.  SimAdapt: an individual‐based genetic model for simulating landscape management impacts on populations , 2013 .