Selection and explosive growth alter genetic architecture and hamper the detection of causal rare variants

The role of rare alleles in complex phenotypes has been hotly debated, but most rare variant association tests (RVATs) do not account for the evolutionary forces that affect genetic architecture. Here, we use simulation and numerical algorithms to show that explosive population growth, as experienced by human populations, can dramatically increase the impact of very rare alleles on trait variance. We then assess the ability of RVATs to detect causal loci using simulations and human RNA-seq data. Surprisingly, we find that statistical performance is worst for phenotypes in which genetic variance is due mainly to rare alleles, and explosive population growth decreases power. Although many studies have attempted to identify causal rare variants, few have reported novel associations. This has sometimes been interpreted to mean that rare variants make negligible contributions to complex trait heritability. Our work shows that RVATs are not robust to realistic human evolutionary forces, so general conclusions about the impact of rare variants on complex traits may be premature.

[1]  Yun S. Song,et al.  Transition Densities and Sample Frequency Spectra of Diffusion Processes with Selection and Variable Population Size , 2015, Genetics.

[2]  J. Pritchard,et al.  The allelic architecture of human disease genes: common disease-common variant...or not? , 2002, Human molecular genetics.

[3]  P. Visscher,et al.  Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index , 2015, Nature Genetics.

[4]  Ryan D. Hernandez,et al.  Inferring the Joint Demographic History of Multiple Populations from Multidimensional SNP Frequency Data , 2009, PLoS genetics.

[5]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[6]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[7]  Shamil R Sunyaev,et al.  Pooled association tests for rare variants in exon-resequencing studies. , 2010, American journal of human genetics.

[8]  Kathryn Roeder,et al.  Testing for an Unusual Distribution of Rare Variants , 2011, PLoS genetics.

[9]  Anand Bhaskar,et al.  Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data , 2014, bioRxiv.

[10]  Scott T. Weiss,et al.  Ethnic-specific associations of rare and low-frequency DNA sequence variants with asthma , 2015, Nature Communications.

[11]  Anders Albrechtsen,et al.  Natural Selection Affects Multiple Aspects of Genetic Variation at Putatively Neutral Sites across the Human Genome , 2011, PLoS genetics.

[12]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[13]  K. Frazer,et al.  Common vs. rare allele hypotheses for complex diseases. , 2009, Current opinion in genetics & development.

[14]  D. Reich,et al.  Dominance of Deleterious Alleles Controls the Response to a Population Bottleneck , 2015, PLoS genetics.

[15]  Ryan D. Hernandez,et al.  Population Genetics of Rare Variants and Complex Diseases , 2013, Human Heredity.

[16]  Ryan D. Hernandez,et al.  A flexible forward simulator for populations subject to selection and demography , 2008, Bioinform..

[17]  Jonghwan Kim,et al.  Mapping the chromosomal targets of STAT1 by Sequence Tag Analysis of Genomic Enrichment (STAGE). , 2007, Genome research.

[18]  Claudio J. Verzilli,et al.  An Abundance of Rare Functional Variants in 202 Drug Target Genes Sequenced in 14,002 People , 2012, Science.

[19]  S. Gabriel,et al.  Calibrating a coalescent simulation of human genome sequence variation. , 2005, Genome research.

[20]  John S. Witte,et al.  Comprehensive Approach to Analyzing Rare Genetic Variants , 2010, PloS one.

[21]  Molly Przeworski,et al.  How reliable are empirical genomic scans for selective sweeps? , 2006, Genome research.

[22]  Gabor T. Marth,et al.  Demographic history and rare allele sharing among human populations , 2011, Proceedings of the National Academy of Sciences.

[23]  D. Reich,et al.  The contribution of rare variation to prostate cancer heritability , 2015, Nature Genetics.

[24]  Kirk E Lohmueller,et al.  The distribution of deleterious genetic variation in human populations. , 2014, Current opinion in genetics & development.

[25]  E. Zeggini,et al.  An Evaluation of Statistical Approaches to Rare Variant Analysis in Genetic Association Studies , 2009, Genetic epidemiology.

[26]  Ryan D. Hernandez,et al.  Robust Forward Simulations of Recurrent Hitchhiking , 2013, Genetics.

[27]  R. Carroll,et al.  Distribution of allele frequencies and effect sizes and their interrelationships for common genetic susceptibility variants , 2011, Proceedings of the National Academy of Sciences.

[28]  A. Eyre-Walker,et al.  The Distribution of Fitness Effects of New Deleterious Amino Acid Mutations in Humans , 2006, Genetics.

[29]  Ryan D. Hernandez,et al.  Population Genetic Simulations of Complex Phenotypes with Implications for Rare Variant Association Tests , 2015, Genetic epidemiology.

[30]  Xihong Lin,et al.  Optimal tests for rare variant effects in sequencing association studies. , 2012, Biostatistics.

[31]  Ryan D. Hernandez,et al.  Assessing the Evolutionary Impact of Amino Acid Mutations in the Human Genome , 2008, PLoS genetics.

[32]  J. Pritchard,et al.  The deleterious mutation load is insensitive to recent population history , 2013, Nature Genetics.

[33]  D. Reich,et al.  No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans , 2014, Nature Genetics.

[34]  C. Hoggart,et al.  Sequence-Level Population Simulations Over Large Genomic Regions , 2007, Genetics.

[35]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[36]  Transition densities and sample frequency spectra of diffusion processes with selection and variable population size , 2015 .

[37]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[38]  J. Pritchard Are rare variants responsible for susceptibility to complex diseases? , 2001, American journal of human genetics.

[39]  Ilan Gronau,et al.  Genome-wide inference of natural selection on human transcription factor binding sites , 2013, Nature Genetics.

[40]  P. Keightley,et al.  Joint Inference of the Distribution of Fitness Effects of Deleterious Mutations and Population Demography Based on Nucleotide Polymorphism Frequencies , 2007, Genetics.

[41]  Suzanne M. Leal,et al.  A Novel Adaptive Method for the Analysis of Next-Generation Sequencing Data to Detect Complex Trait Associations with Rare Variants Due to Gene Main Effects and Interactions , 2010, PLoS genetics.

[42]  Kevin R. Thornton,et al.  Properties and Modeling of GWAS when Complex Disease Risk Is Due to Non-Complementing, Deleterious Mutations in Genes of Large Effect , 2013, PLoS genetics.

[43]  M. Daly,et al.  Searching for missing heritability: Designing rare variant association studies , 2014, Proceedings of the National Academy of Sciences.

[44]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[45]  Søren Brunak,et al.  Whole-exome sequencing of 2,000 Danish individuals and the role of rare coding variants in type 2 diabetes. , 2013, American journal of human genetics.

[46]  A. Clark,et al.  Population Growth Inflates the Per-Individual Number of Deleterious Mutations and Reduces Their Mean Effect , 2013, Genetics.

[47]  K. Lohmueller The Impact of Population Demography and Selection on the Genetic Architecture of Complex Traits , 2013, PLoS genetics.

[48]  Francisco M. De La Vega,et al.  Genomics for the world , 2011, Nature.

[49]  A. Clark,et al.  Recent Explosive Human Population Growth Has Resulted in an Excess of Rare Genetic Variants , 2012, Science.

[50]  Paul J. Rathouz,et al.  An Evolutionary Framework for Association Testing in Resequencing Studies , 2010, PLoS genetics.

[51]  W. Thilly,et al.  A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). , 2007, Mutation research.

[52]  M. Rieder,et al.  Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. , 2012, American journal of human genetics.

[53]  Kyle J. Gaulton,et al.  The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease , 2015, PLoS genetics.

[54]  Zachary A. Szpiech,et al.  Genome-wide association studies in diverse populations , 2010, Nature Reviews Genetics.

[55]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[56]  Ellen M. Schmidt,et al.  No large-effect low-frequency coding variation found for myocardial infarction. , 2014, Human molecular genetics.

[57]  John Novembre,et al.  Global distribution of genomic diversity underscores rich complex history of continental human populations. , 2009, Genome research.

[58]  E. Lander,et al.  On the allelic spectrum of human disease. , 2001, Trends in genetics : TIG.

[59]  Kathryn Roeder,et al.  Most genetic risk for autism resides with common variation , 2014, Nature Genetics.

[60]  Ryan D. Hernandez,et al.  Evolutionary Processes Acting on Candidate cis-Regulatory Regions in Humans Inferred from Patterns of Polymorphism and Divergence , 2009, PLoS genetics.