The Impact of Population Demography and Selection on the Genetic Architecture of Complex Traits

Population genetic studies have found evidence for dramatic population growth in recent human history. It is unclear how this recent population growth, combined with the effects of negative natural selection, has affected patterns of deleterious variation, as well as the number, frequency, and effect sizes of mutations that contribute risk to complex traits. Because researchers are performing exome sequencing studies aimed at uncovering the role of low-frequency variants in the risk of complex traits, this topic is of critical importance. Here I use simulations under population genetic models where a proportion of the heritability of the trait is accounted for by mutations in a subset of the exome. I show that recent population growth increases the proportion of nonsynonymous variants segregating in the population, but does not affect the genetic load relative to a population that did not expand. Under a model where a mutation's effect on a trait is correlated with its effect on fitness, rare variants explain a greater portion of the additive genetic variance of the trait in a population that has recently expanded than in a population that did not recently expand. Further, when using a single-marker test, for a given false-positive rate and sample size, recent population growth decreases the expected number of significant associations with the trait relative to the number detected in a population that did not expand. However, in a model where there is no correlation between a mutation's effect on fitness and its effect on the trait, common variants account for much of the additive genetic variance, regardless of demography. Moreover, under this model, demography does not affect the number of significant associations detected. These findings suggest recent population history may be an important factor in determining the power of association tests and in accounting for the missing heritability of certain complex traits.
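The contrast between the two effect-size models can be illustrated with a minimal sketch. It assumes the standard expression for the additive genetic variance contributed by a biallelic variant under Hardy-Weinberg proportions, 2p(1-p)a^2, where p is the allele frequency and a is the additive effect on the trait; the abstract does not state which expression the simulations use. The function name, the 1% rare-variant cutoff, and all frequencies and effect sizes below are hypothetical toy values chosen for illustration, not results from the study.

```python
def rare_variant_share(freqs, effects, threshold=0.01):
    """Fraction of additive genetic variance contributed by rare variants.

    Assumes each variant contributes 2 * p * (1 - p) * a**2, with p the
    allele frequency and a the additive effect (an assumption of this
    sketch). A variant is "rare" if its minor allele frequency is below
    `threshold` (a hypothetical cutoff).
    """
    if len(freqs) != len(effects):
        raise ValueError("freqs and effects must have the same length")

    total = 0.0
    rare = 0.0
    for p, a in zip(freqs, effects):
        variance = 2.0 * p * (1.0 - p) * a ** 2
        total += variance
        if min(p, 1.0 - p) < threshold:
            rare += variance

    if total == 0.0:
        raise ValueError("total additive variance is zero")
    return rare / total


# Toy allele frequencies: two rare variants and two common variants.
freqs = [0.001, 0.005, 0.2, 0.4]

# Hypothetical effects when trait effect tracks fitness effect: selection
# keeps large-effect mutations rare, so rare variants carry large effects.
effects_correlated = [2.0, 1.0, 0.1, 0.05]

# Hypothetical effects when trait effect is independent of fitness effect:
# effect size does not depend on frequency.
effects_uncorrelated = [0.5, 0.5, 0.5, 0.5]

share_correlated = rare_variant_share(freqs, effects_correlated)
share_uncorrelated = rare_variant_share(freqs, effects_uncorrelated)

print(f"rare-variant share, correlated model:   {share_correlated:.3f}")
print(f"rare-variant share, uncorrelated model: {share_uncorrelated:.3f}")
```

With these toy inputs, rare variants account for most of the variance only when effect size is tied to frequency, which is the qualitative pattern the abstract describes. The sketch does not model demography or selection; it only shows how a given set of frequencies and effects translates into variance shares.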
