Inference of Evolutionary Forces Acting on Human Biological Pathways

Because natural selection is likely to act on multiple genes underlying a given phenotypic trait, we study here the potential effect of ongoing and past selection on the genetic diversity of human biological pathways. We first show that genes included in gene sets are generally under stronger selective constraints than other genes and that their evolutionary response is correlated. We then introduce a new procedure to detect selection at the pathway level based on a decomposition of the classical McDonald–Kreitman test extended to multiple genes. This new test, called 2DNS, detects outlier gene sets and takes into account past demographic effects and evolutionary constraints specific to gene sets. Selective forces acting on gene sets can be easily identified by a mere visual inspection of the position of the gene sets relative to their two-dimensional null distribution. We thus find several outlier gene sets that show signals of positive, balancing, or purifying selection but also others showing an ancient relaxation of selective constraints. The principle of the 2DNS test can also be applied to other genomic contrasts. For instance, the comparison of patterns of polymorphisms private to African and non-African populations reveals that most pathways show a higher proportion of nonsynonymous mutations in non-Africans than in Africans, potentially due to different demographic histories and selective pressures.

[1]  Ryan D. Hernandez,et al.  Natural selection on protein-coding genes in the human genome , 2005, Nature.

[2]  Scott M. Williams,et al.  Signatures of natural selection on genetic variants affecting complex human traits , 2013, Applied & translational genomics.

[3]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[4]  Ann M Demogines,et al.  Rapid evolution of BRCA1 and BRCA2 in humans and other primates , 2014, BMC Evolutionary Biology.

[5]  M. Kirkpatrick,et al.  On the accumulation of deleterious mutations during range expansions. , 2013, Molecular ecology.

[6]  A. R. Wagner Molecular Biology and Evolution , 2001 .

[7]  J. Shendure,et al.  Primate evolution of the recombination regulator PRDM9 , 2014, Nature Communications.

[8]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[9]  Steven J Mack,et al.  Balancing selection and heterogeneity across the classical human leukocyte antigen loci: a meta-analytic review of 497 population studies. , 2008, Human immunology.

[10]  Adam Eyre-Walker,et al.  Adaptive protein evolution in Drosophila , 2002, Nature.

[11]  Philipp W. Messer,et al.  Genome-wide signals of positive selection in human evolution , 2014, Genome research.

[12]  W. J. Kent,et al.  The UCSC Genome Browser , 2003, Current protocols in bioinformatics.

[13]  Ryan D. Hernandez,et al.  Proportionally more deleterious genetic variation in European than in African populations , 2008, Nature.

[14]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[15]  Henning Hermjakob,et al.  The Reactome pathway Knowledgebase , 2015, Nucleic acids research.

[16]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[17]  S. Subramanian The Abundance of Deleterious Polymorphisms in Humans , 2012, Genetics.

[18]  Daniel R. Zerbino,et al.  Ensembl 2014 , 2013, Nucleic Acids Res..

[19]  Laurent Excoffier,et al.  Evidence for polygenic adaptation to pathogens in the human genome. , 2013, Molecular biology and evolution.

[20]  Renata C. Geer,et al.  The NCBI BioSystems database , 2009, Nucleic Acids Res..

[21]  G. Coop,et al.  A Population Genetic Signal of Polygenic Adaptation , 2013, PLoS genetics.

[22]  Mariann Bienz,et al.  β-Catenin: A Pivot between Cell Adhesion and Wnt Signalling , 2005, Current Biology.

[23]  J. Hayes,et al.  Glutathione S-Transferase Polymorphisms and Their Biological Consequences , 2000, Pharmacology.

[24]  Susumu Goto,et al.  Data, information, knowledge and principle: back to metabolism in KEGG , 2013, Nucleic Acids Res..

[25]  Philipp W. Messer,et al.  Frequent adaptation and the McDonald–Kreitman test , 2012, Proceedings of the National Academy of Sciences.

[26]  R. Greenberg Biometry , 1969, The Yale Journal of Biology and Medicine.

[27]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[28]  Robert B. Hartlage,et al.  This PDF file includes: Materials and Methods , 2009 .

[29]  Yasuhiro Go,et al.  Similar numbers but different repertoires of olfactory receptor genes in humans and chimpanzees. , 2008, Molecular biology and evolution.

[30]  S. Pääbo,et al.  Loss of Olfactory Receptor Genes Coincides with the Acquisition of Full Trichromatic Vision in Primates , 2004, PLoS biology.

[31]  R. Nielsen Molecular signatures of natural selection. , 2005, Annual review of genetics.

[32]  M. Kreitman,et al.  Adaptive protein evolution at the Adh locus in Drosophila , 1991, Nature.

[33]  A. Townsend,et al.  Hepcidin regulation by innate immune and infectious stimuli. , 2011, Blood.

[34]  Alfonso Valencia,et al.  Protein co-evolution, co-adaptation and interactions , 2008, The EMBO journal.

[35]  Liqing Zhang,et al.  Human SNPs reveal no evidence of frequent positive selection. , 2005, Molecular biology and evolution.

[36]  J. Parsch,et al.  The influence of demography and weak selection on the McDonald-Kreitman test: an empirical study in Drosophila. , 2008, Molecular biology and evolution.

[37]  Wen-Hsiung Li,et al.  Mammalian housekeeping genes evolve more slowly than tissue-specific genes. , 2004, Molecular biology and evolution.

[38]  F. Baudat,et al.  Meiotic recombination in mammals: localization and regulation , 2013, Nature Reviews Genetics.

[39]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[40]  H. Innan Modified Hudson–Kreitman–Aguadé Test and Two-Dimensional Evaluation of Neutrality Tests , 2006, Genetics.

[41]  Carlos Bustamante,et al.  Genomic scans for selective sweeps using SNP data. , 2005, Genome research.

[42]  Adam Eyre-Walker,et al.  Changing effective population size and the McDonald-Kreitman test. , 2002, Genetics.

[43]  C. Ponting,et al.  Signatures of adaptive evolution within human non-coding sequence. , 2006, Human molecular genetics.

[44]  P. Keightley,et al.  Estimating the rate of adaptive molecular evolution in the presence of slightly deleterious mutations and population size change. , 2009, Molecular biology and evolution.

[45]  Monica A. Giovanni,et al.  Adenomatous polyposis coli plays a key role, in vivo, in coordinating assembly of the neuronal nicotinic postsynaptic complex , 2008, Molecular and Cellular Neuroscience.

[46]  Gregory E. Jordan,et al.  A Scan for Human-Specific Relaxation of Negative Selection Reveals Unexpected Polymorphism in Proteasome Genes , 2013, Molecular biology and evolution.

[47]  A. E. Hirsh,et al.  Evolutionary Rate in the Protein Interaction Network , 2002, Science.

[48]  Charlotte M. Deane,et al.  Protein protein interactions, evolutionary rate, abundance and age , 2006, BMC Bioinformatics.

[49]  Joseph K. Pickrell,et al.  The Genetics of Human Adaptation: Hard Sweeps, Soft Sweeps, and Polygenic Adaptation , 2010, Current Biology.

[50]  Hunter B. Fraser,et al.  Gene expression drives local adaptation in humans , 2013, Genome research.

[51]  Ryan D. Hernandez,et al.  Classic Selective Sweeps Were Rare in Recent Human Evolution , 2011, Science.

[52]  J. Mattick,et al.  Genome research , 1990, Nature.

[53]  Justin C. Fay,et al.  Testing the neutral theory of molecular evolution with genomic data from Drosophila , 2002, Nature.

[54]  C. Ponting,et al.  Accelerated Evolution of the Prdm9 Speciation Gene across Diverse Metazoan Taxa , 2009, PLoS genetics.

[55]  B. Efron Bayes' Theorem in the 21st Century , 2013, Science.

[56]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Desmond G. Higgins,et al.  Loss of Olfactory Receptor Function in Hominin Evolution , 2014, PloS one.

[58]  S. Wuchty Evolution and topology in the yeast protein interaction network. , 2004, Genome research.

[59]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[60]  Chris T. A. Evelo,et al.  WikiPathways: building research communities on biological pathways , 2011, Nucleic Acids Res..

[61]  B. Stranger,et al.  Progress and Promise of Genome-Wide Association Studies for Human Complex Trait Genetics , 2011, Genetics.

[62]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[63]  Gustavo Glusman,et al.  A comparison of the human and chimpanzee olfactory receptor gene repertoires. , 2005, Genome research.

[64]  Rappold,et al.  Human Molecular Genetics , 1996, Nature Medicine.

[65]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..