Detecting gene subnetworks under selection in biological pathways

Abstract Advances in high throughput sequencing technologies have created a gap between data production and functional data analysis. Indeed, phenotypes result from interactions between numerous genes, but traditional methods treat loci independently, missing important knowledge brought by network-level emerging properties. Therefore, detecting selection acting on multiple genes affecting the evolution of complex traits remains challenging. In this context, gene network analysis provides a powerful framework to study the evolution of adaptive traits and facilitates the interpretation of genome-wide data. We developed a method to analyse gene networks that is suitable to evidence polygenic selection. The general idea is to search biological pathways for subnetworks of genes that directly interact with each other and that present unusual evolutionary features. Subnetwork search is a typical combinatorial optimization problem that we solve using a simulated annealing approach. We have applied our methodology to find signals of adaptation to high-altitude in human populations. We show that this adaptation has a clear polygenic basis and is influenced by many genetic components. Our approach, implemented in the R package signet, improves on gene-level classical tests for selection by identifying both new candidate genes and new biological processes involved in adaptation to altitude.

[1]  P. Pérez-Piñera,et al.  Pleiotrophin, a multifunctional angiogenic factor: mechanisms and pathways in normal and pathological angiogenesis , 2008, Current opinion in hematology.

[2]  Cristina Mitrea,et al.  Methods and approaches in the topology-based analysis of biological pathways , 2013, Front. Physiol..

[3]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[4]  Fernando Racimo,et al.  Approximation to the distribution of fitness effects across functional categories in human segregating polymorphisms , 2014, bioRxiv.

[5]  A. Bigham Genetics of human origin and evolution: high-altitude adaptations. , 2016, Current opinion in genetics & development.

[6]  David B. Witonsky,et al.  Using Environmental Correlations to Identify Loci Underlying Local Adaptation , 2010, Genetics.

[7]  Joy Bergelson,et al.  References and Notes Supporting Online Material Adaptation to Climate across the Arabidopsis Thaliana Genome , 2022 .

[8]  Thomas Wiehe,et al.  The Effect of Strongly Selected Substitutions on Neutral Polymorphism: Analytical Results Based on Diffusion Theory , 1992 .

[9]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[10]  N L Kaplan,et al.  The "hitchhiking effect" revisited. , 1989, Genetics.

[11]  Josephine T. Daub,et al.  Widespread signals of convergent adaptation to high altitude in Asia and America , 2014, bioRxiv.

[12]  Laurent Excoffier,et al.  Evidence for polygenic adaptation to pathogens in the human genome. , 2013, Molecular biology and evolution.

[13]  Asan,et al.  Altitude adaptation in Tibet caused by introgression of Denisovan-like DNA , 2014, Nature.

[14]  W. Stephan,et al.  Response of Polygenic Traits Under Stabilizing Selection and Mutation When Loci Have Unequal Effects , 2015, G3: Genes, Genomes, Genetics.

[15]  M. King,et al.  Genetic Heterogeneity in Human Disease , 2010, Cell.

[16]  W. Stephan Signatures of positive selection: from selective sweeps at individual loci to subtle allele frequency changes in polygenic adaptation , 2016, Molecular ecology.

[17]  V. Le Corre,et al.  The genetic differentiation at quantitative trait loci under local adaptation , 2012, Molecular ecology.

[18]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[19]  Damian Szklarczyk,et al.  The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible , 2016, Nucleic Acids Res..

[20]  Sujoy Ghosh,et al.  Redundancy control in pathway databases (ReCiPa): an application for improving gene-set enrichment analysis in Omics studies and "Big data" biology. , 2013, Omics : a journal of integrative biology.

[21]  A. Bigham,et al.  Human high-altitude adaptation: forward genetics meets the HIF pathway , 2014, Genes & development.

[22]  H. A. Orr,et al.  Testing natural selection vs. genetic drift in phenotypic evolution using quantitative trait locus data. , 1998, Genetics.

[23]  Gabriele Sales,et al.  graphite - a Bioconductor package to convert pathway topology to gene network , 2012, BMC Bioinformatics.

[24]  Benjamin J. Raphael,et al.  Gene and Network Analysis of Common Variants Reveals Novel Associations in Multiple Complex Diseases , 2016, Genetics.

[25]  M. Fares,et al.  Recent Positive Selection Has Acted on Genes Encoding Proteins with More Interactions within the Whole Human Interactome , 2015, Genome biology and evolution.

[26]  Wei Wang,et al.  Natural selection on EPAS1 (HIF2α) associated with low hemoglobin concentration in Tibetan highlanders , 2010, Proceedings of the National Academy of Sciences.

[27]  C. Beall Two routes to functional adaptation: Tibetan and Andean high-altitude natives , 2007, Proceedings of the National Academy of Sciences.

[28]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[29]  Joaquin Dopazo,et al.  The role of the interactome in the maintenance of deleterious variability in human populations , 2014, Molecular systems biology.

[30]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Alistair I. Mees,et al.  Convergence of an annealing algorithm , 1986, Math. Program..

[32]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[33]  H. P. de Vladar,et al.  Stability and Response of Polygenic Traits to Stabilizing Selection and Mutation , 2014, Genetics.

[34]  Alan M. Moses,et al.  Evidence for widespread adaptive evolution of gene expression in budding yeast , 2010, Proceedings of the National Academy of Sciences.

[35]  Marcelo P. Segura-Lepe,et al.  Rare and low-frequency coding variants alter human adult height , 2016, Nature.

[36]  Hui Zhang,et al.  Genetic variations in Tibetan populations and high-altitude adaptation at the Himalayas. , 2011, Molecular biology and evolution.

[37]  Scott M. Williams,et al.  Shadows of complexity: what biological networks reveal about epistasis and pleiotropy , 2009, BioEssays : news and reviews in molecular, cellular and developmental biology.

[38]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[39]  J. Kutok,et al.  Essential role for Ptpn11 in survival of hematopoietic stem and progenitor cells. , 2011, Blood.

[40]  Ann Marie Craig,et al.  Neurexin–neuroligin signaling in synapse development , 2007, Current Opinion in Neurobiology.

[41]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[42]  H. Bickeböller,et al.  Inclusion of a priori information in genome‐wide association analysis , 2009, Genetic epidemiology.

[43]  John Maynard Smith,et al.  The hitch-hiking effect of a favourable gene. , 1974, Genetical research.

[44]  Sharon R Grossman,et al.  Detecting natural selection in genomic data. , 2013, Annual review of genetics.

[45]  S. Chakraborty,et al.  Positive Selection and Centrality in the Yeast and Fly Protein-Protein Interaction Networks , 2016, BioMed research international.

[46]  W. G. Hill,et al.  Heritability in the genomics era — concepts and misconceptions , 2008, Nature Reviews Genetics.

[47]  W. Stephan,et al.  Inferring positive selection in humans from genomic data , 2015, Investigative Genetics.

[48]  David J. Galas,et al.  RCytoscape: tools for exploratory network analysis , 2013, BMC Bioinformatics.

[49]  F. Hospital,et al.  Selective Sweep at a Quantitative Trait Locus in the Presence of Background Genetic Variation , 2008, Genetics.

[50]  Luca Pagani,et al.  Genetic Signatures Reveal High-Altitude Adaptation in a Set of Ethiopian Populations , 2013, Molecular biology and evolution.

[51]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[52]  J. Gale Hitch-Hiking Effect of a Favourable Gene , 1980 .

[53]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[54]  Kara Dolinski,et al.  The BioGRID interaction database: 2017 update , 2016, Nucleic Acids Res..

[55]  P. Luisi,et al.  Network-level and population genetics analysis of the insulin/TOR signal transduction pathway across human populations. , 2012, Molecular biology and evolution.

[56]  Robert Tibshirani,et al.  Statistical methods for identifying differentially expressed genes in DNA microarrays. , 2003, Methods in molecular biology.

[57]  J. Dopazo,et al.  Discovering the hidden sub-network component in a ranked list of genes or proteins derived from genomic experiments , 2012, Nucleic acids research.

[58]  Jonathan K. Pritchard,et al.  Adaptation – not by sweeps alone , 2010, Nature Reviews Genetics.

[59]  G. Coop,et al.  A Population Genetic Signal of Polygenic Adaptation , 2013, PLoS genetics.

[60]  L. Mascia,et al.  The synaptic proteins neurexins and neuroligins are widely expressed in the vascular system and contribute to its functions , 2009, Proceedings of the National Academy of Sciences.

[61]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[62]  F. Ayala,et al.  Human Adaptations to Diet, Subsistence, and Ecoregion Are Due to Subtle Shifts in Allele Frequency , 2010 .

[63]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[64]  Nourollah Ahmadi,et al.  Detecting selection along environmental gradients: analysis of eight methods and their effectiveness for outbreeding and selfing populations , 2013, Molecular ecology.

[65]  G. Coop,et al.  Robust Identification of Local Adaptation from Allele Frequencies , 2012, Genetics.

[66]  Kyle J. Gaulton,et al.  Detection of human adaptation during the past 2000 years , 2016, Science.

[67]  F. Bussolino,et al.  Neuroligin 1 Induces Blood Vessel Maturation by Cooperating with the α6 Integrin* , 2014, The Journal of Biological Chemistry.

[68]  Rui Mei,et al.  Identifying Signatures of Natural Selection in Tibetan and Andean Populations Using Dense Genome Scan Data , 2010, PLoS genetics.