Patterns of Positive Selection in Six Mammalian Genomes

Genome-wide scans for positively selected genes (PSGs) in mammals have provided insight into the dynamics of genome evolution, the genetic basis of differences between species, and the functions of individual genes. However, previous scans have been limited in power and accuracy owing to small numbers of available genomes. Here we present the most comprehensive examination of mammalian PSGs to date, using the six high-coverage genome assemblies now available for eutherian mammals. The increased phylogenetic depth of this dataset results in substantially improved statistical power, and permits several new lineage- and clade-specific tests to be applied. Of ∼16,500 human genes with high-confidence orthologs in at least two other species, 400 genes showed significant evidence of positive selection (FDR<0.05), according to a standard likelihood ratio test. An additional 144 genes showed evidence of positive selection on particular lineages or clades. As in previous studies, the identified PSGs were enriched for roles in defense/immunity, chemosensory perception, and reproduction, but enrichments were also evident for more specific functions, such as complement-mediated immunity and taste perception. Several pathways were strongly enriched for PSGs, suggesting possible co-evolution of interacting genes. A novel Bayesian analysis of the possible “selection histories” of each gene indicated that most PSGs have switched multiple times between positive selection and nonselection, suggesting that positive selection is often episodic. A detailed analysis of Affymetrix exon array data indicated that PSGs are expressed at significantly lower levels, and in a more tissue-specific manner, than non-PSGs. Genes that are specifically expressed in the spleen, testes, liver, and breast are significantly enriched for PSGs, but no evidence was found for an enrichment for PSGs among brain-specific genes. This study provides additional evidence for widespread positive selection in mammalian evolution and new genome-wide insights into the functional implications of positive selection.

[1]  D. Wildman,et al.  Distinct genomic signatures of adaptation in pre- and postnatal environments during human evolution , 2008, Proceedings of the National Academy of Sciences.

[2]  Alex Wong,et al.  Evolution of protein-coding genes in Drosophila. , 2008, Trends in genetics : TIG.

[3]  Timothy B Sackton,et al.  Mutations in smooth muscle α-actin (ACTA2) lead to thoracic aortic aneurysms and dissections , 2007, Nature Genetics.

[4]  A. Levine,et al.  p53 regulates maternal reproduction through LIF , 2007, Nature.

[5]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[6]  A. Clark,et al.  Recent and ongoing selection in the human genome , 2007, Nature Reviews Genetics.

[7]  Colin N. Dewey,et al.  Population Genomics: Whole-Genome Analysis of Polymorphism and Divergence in Drosophila simulans , 2007, PLoS biology.

[8]  Fernando A. Villanea,et al.  Diet and the evolution of human amylase gene copy number variation , 2007, Nature Genetics.

[9]  Eric Gouaux,et al.  Structure of acid-sensing ion channel 1 at 1.9 A resolution and low pH. , 2007, Nature.

[10]  Olivier Fedrigo,et al.  Promoter regions of many neural- and nutrition-related genes have experienced positive selection during human evolution , 2007, Nature Genetics.

[11]  Hirohisa Kishino,et al.  Population genetics without intraspecific data. , 2007, Molecular biology and evolution.

[12]  Su Yeon Kim,et al.  Adaptive Evolution of Conserved Noncoding Elements in Mammals , 2007, PLoS genetics.

[13]  David L. Robertson,et al.  Specificity in protein interactions and its relationship with sequence diversity and coevolution , 2007, Proceedings of the National Academy of Sciences.

[14]  Jianzhi Zhang,et al.  More genes underwent positive selection in chimpanzee evolution than in human evolution , 2007, Proceedings of the National Academy of Sciences.

[15]  Maria Anisimova,et al.  Multiple hypothesis testing to detect lineages under positive selection that affects only a few sites. , 2007, Molecular biology and evolution.

[16]  Carlos D Bustamante,et al.  Localizing Recent Adaptive Evolution in the Human Genome , 2007, PLoS genetics.

[17]  Ryan D. Hernandez,et al.  Demographic Histories and Patterns of Linkage Disequilibrium in Chinese and Indian Rhesus Macaques , 2007, Science.

[18]  David N. Messina,et al.  Evolutionary and Biomedical Insights from the Rhesus Macaque Genome , 2007, Science.

[19]  J. Casasnovas,et al.  Structures of T Cell Immunoglobulin Mucin Receptors 1 and 2 Reveal Mechanisms for Regulation of Immune Responses by the TIM Receptor Family , 2007, Immunity.

[20]  G. Freeman,et al.  Immunoglobulin A (IgA) Is a Natural Ligand of Hepatitis A Virus Cellular Receptor 1 (HAVCR1), and the Association of IgA with HAVCR1 Enhances Virus-Receptor Interactions , 2007, Journal of Virology.

[21]  M. Laan,et al.  The evolution and genomic landscape of CGB1 and CGB2 genes , 2007, Molecular and Cellular Endocrinology.

[22]  S. Sugano,et al.  Rate of Evolution in Brain-Expressed Genes in Humans and Other Primates , 2006, PLoS biology.

[23]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[24]  W. Stephan,et al.  Pervasive adaptive evolution among interactors of the Drosophila hybrid inviability gene, Nup96. , 2007, Molecular biology and evolution.

[25]  Yi Xing,et al.  Exon arrays provide accurate assessments of gene expression , 2007, Genome Biology.

[26]  Jianzhi Zhang,et al.  Did brain-specific genes evolve faster in humans than in chimpanzees? , 2006, Trends in genetics : TIG.

[27]  David Haussler,et al.  Forces Shaping the Fastest Evolving Regions in the Human Genome , 2006, PLoS genetics.

[28]  Kyle Summers,et al.  Positive selection in the evolution of cancer , 2006, Biological reviews of the Cambridge Philosophical Society.

[29]  Yasuhiro Go Proceedings of the SMBE Tri-National Young Investigators' Workshop 2005. Lineage-specific expansions and contractions of the bitter taste receptor gene repertoire in vertebrates. , 2006, Molecular biology and evolution.

[30]  David Haussler,et al.  The UCSC Known Genes , 2006, Bioinform..

[31]  Joaquín Dopazo,et al.  Positive Selection, Relaxation, and Acceleration in the Evolution of the Human and Chimp Genome , 2006, PLoS Comput. Biol..

[32]  Terence P. Speed,et al.  Expression profiling in primates reveals a rapid evolution of human transcription factors , 2006, Nature.

[33]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[34]  Pierre Baldi,et al.  Global landscape of recent inferred Darwinian selection for Homo sapiens , 2006, Proc. Natl. Acad. Sci. USA.

[35]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[36]  K. Hastings Strong evolutionary conservation of broadly expressed protein isoforms in the troponin I gene family and other vertebrate gene families , 1996, Journal of Molecular Evolution.

[37]  M. Owen,et al.  Mutations in the gene encoding GlyT2 (SLC6A5) define a presynaptic component of human startle disease , 2006, Nature Genetics.

[38]  James A. Cuff,et al.  Genome sequence, comparative analysis and haplotype structure of the domestic dog , 2005, Nature.

[39]  R. Nielsen,et al.  Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level. , 2005, Molecular biology and evolution.

[40]  R. Nielsen Molecular signatures of natural selection. , 2005, Annual review of genetics.

[41]  Deborah A Nickerson,et al.  Genomic regions exhibiting positive selection identified from dense genotype data. , 2005, Genome research.

[42]  Ryan D. Hernandez,et al.  Natural selection on protein-coding genes in the human genome , 2005, Nature.

[43]  Yana Zhang,et al.  Cancer immunotherapy targeting Sp17: When should the laboratory findings be translated to the clinics? , 2005, American journal of hematology.

[44]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[45]  D. Kwiatkowski How malaria has affected the human genome and what human genetics can teach us about malaria. , 2005, American journal of human genetics.

[46]  C. Wilke,et al.  Why highly expressed proteins evolve slowly. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Timothy B Sackton,et al.  A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees , 2005, PLoS biology.

[48]  W. Wong,et al.  Bayes empirical bayes inference of amino acid sites under positive selection. , 2005, Molecular biology and evolution.

[49]  Jean L. Chang,et al.  An initial strategy for the systematic identification of functional elements in the human genome by low-redundancy comparative sequencing. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Doron Lancet,et al.  Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification , 2005, Bioinform..

[51]  Ziheng Yang,et al.  The power of phylogenetic comparison in revealing protein function. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[52]  M. Lercher,et al.  Explorer Evidence for Widespread Degradation of Gene Control Regions in Hominid Genomes , 2015 .

[53]  James G. R. Gilbert,et al.  The vertebrate genome annotation (Vega) database , 2004, Nucleic Acids Res..

[54]  Tatiana A. Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[55]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[56]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[57]  Christine M. Malcom,et al.  Accelerated Evolution of Nervous System Genes in the Origin of Homo sapiens , 2004, Cell.

[58]  K. Kangawa,et al.  Neuromedin U is involved in nociceptive reflexes and adaptation to environmental stimuli in mice. , 2004, Biochemical and biophysical research communications.

[59]  Bruce T Lahn,et al.  Positive selection on the human genome. , 2004, Human molecular genetics.

[60]  Stéphane Guindon,et al.  Modeling the site-specific variation of selection patterns along lineages. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[61]  K. Kangawa,et al.  The gut-brain peptide neuromedin U is involved in the mammalian circadian oscillator system. , 2004, Biochemical and biophysical research communications.

[62]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[63]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[64]  M. Adams,et al.  Inferring Nonneutral Evolution from Human-Chimp-Mouse Orthologous Gene Trios , 2003, Science.

[65]  N. Ryba,et al.  The Receptors for Mammalian Sweet and Umami Taste , 2003, Cell.

[66]  D. Haussler,et al.  Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[67]  Ziheng Yang,et al.  Estimating the distribution of selection coefficients from phylogenetic data with applications to mitochondrial and viral DNA. , 2003, Molecular biology and evolution.

[68]  X. Puente,et al.  Human and mouse proteases: a comparative genomic approach , 2003, Nature Reviews Genetics.

[69]  G. Freeman,et al.  The TIM gene family: emerging roles in immunity and disease , 2003, Nature Reviews Immunology.

[70]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[71]  D. Swallow,et al.  The maltase-glucoamylase gene: Common ancestry to sucrase-isomaltase with complementary starch digestion activities , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[72]  Anushya Muruganujan,et al.  PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification , 2003, Nucleic Acids Res..

[73]  J. Wall Estimating ancestral population sizes and divergence times. , 2003, Genetics.

[74]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[75]  E. Koonin,et al.  Orthology, paralogy and proposed classification for paralog subtypes. , 2002, Trends in genetics : TIG.

[76]  Dara G Torgerson,et al.  Mammalian sperm proteins are rapidly evolving: evidence of positive selection in functionally diverse genes. , 2002, Molecular biology and evolution.

[77]  R. Nielsen,et al.  Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. , 2002, Molecular biology and evolution.

[78]  H. Ostrer,et al.  Dominant and recessive deafness caused by mutations of a novel gene, TMC1, required for cochlear hair-cell function , 2002, Nature Genetics.

[79]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[80]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[81]  Feng-Chi Chen,et al.  Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. , 2001, American journal of human genetics.

[82]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[83]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[84]  L. Duret,et al.  Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. , 2000, Molecular biology and evolution.

[85]  M. Kreitman,et al.  Methods to detect selection in populations with applications to the human. , 2000, Annual review of genomics and human genetics.

[86]  J P Vincent,et al.  Neurotensin and neurotensin receptors. , 1999, Trends in pharmacological sciences.

[87]  O. Tollersrud,et al.  Spectrum of Mutations in α-Mannosidosis , 1999 .

[88]  R. Nielsen,et al.  Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. , 1998, Genetics.

[89]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[90]  W. Messier,et al.  Episodic adaptive evolution of primate lysozymes , 1997, Nature.

[91]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[92]  T. Yang-Feng,et al.  Genomic structure and mapping of the chromosomal gene for transcobalamin I (TCN1): comparison to human intrinsic factor. , 1992, Genomics.

[93]  J H Gillespie,et al.  The molecular clock may be an episodic clock. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[94]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[95]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[96]  Edwin B. Frost An Atlas of Representative Stellar Spectra from λ 4870 to λ 3300 , 1901 .