Adaptive Evolution of Conserved Noncoding Elements in Mammals

Conserved noncoding elements (CNCs) are an abundant feature of vertebrate genomes. Some CNCs have been shown to act as cis-regulatory modules, but the function of most CNCs remains unclear. To study the evolution of CNCs, we have developed a statistical method called the “shared rates test” to identify CNCs that show significant variation in substitution rates across branches of a phylogenetic tree. We report an application of this method to alignments of 98,910 CNCs from the human, chimpanzee, dog, mouse, and rat genomes. We find that ∼68% of CNCs evolve according to a null model where, for each CNC, a single parameter models the level of constraint acting throughout the phylogeny linking these five species. The remaining ∼32% of CNCs show departures from the basic model including speed-ups and slow-downs on particular branches and occasionally multiple rate changes on different branches. We find that a subset of the significant CNCs have evolved significantly faster than the local neutral rate on a particular branch, providing strong evidence for adaptive evolution in these CNCs. The distribution of these signals on the phylogeny suggests that adaptive evolution of CNCs occurs in occasional short bursts of evolution. Our analyses suggest a large set of promising targets for future functional studies of adaptation.

[1]  C. Ponting,et al.  Finishing the euchromatic sequence of the human genome , 2004 .

[2]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[3]  Tim Hubbard Finishing the euchromatic sequence of the human genome , 2004 .

[4]  Lior Pachter,et al.  Identification of evolutionary hotspots in the rodent genomes. , 2004, Genome research.

[5]  H. Ellegren,et al.  Mutation rate variation in the mammalian genome. , 2003, Current opinion in genetics & development.

[6]  Terence P. Speed,et al.  Expression profiling in primates reveals a rapid evolution of human transcription factors , 2006, Nature.

[7]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[8]  E. Birney,et al.  Comparison of human chromosome 21 conserved nongenic sequences (CNGs) with the mouse and dog genomes shows that their selective constraint is independent of their genic environment. , 2004, Genome research.

[9]  Shyam Prabhakar,et al.  Mapping cis-regulatory domains in the human genome using multi-species conservation of synteny. , 2005, Human molecular genetics.

[10]  S. Pääbo,et al.  Accelerated Evolution of Conserved Noncoding Sequences in Humans , 2006, Science.

[11]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[12]  James A. Cuff,et al.  Genome sequence, comparative analysis and haplotype structure of the domestic dog , 2005, Nature.

[13]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[14]  J. Durbin,et al.  Testing for serial correlation in least squares regression. II. , 1950, Biometrika.

[15]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[16]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[17]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[18]  David Haussler,et al.  Forces Shaping the Fastest Evolving Regions in the Human Genome , 2006, PLoS genetics.

[19]  David Haussler,et al.  New Methods for Detecting Lineage-Specific Selection , 2006, RECOMB.

[20]  Nadav Ahituv,et al.  Exploiting human--fish genome comparisons for deciphering gene regulation. , 2004, Human molecular genetics.

[21]  L. Duret,et al.  Adaptation or biased gene conversion? Extending the null hypothesis of molecular evolution. , 2007, Trends in genetics : TIG.

[22]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[23]  D. Berry,et al.  Bayesian multiple comparisons using dirichlet process priors , 1998 .

[24]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[25]  Klaudia Walter,et al.  Highly Conserved Non-Coding Sequences Are Associated with Vertebrate Development , 2004, PLoS biology.

[26]  M. Nóbrega,et al.  Scanning Human Gene Deserts for Long-Range Enhancers , 2003, Science.

[27]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[28]  Daniel J. Gaffney,et al.  The scale of mutational variation in the murid genome. , 2005, Genome research.

[29]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  J. Felsenstein,et al.  A Hidden Markov Model approach to variation among sites in rate of evolution. , 1996, Molecular biology and evolution.

[31]  S. Batzoglou,et al.  Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes. , 2003, Genome research.

[32]  Matthew W. Hahn,et al.  Ancient and Recent Positive Selection Transformed Opioid cis-Regulation in Humans , 2005, PLoS biology.

[33]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[34]  A. Reymond,et al.  Conserved non-genic sequences — an unexpected feature of mammalian genomes , 2005, Nature Reviews Genetics.

[35]  H. Kondoh,et al.  Efficient identification of regulatory sequences in the chicken genome by a powerful combination of embryo electroporation and genome comparison , 2004, Mechanisms of Development.

[36]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[37]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[38]  Hongkai Ji,et al.  Why do human diversity levels vary at a megabase scale? , 2005, Genome research.

[39]  Shamil Sunyaev,et al.  Small fitness effect of mutations in highly conserved non-coding regions. , 2005, Human molecular genetics.

[40]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[41]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[42]  Tanya Vavouri,et al.  Ancient duplicated conserved noncoding elements in vertebrates: a genomic and functional analysis. , 2006, Genome research.

[43]  M. Adams,et al.  Inferring Nonneutral Evolution from Human-Chimp-Mouse Orthologous Gene Trios , 2003, Science.

[44]  J. Durbin,et al.  Testing for serial correlation in least squares regression. I. , 1950, Biometrika.

[45]  Shamil Sunyaev,et al.  Evolutionary constraints in conserved nongenic sequences of mammals. , 2005, Genome research.

[46]  D. Haussler,et al.  Article Identification and Characterization of Multi-Species Conserved Sequences , 2022 .

[47]  Alan M. Moses,et al.  In vivo enhancer analysis of human conserved non-coding sequences , 2006, Nature.

[48]  W. Ewens The sampling theory of selectively neutral alleles. , 1972, Theoretical population biology.

[49]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[50]  Hans Ellegren,et al.  Deterministic mutation rate variation in the human genome. , 2002, Genome research.

[51]  Shyam Prabhakar,et al.  Close sequence comparisons are sufficient to identify human cis-regulatory elements. , 2005, Genome research.

[52]  Martin J Lercher,et al.  Gene expression, synteny, and local similarity in human noncoding mutation rates. , 2004, Molecular biology and evolution.

[53]  D. Haussler,et al.  An RNA gene expressed during cortical development evolved rapidly in humans , 2006, Nature.

[54]  J. Tena,et al.  A functional survey of the enhancer activity of conserved non-coding sequences from vertebrate Iroquois cluster gene deserts. , 2005, Genome research.

[55]  Rob J Hyndman,et al.  Using R to Teach Econometrics , 2002 .

[56]  Laurent Excoffier,et al.  Conserved noncoding sequences are selectively constrained and not mutation cold spots , 2006, Nature Genetics.

[57]  Colin N. Dewey,et al.  Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution , 2004, Nature.

[58]  Eric S. Lander,et al.  Genetic evidence for complex speciation of humans and chimpanzees , 2006, Nature.