Evidence for turnover of functional noncoding DNA in mammalian genome evolution.

The vast majority of the mammalian genome does not code for proteins, and a fundamental question in genomics is: What proportion of the noncoding mammalian genome is functional? Most attempts to address this issue use sequence comparisons between highly diverged mammals such as human and mouse to identify conservation due to negative selection. But such comparisons will underestimate the true proportion of functional noncoding DNA if there is turnover, if patterns of negative selection change over time. Here we test whether the inferred level of negative selection differs between different pairwise species comparisons. Using a multiple alignment of more than a megabase of contiguous sequence from eight mammalian species, we find a strong negative relationship between inferred levels of negative selection and pairwise divergence using 21 pairwise comparisons. This result suggests that there is a high rate of turnover of functional noncoding elements in the mammalian genome, so measures of functional constraint based on human-mouse comparisons may seriously underestimate the true value.

[1]  Chuong B. Do,et al.  Access the most recent version at doi: 10.1101/gr.926603 References , 2003 .

[2]  M. Pagel,et al.  The comparative method in evolutionary biology , 1991 .

[3]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[4]  D. Haussler,et al.  Article Identification and Characterization of Multi-Species Conserved Sequences , 2022 .

[5]  I-Min A. Dubchak,et al.  Active conservation of noncoding sequences revealed by three-way species comparisons. , 2000, Genome research.

[6]  C. V. Jongeneel,et al.  Numerous potentially functional but non-genic conserved sequences on human chromosome 21 , 2002, Nature.

[7]  W. Miller,et al.  Distinguishing regulatory DNA from neutral sites. , 2003, Genome research.

[8]  M. Ludwig,et al.  Functional evolution of noncoding DNA. , 2002, Current opinion in genetics & development.

[9]  Jurg Ott,et al.  Distribution and characterization of regulatory elements in the human genome. , 2002, Genome research.

[10]  J. Fickett,et al.  Discovery and modeling of transcriptional regulatory regions. , 2000, Current opinion in biotechnology.

[11]  Sudhir Kumar,et al.  Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes. , 2003, Genome research.

[12]  S. Pääbo,et al.  Intra- and Interspecific Variation in Primate Gene Expression Patterns , 2002, Science.

[13]  S. Batzoglou,et al.  Characterization of evolutionary rates and constraints in three Mammalian genomes. , 2004, Genome research.

[14]  Hans Ellegren,et al.  Deterministic mutation rate variation in the human genome. , 2002, Genome research.

[15]  M. Nei,et al.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. , 1993, Molecular biology and evolution.

[16]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[17]  S. Palumbi,et al.  High intron sequence conservation across three mammalian orders suggests functional constraints. , 2003, Molecular biology and evolution.

[18]  S. Batzoglou,et al.  Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes. , 2003, Genome research.

[19]  A. Clark,et al.  Evolution of transcription factor binding sites in Mammalian gene regulatory regions: conservation and turnover. , 2002, Molecular biology and evolution.

[20]  A. Ogurtsov,et al.  Selective constraint in intergenic regions of human and mouse genomes. , 2001, Trends in genetics : TIG.

[21]  N. Patel,et al.  Evidence for stabilizing selection in a eukaryotic enhancer element , 2000, Nature.

[22]  R. Britten,et al.  Mobile elements inserted in the distant past have taken on important functions. , 1997, Gene.

[23]  Alexey S Kondrashov,et al.  Patterns in spontaneous mutation revealed by human-baboon sequence comparison. , 2002, Trends in genetics : TIG.

[24]  Ziheng Yang Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: Approximate methods , 1994, Journal of Molecular Evolution.

[25]  Eugene V Koonin,et al.  A significant fraction of conserved noncoding DNA in human and mouse consists of predicted matrix attachment regions. , 2003, Trends in genetics : TIG.

[26]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[27]  Jon D. McAuliffe,et al.  Phylogenetic Shadowing of Primate Sequences to Find Functional Regions of the Human Genome , 2003, Science.

[28]  H. Ellegren,et al.  Mutation rate variation in the mammalian genome. , 2003, Current opinion in genetics & development.

[29]  David G. Harris,et al.  Conserved fragments of transposable elements in intergenic regions: evidence for widespread recruitment of MIR- and L2-derived sequences within the mouse and human genomes. , 2003, Genetical research.

[30]  Lior Pachter,et al.  MAVID multiple alignment server , 2003, Nucleic Acids Res..

[31]  T. Ohta Near-neutrality in evolution of genes and gene regulation , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[32]  L. Pachter,et al.  Strategies and tools for whole-genome alignments. , 2002, Genome research.

[33]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[34]  Christopher B. Burge,et al.  DNA Sequence Evolution with Neighbor-Dependent Mutation , 2003, J. Comput. Biol..

[35]  Hidetoshi Shimodaira,et al.  Consistency of SINE insertion topology and flanking sequence tree: quantifying relationships among cetartiodactyls. , 2000, Molecular biology and evolution.

[36]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[37]  Wen-Hsiung Li,et al.  Mutation rates differ among regions of the mammalian genome , 1989, Nature.