Characterization of evolutionary rates and constraints in three Mammalian genomes.

We present an analysis of rates and patterns of microevolutionary phenomena that have shaped the human, mouse, and rat genomes since their last common ancestor. We find evidence for a shift in the mutational spectrum between the mouse and rat lineages, with the net effect being a relative increase in GC content in the rat genome. Our estimate for the neutral point substitution rate separating the two rodents is 0.196 substitutions per site, and 0.65 substitutions per site for the tree relating all three mammals. Small insertions and deletions of 1-10 bp in length ("microindels") occur at approximately 5% of the point substitution rate. Inferred regional correlations in evolutionary rates between lineages and between types of sites support the idea that rates of evolution are influenced by local genomic or cell biological context. No substantial correlations between rates of point substitutions and rates of microindels are found, however, implying that the influences that affect these processes are distinct. Finally, we have identified those regions in the human genome that are evolving slowly, which are likely to include functional elements important to human biology. At least 5% of the human genome is under substantial constraint, most of which is noncoding.

[1]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[2]  S. Jeffery Evolution of Protein Molecules , 1979 .

[3]  A. Bird,et al.  The expected equilibrium of the CpG dinucleotide in vertebrate genomes under a mutation model. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[4]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[5]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[6]  D. Petrov,et al.  Evidence for DNA loss as a determinant of genome size. , 2000, Science.

[7]  R. Hardison Conserved noncoding sequences are reliable guides to regulatory elements. , 2000, Trends in genetics : TIG.

[8]  F. Ruddle,et al.  An efficient cis-element discovery method using multiple sequence comparisons based on evolutionary relationships. , 2001, Genomics.

[9]  J. Cheng,et al.  Human and mouse ABCA1 comparative sequencing and transgenesis studies revealing novel regulatory sequences. , 2001, Genomics.

[10]  D. Balding,et al.  Models of sequence evolution for DNA sequences containing gaps. , 2001, Molecular biology and evolution.

[11]  A. Ogurtsov,et al.  Selective constraint in intergenic regions of human and mouse genomes. , 2001, Trends in genetics : TIG.

[12]  Berthold Göttgens,et al.  Transcriptional regulation of the stem cell leukemia gene (SCL)--comparative analysis of five vertebrate SCL loci. , 2002, Genome research.

[13]  C. V. Jongeneel,et al.  Numerous potentially functional but non-genic conserved sequences on human chromosome 21 , 2002, Nature.

[14]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[15]  Tao Jiang,et al.  Finding Genes by Computer: Probabilistic and Discriminative Approaches , 2002 .

[16]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[17]  Dmitri A Petrov,et al.  Mutational equilibrium model of genome size evolution. , 2002, Theoretical population biology.

[18]  Arend Sidow,et al.  Sequence First. Ask Questions Later. , 2002, Cell.

[19]  Nancy F. Hansen,et al.  Comparative analyses of multi-species sequences from targeted genomic regions , 2003, Nature.

[20]  L. Pachter,et al.  Strategies and tools for whole-genome alignments. , 2002, Genome research.

[21]  Jon D. McAuliffe,et al.  Phylogenetic Shadowing of Primate Sequences to Find Functional Regions of the Human Genome , 2003, Science.

[22]  Chuong B. Do,et al.  Access the most recent version at doi: 10.1101/gr.926603 References , 2003 .

[23]  S. Batzoglou,et al.  Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes. , 2003, Genome research.

[24]  Arend Sidow,et al.  Genomic regulatory regions: insights from comparative sequence analysis. , 2003, Current opinion in genetics & development.

[25]  S. Palumbi,et al.  High intron sequence conservation across three mammalian orders suggests functional constraints. , 2003, Molecular biology and evolution.

[26]  David Haussler,et al.  Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. , 2003, Genome research.

[27]  Ziheng Yang Estimating the pattern of nucleotide substitution , 1994, Journal of Molecular Evolution.

[28]  Lisa M. D'Souza,et al.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution , 2004, Nature.

[29]  D. Haussler,et al.  Phylogenetic estimation of context-dependent substitution rates by maximum likelihood. , 2003, Molecular biology and evolution.

[30]  Inna Dubchak,et al.  Automated whole-genome multiple alignment of rat, mouse, and human. , 2004, Genome research.

[31]  Wen-Hsiung Li,et al.  Nonrandomness of point mutation as reflected in nucleotide substitutions in pseudogenes and its evolutionary implications , 2005, Journal of Molecular Evolution.

[32]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.