Universal Pacemaker of Genome Evolution in Animals and Fungi and Variation of Evolutionary Rates in Diverse Organisms

Gene evolution is traditionally considered within the framework of the molecular clock (MC) model whereby each gene is characterized by an approximately constant rate of evolution. Recent comparative analysis of numerous phylogenies of prokaryotic genes has shown that a different model of evolution, denoted the Universal PaceMaker (UPM), which postulates conservation of relative, rather than absolute evolutionary rates, yields a better fit to the phylogenetic data. Here, we show that the UPM model is a better fit than the MC for genome wide sets of phylogenetic trees from six species of Drosophila and nine species of yeast, with extremely high statistical significance. Unlike the prokaryotic phylogenies that include distant organisms and multiple horizontal gene transfers, these are simple data sets that cover groups of closely related organisms and consist of gene trees with the same topology as the species tree. The results indicate that both lineage-specific and gene-specific rates are important in genome evolution but the lineage-specific contribution is greater. Similar to the MC, the gene evolution rates under the UPM are strongly overdispersed, approximately 2-fold compared with the expectation from sampling error alone. However, we show that neither Drosophila nor yeast genes form distinct clusters in the tree space. Thus, the gene-specific deviations from the UPM, although substantial, are uncorrelated and most likely depend on selective factors that are largely unique to individual genes. Thus, the UPM appears to be a key feature of genome evolution across the history of cellular life.

[1]  C. Wilke,et al.  The evolutionary consequences of erroneous protein synthesis , 2009, Nature Reviews Genetics.

[2]  S. Renner Relaxed molecular clocks for dating historical plant dispersal events. , 2005, Trends in plant science.

[3]  Robert Tibshirani,et al.  Estimating the number of clusters in a data set via the gap statistic , 2000 .

[4]  N. Grishin,et al.  From complete genomes to measures of substitution rate variability within and between proteins. , 2000, Genome research.

[5]  R. Lanfear,et al.  Watching the clock: studying variation in rates of molecular evolution between species. , 2010, Trends in ecology & evolution.

[6]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[7]  Sagi Snir,et al.  Stability along with Extreme Variability in Core Genome Evolution , 2013, Genome biology and evolution.

[8]  Liran Carmel,et al.  Unifying measures of gene function and evolution , 2006, Proceedings of the Royal Society B: Biological Sciences.

[9]  M. Kimura Molecular evolutionary clock and the neutral theory , 2005, Journal of Molecular Evolution.

[10]  Emile Zuckerkandl,et al.  On the molecular evolutionary clock , 2005, Journal of Molecular Evolution.

[11]  A. Rambaut,et al.  BEAST: Bayesian evolutionary analysis by sampling trees , 2007, BMC Evolutionary Biology.

[12]  C. Wilke Molecular clock in neutral protein evolution , 2004, BMC Genetics.

[13]  David James Sherman,et al.  Génolevures: protein families and synteny among complete hemiascomycetous yeast proteomes and genomes , 2008, Nucleic Acids Res..

[14]  D. Cutler,et al.  Understanding the overdispersed molecular clock. , 2000, Genetics.

[15]  S. Ho,et al.  Relaxed Phylogenetics and Dating with Confidence , 2006, PLoS biology.

[16]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[17]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[18]  N. M. Brooke,et al.  A molecular timescale for vertebrate evolution , 1998, Nature.

[19]  H. Kishino,et al.  Estimating the rate of evolution of the rate of molecular evolution. , 1998, Molecular biology and evolution.

[20]  Trevor Bedford,et al.  Overdispersion of the molecular clock: temporal variation of gene-specific substitution rates in Drosophila. , 2008, Molecular biology and evolution.

[21]  G. Butler,et al.  Yeast genome evolution—the origin of the species , 2007, Yeast.

[22]  Harkamal Walia,et al.  Protein abundances are more conserved than mRNA abundances across diverse taxa , 2010, Proteomics.

[23]  E. Koonin,et al.  Search for a 'Tree of Life' in the thicket of the phylogenetic forest , 2009, Journal of biology.

[24]  Trevor Bedford,et al.  Overdispersion of the Molecular Clock Varies Between Yeast, Drosophila and Mammals , 2008, Genetics.

[25]  L. Bromham The genome as a life-history character: why rate of molecular evolution varies between mammal species , 2011, Philosophical Transactions of the Royal Society B: Biological Sciences.

[26]  Olaf R. P. Bininda-Emonds,et al.  Fast Genes and Slow Clades: Comparative Rates of Molecular Evolution in Mammals , 2007, Evolutionary bioinformatics online.

[27]  Eugene V Koonin,et al.  The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent ages , 2009, Proceedings of the National Academy of Sciences.

[28]  Nicolas Galtier,et al.  The erratic mitochondrial clock: variations of mutation rate, not population size, affect mtDNA diversity across birds and mammals , 2009, BMC Evolutionary Biology.

[29]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[30]  F J Ayala,et al.  Erratic overdispersion of three molecular clocks: GPDH, SOD, and XDH , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[31]  B. Dujon Yeast evolutionary genomics , 2010, Nature Reviews Genetics.

[32]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[33]  M. Suchard,et al.  Bayesian random local clocks, or one rate to rule them all , 2010, BMC Biology.

[34]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[35]  N. Takahata,et al.  On the overdispersed molecular clock. , 1987, Genetics.

[36]  Andrew P. Martin,et al.  Rates of mitochondrial DNA evolution in sharks are slow compared with mammals , 1992, Nature.

[37]  Yuri I Wolf,et al.  Coping with the quantitative genomics 'elephant': the correlation between the gene dispensability and evolution rate. , 2006, Trends in genetics : TIG.

[38]  Sagi Snir,et al.  Universal Pacemaker of Genome Evolution , 2012, PLoS Comput. Biol..

[39]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[40]  D. Graur,et al.  Reading the entrails of chickens: molecular timescales of evolution and the illusion of precision. , 2004, Trends in genetics : TIG.

[41]  Lindell Bromham,et al.  Molecular dating when rates vary. , 2005, Trends in ecology & evolution.

[42]  L. Duret,et al.  Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. , 2000, Molecular biology and evolution.

[43]  D. Penny,et al.  The modern molecular clock , 2003, Nature Reviews Genetics.

[44]  S. Blair Hedges,et al.  The origin and evolution of model organisms , 2002, Nature Reviews Genetics.

[45]  L. Bromham Why do species vary in their rate of molecular evolution? , 2009, Biology Letters.

[46]  Mira V. Han,et al.  Gene Family Evolution across 12 Drosophila Genomes , 2007, PLoS genetics.

[47]  Eugene V Koonin,et al.  Constant relative rate of protein evolution and detection of functional diversification among bacterial, archaeal and eukaryotic proteins , 2001, Genome Biology.

[48]  F J Ayala,et al.  Neutralism and selectionism: the molecular clock. , 2000, Gene.

[49]  L. Pauling,et al.  Evolutionary Divergence and Convergence in Proteins , 1965 .