Rooting the ribosomal tree of life.

The origin of the genetic code and the rooting of the tree of life (ToL) are two of the most challenging problems in the study of life's early evolution. Although both have been the focus of numerous investigations utilizing a variety of methods, until now, each problem has been addressed independently. Typically, attempts to root the ToL have relied on phylogenies of genes with ancient duplications, which are subject to artifacts of tree reconstruction and horizontal gene transfer, or specific physiological characters believed to be primitive, which are often based on subjective criteria. Here, we demonstrate a unique method for rooting based on the identification of amino acid usage biases comprising the residual signature of a more primitive genetic code. Using a phylogenetic tree of concatenated ribosomal proteins, our analysis of amino acid compositional bias detects a strong and unique signal associated with the early expansion of the genetic code, placing the root of the translation machinery along the bacterial branch.

[1]  F. Cramer,et al.  Site of aminoacylation of tRNAs from Escherichia coli with respect to the 2'- or 3'-hydroxyl group of the terminal adenosine. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[2]  C. Woese,et al.  Phylogenetic structure of the prokaryotic domain: The primary kingdoms , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[3]  M. O. Dayhoff,et al.  Origins of prokaryotes, eukaryotes, mitochondria, and chloroplasts. , 1978, Science.

[4]  S. Osawa,et al.  Evolutionary relationship of archaebacteria, eubacteria, and eukaryotes inferred from phylogenetic trees of duplicated genes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Masasuke Yoshida,et al.  Evolution of the vacuolar H+-ATPase: implications for the origin of eukaryotes. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[6]  K. Watanabe,et al.  Proline residues responsible for thermostability occur with high frequency in the loop regions of an extremely thermostable oligo-1,6-glucosidase from Bacillus thermoglucosidasius KP1006. , 1991, The Journal of biological chemistry.

[7]  Lynn Margulis,et al.  Symbiosis in Cell Evolution: Microbial Communities in the Archean and Proterozoic Eons , 1992 .

[8]  M. W. Gray,et al.  Origin and evolution of organelle genomes. , 1993, Current opinion in genetics & development.

[9]  J. Schofield Molecular studies on an ancient gene encoding for carbamoyl-phosphate synthetase. , 1993, Clinical science.

[10]  H. Himeno,et al.  Escherichia coli seryl-tRNA synthetase recognizes tRNA(Ser) by its characteristic tertiary structure. , 1994, Journal of molecular biology.

[11]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[12]  J. Gogarten,et al.  The early evolution of cellular life. , 1995, Trends in ecology & evolution.

[13]  W. Doolittle,et al.  Root of the universal tree of life based on ancient aminoacyl-tRNA synthetase gene duplications. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[14]  M. Delarue,et al.  Structure of phenylalanyl-tRNA synthetase from Thermus thermophilus , 1995, Nature Structural Biology.

[15]  J D Palmer,et al.  The root of the universal tree and the origin of eukaryotes based on elongation factor phylogeny. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[16]  R. L. Charlebois,et al.  Phylogenetic analysis of carbamoylphosphate synthetase genes: complex evolutionary history includes an internal duplication within a gene which can root the tree of life. , 1996, Molecular biology and evolution.

[17]  P. Dennis,et al.  Evolutionary divergence and salinity-mediated selection in halophilic archaea. , 1997, Microbiology and molecular biology reviews : MMBR.

[18]  P. Dennis,et al.  Evolutionary divergence and salinity-mediated selection in halophilic archaea , 1997 .

[19]  Simonetta Gribaldo,et al.  The Root of the Universal Tree of Life Inferred from Anciently Duplicated Genes Encoding Components of the Protein-Targeting Machinery , 1998, Journal of Molecular Evolution.

[20]  P. Forterre,et al.  The Rooting of the Universal Tree of Life Is Not Reliable , 1999, Journal of Molecular Evolution.

[21]  D Penny,et al.  The nature of the last universal common ancestor. , 1999, Current opinion in genetics & development.

[22]  E N Trifonov,et al.  Consensus temporal order of amino acids and evolution of the triplet code. , 2000, Gene.

[23]  K. Nishikawa,et al.  Protein surface amino acid compositions distinctively differ between thermophilic and mesophilic bacteria. , 2001, Journal of molecular biology.

[24]  Laura F. Landweber,et al.  Rewiring the keyboard: evolvability of the genetic code , 2001, Nature Reviews Genetics.

[25]  T. Cavalier-smith,et al.  The neomuran origin of archaebacteria, the negibacterial root of the universal tree and bacterial megaclassification. , 2002, International journal of systematic and evolutionary microbiology.

[26]  W. Hennig Phylogenetic Systematics , 2002 .

[27]  J. R. Fresco,et al.  Increased Frequency of Cysteine, Tyrosine, and Phenylalanine Residues Since the Last Universal Ancestor* , 2002, Molecular & Cellular Proteomics.

[28]  W. Doolittle,et al.  Prokaryotic evolution in light of gene transfer. , 2002, Molecular biology and evolution.

[29]  Mona Singh,et al.  Evolution of amino acid frequencies in proteins over deep time: inferred order of introduction of amino acids into the genetic code. , 2002, Molecular biology and evolution.

[30]  N. Pace,et al.  The genetic core of the universal ancestor. , 2003, Genome research.

[31]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[32]  Mona Singh,et al.  A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor , 2004, Bioinform..

[33]  N. Grishin,et al.  Reconstruction of ancestral protein sequences and its applications , 2004, BMC Evolutionary Biology.

[34]  J. Gogarten,et al.  Cladogenesis, coalescence and the evolution of the three domains of life. , 2004, Trends in genetics : TIG.

[35]  H. Philippe,et al.  A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. , 2004, Molecular biology and evolution.

[36]  Liron Klipcan,et al.  Amino acid biogenesis, evolution of the genetic code and aminoacyl-tRNA synthetases. , 2004, Journal of theoretical biology.

[37]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[38]  J. Gogarten,et al.  Evolutionary and Diagnostic Implications of Intragenomic Heterogeneity in the 16S rRNA Gene in Aeromonas Strains , 2005, Journal of bacteriology.

[39]  Hyman Hartman,et al.  Speculations on the evolution of the genetic code IV the evolution of the aminoacyl-tRNA synthetases , 1995, Origins of Life and Evolution of the Biosphere.

[40]  J. Peter Gogarten,et al.  Ancient gene duplications and the root(s) of the tree of life , 2005, Protoplasma.

[41]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[42]  Manuel A. S. Santos,et al.  Evolution of the genetic code in yeasts , 2006, Yeast.

[43]  T. Cavalier-smith,et al.  Rooting the tree of life by transition analyses , 2006, Biology Direct.

[44]  Jacqueline A. Servin,et al.  Evidence that the root of the tree of life is not within the Archaea. , 2006, Molecular biology and evolution.

[45]  D. Penny,et al.  Response to Dagan and Martin , 2007, BioEssays : news and reviews in molecular, cellular and developmental biology.

[46]  Peer Bork,et al.  Genome-Wide Experimental Determination of Barriers to Horizontal Gene Transfer , 2007, Science.

[47]  A. Lazcano,et al.  Question 7: Comparative Genomics and Early Cell Evolution: A Cautionary Methodological Note , 2007, Origins of Life and Evolution of Biospheres.

[48]  Penny,et al.  Testing hypotheses without considering predictions , 2007 .

[49]  X.-X. Zhou,et al.  Differences in amino acids composition and coupling patterns between mesophilic and thermophilic proteins , 2007, Amino Acids.

[50]  J. Gogarten,et al.  Planetary Systems and the Origins of Life: Horizontal gene transfer, gene histories, and the root of the tree of life , 2007 .

[51]  M. Di Giulio The tree of life might be rooted in the branch leading to Nanoarchaeota. , 2007, Gene.

[52]  J. Gogarten,et al.  Signature of a Primitive Genetic Code in Ancient Protein Lineages , 2007, Journal of Molecular Evolution.

[53]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[54]  D. Penny,et al.  Evaluating hypotheses for the origin of eukaryotes. , 2007, BioEssays : news and reviews in molecular, cellular and developmental biology.

[55]  Jacqueline A. Servin,et al.  Rooting the tree of life using nonubiquitous genes. , 2007, Molecular biology and evolution.

[56]  Jacqueline A. Servin,et al.  Evidence for a gram-positive, eubacterial root of the tree of life. , 2007, Molecular biology and evolution.

[57]  S. Harris,et al.  The archaebacterial origin of eukaryotes , 2008, Proceedings of the National Academy of Sciences.

[58]  M. Gouy,et al.  Parallel adaptations to high temperatures in the Archaean eon , 2008, Nature.

[59]  Akash Ranjan,et al.  Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome , 2008, Nucleic acids research.

[60]  Nicolas Lartillot,et al.  A site- and time-heterogeneous model of amino acid replacement. , 2008, Molecular biology and evolution.

[61]  Tal Dagan,et al.  Modular networks and cumulative impact of lateral transfer in prokaryote genome evolution , 2008, Proceedings of the National Academy of Sciences.

[62]  R. Wellinger,et al.  A novel approach for organelle-specific DNA damage targeting reveals different susceptibility of mitochondrial DNA to the anticancer drugs camptothecin and topotecan , 2009, Nucleic acids research.

[63]  P. Higgs A four-column theory for the origin of the genetic code: tracing the evolutionary pathways that gave rise to an optimized code , 2009, Biology Direct.

[64]  G. Fournier Genetic code evolution and amino acid composition analysis , 2009 .

[65]  Maureen A. O’Malley,et al.  Prokaryotic evolution and the tree of life are two different things , 2009, Biology Direct.

[66]  Wanil Kim,et al.  Mouse period 2 mRNA circadian oscillation is modulated by PTB–mediated rhythmic mRNA degradation , 2008, Nucleic acids research.

[67]  J. Gogarten,et al.  Trees in the Web of Life , 2009, Journal of biology.