Translation: The Universal Structural Core of Life

Abstract The Universal Gene Set of Life (UGSL) is common to genomes of all extant organisms. The UGSL is small, consisting of <100 genes, and is dominated by genes encoding the translation system. Here we extend the search for biological universality to three dimensions. We characterize and quantitate the universality of structure of macromolecules that are common to all of life. We determine that around 90% of prokaryotic ribosomal RNA (rRNA) forms a common core, which is the structural and functional foundation of rRNAs of all cytoplasmic ribosomes. We have established a database, which we call the Sparse and Efficient Representation of the Extant Biology (the SEREB database). This database contains complete and cross-validated rRNA sequences of species chosen, as far as possible, to sparsely and efficiently sample all known phyla. Atomic-resolution structures of ribosomes provide data for structural comparison and validation of sequence-based models. We developed a similarity statistic called pairing adjusted sequence entropy, which characterizes paired nucleotides by their adherence to covariation and unpaired nucleotides by conventional conservation of identity. For canonically paired nucleotides the unit of structure is the nucleotide pair. For unpaired nucleotides, the unit of structure is the nucleotide. By quantitatively defining the common core of rRNA, we systematize the conservation and divergence of the translational system across the tree of life, and can begin to understand the unique evolutionary pressures that cause its universality. We explore the relationship between ribosomal size and diversity, geological time, and organismal complexity.

[1]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..

[2]  P. Forterre,et al.  Lokiarchaea are close relatives of Euryarchaeota, not bridging the gap between prokaryotes and eukaryotes , 2017, PLoS genetics.

[3]  A. Lupas,et al.  Ribosomal proteins as documents of the transition from unstructured (poly)peptides to folded proteins. , 2017, Journal of structural biology.

[4]  A. Petrov,et al.  Frozen in Time: The History of Proteins , 2017, Molecular biology and evolution.

[5]  Kazutaka Katoh,et al.  A simple method to control over-alignment in the MAFFT multiple sequence alignment program , 2016, Bioinform..

[6]  G. Fox,et al.  Centers of motion associated with EF-Tu binding to the ribosome , 2016, RNA biology.

[7]  Loren Dean Williams,et al.  History of the ribosome and the origin of translation , 2015, Proceedings of the National Academy of Sciences.

[8]  Benjamin J. Raphael,et al.  Universal and domain-specific sequences in 23S–28S ribosomal RNA identified by computational phylogenetics , 2015, RNA.

[9]  P. Penczek,et al.  Structural Snapshots of Actively Translating Human Ribosomes , 2015, Cell.

[10]  B. Klaholz,et al.  Structure of the human 80S ribosome , 2015, Nature.

[11]  Chiaolong Hsiao,et al.  Evolution of the ribosome at atomic resolution , 2014, Proceedings of the National Academy of Sciences.

[12]  David H Burkhardt,et al.  Quantifying Absolute Protein Synthesis Rates Reveals Principles Underlying Allocation of Cellular Resources , 2014, Cell.

[13]  V. G. Panse,et al.  A new system for naming ribosomal proteins. , 2014, Current opinion in structural biology.

[14]  Daniel N. Wilson,et al.  Structures of the human and Drosophila 80S ribosome , 2013, Nature.

[15]  Pelin Yilmaz,et al.  The SILVA ribosomal RNA gene database project: improved data processing and web-based tools , 2012, Nucleic Acids Res..

[16]  R. Gutell,et al.  Structural Constraints Identified with Covariation Analysis in Ribosomal RNA , 2012, PloS one.

[17]  M. Yusupov,et al.  One core, two shells: bacterial and eukaryotic ribosomes , 2012, Nature Structural &Molecular Biology.

[18]  T. Hwa,et al.  Interdependence of Cell Growth and Gene Expression: Origins and Consequences , 2010, Science.

[19]  Ziheng Yang,et al.  The Timetree of Life , 2010 .

[20]  Y. Mandel-Gutfreund,et al.  Structural signatures of antibiotic binding sites on the ribosome , 2010, Nucleic acids research.

[21]  A. Elofsson,et al.  Structure is three to ten times more conserved than sequence—A study of structural response in protein cores , 2009, Proteins.

[22]  Chiaolong Hsiao,et al.  Peeling the onion: ribosomes are ancient molecular fossils. , 2009, Molecular biology and evolution.

[23]  Matthew Belousoff,et al.  The evolving ribosome: from non-coded peptide bond formation to sophisticated translation machinery. , 2009, Research in microbiology.

[24]  Ilana Agmon,et al.  The Dimeric Proto-Ribosome: Structural Details and Possible Implications on the Origin of Life , 2009, International journal of molecular sciences.

[25]  N. Grishin,et al.  PROMALS3D: a tool for multiple protein sequence and structure alignments , 2008, Nucleic acids research.

[26]  Craig L. Zirbel,et al.  FR3D: finding local and composite recurrent structural motifs in RNA 3D structures , 2007, Journal of mathematical biology.

[27]  M. Kimura The role of compensatory neutral mutations in molecular evolution , 1985, Journal of Genetics.

[28]  Ed Zintel,et al.  Resources , 1998, IT Prof..

[29]  R. Knight,et al.  Evolutionary rates vary among rRNA structural elements , 2007, Nucleic acids research.

[30]  Joel Dudley,et al.  TimeTree: a public knowledge-base of divergence times among organisms , 2006, Bioinform..

[31]  Julio O. Ortiz,et al.  Mapping 70S ribosomes in intact cells by cryoelectron tomography and pattern recognition. , 2006, Journal of structural biology.

[32]  M. Selmer,et al.  Structure of the 70S Ribosome Complexed with mRNA and tRNA , 2006, Science.

[33]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[34]  Patrick J. Paddison,et al.  Second-generation shRNA libraries covering the mouse and human genomes , 2005, Nature Genetics.

[35]  Hung-Chung Huang,et al.  The application of cluster analysis in the intercomparison of loop structures in RNA. , 2005, RNA.

[36]  A. Emili,et al.  Interaction network containing conserved and essential protein complexes in Escherichia coli , 2005, Nature.

[37]  Temple F. Smith,et al.  Ribosomal protein-sequence block structure suggests complex prokaryotic evolution with implications for the origin of eukaryotes. , 2004, Molecular phylogenetics and evolution.

[38]  Robert L Charlebois,et al.  Chlamydia: 780.57 (sd = 1.81), range 778–784, n =7 Cyanobacteria: 820.50 (sd = 23.53), range 776–844, n =8 , 2022 .

[39]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[40]  M. Lynch,et al.  The Origins of Genome Complexity , 2003, Science.

[41]  Eugene V. Koonin,et al.  Comparative genomics, minimal gene-sets and the last universal common ancestor , 2003, Nature Reviews Microbiology.

[42]  N. Pace,et al.  The genetic core of the universal ancestor. , 2003, Genome research.

[43]  Scott M Stagg,et al.  Modeling a minimal ribosome based on comparative sequence analysis. , 2002, Journal of molecular biology.

[44]  Frank Schluenzen,et al.  Antibiotics targeting ribosomes: crystallographic studies. , 2002, Current drug targets. Infectious disorders.

[45]  A. Yonath High-resolution structures of large ribosomal subunits from mesophilic eubacteria and halophilic archaea at various functional States. , 2002, Current protein & peptide science.

[46]  A. Yonath The search and its outcome: high-resolution structures of ribosomal particles from mesophilic, thermophilic, and halophilic bacteria at various functional states. , 2002, Annual review of biophysics and biomolecular structure.

[47]  Nan Yu,et al.  The Comparative RNA Web (CRW) Site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs , 2002, BMC Bioinformatics.

[48]  E. Westhof,et al.  Geometric nomenclature and classification of RNA base pairs. , 2001, RNA.

[49]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[50]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[51]  J. Parsch,et al.  Comparative sequence analysis and patterns of covariation in RNA secondary structures. , 2000, Genetics.

[52]  J. Bachellerie,et al.  Evolution of large subunit rRNA structure. The 3' terminal domain contains elements of secondary structure specific to major phylogenetic groups. , 1989, Biochimie.

[53]  J. Bachellerie,et al.  Comparisons of large subunit rRNAs reveal some eukaryote-specific elements of secondary structure. , 1987, Biochimie.

[54]  J. Erickson,et al.  Variation among human 28S ribosomal RNA genes. , 1985, Proceedings of the National Academy of Sciences of the United States of America.

[55]  A. Yonath,et al.  Characterization and crystallization of ribosomal particles from Halobacterium marismortui , 1985 .

[56]  S. Gerbi,et al.  Xenopus laevis 28S ribosomal RNA: a secondary structure model and its evolutionary and functional implications. , 1984, Nucleic acids research.

[57]  J. Bachellerie,et al.  The complete nucleotide sequence of mouse 28S rRNA gene. Implications for the process of size increase of the large subunit rRNA in higher eukaryotes. , 1984, Nucleic acids research.

[58]  R. Gourse,et al.  Sequence analysis of 28S ribosomal DNA from the amphibian Xenopus laevis. , 1983, Nucleic acids research.

[59]  A Yonath,et al.  Crystallization of Escherichia coli ribosomes , 1982, FEBS letters.

[60]  Lila L. Gatlin,et al.  Information theory and the living system , 1972 .

[61]  L. L. Gatlin,et al.  The information content of DNA. , 1966, Journal of theoretical biology.

[62]  ROY MARKHAM,et al.  Structure of Ribonucleic Acid , 1951, Nature.