Quantifying the relationship between sequence and three-dimensional structure conservation in RNA

BackgroundIn recent years, the number of available RNA structures has rapidly grown reflecting the increased interest on RNA biology. Similarly to the studies carried out two decades ago for proteins, which gave the fundamental grounds for developing comparative protein structure prediction methods, we are now able to quantify the relationship between sequence and structure conservation in RNA.ResultsHere we introduce an all-against-all sequence- and three-dimensional (3D) structure-based comparison of a representative set of RNA structures, which have allowed us to quantitatively confirm that: (i) there is a measurable relationship between sequence and structure conservation that weakens for alignments resulting in below 60% sequence identity, (ii) evolution tends to conserve more RNA structure than sequence, and (iii) there is a twilight zone for RNA homology detection.DiscussionThe computational analysis here presented quantitatively describes the relationship between sequence and structure for RNA molecules and defines a twilight zone region for detecting RNA homology. Our work could represent the theoretical basis and limitations for future developments in comparative RNA 3D structure prediction.

[1]  T. Earnest,et al.  Crystal Structure of the Ribosome at 5.5 Å Resolution , 2001, Science.

[2]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[3]  Matthew W Vaughn,et al.  It's a Small RNA World, After All , 2005, Science.

[4]  F. Major,et al.  The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data , 2008, Nature.

[5]  B. Rost Twilight zone of protein sequence alignments. , 1999, Protein engineering.

[6]  Arne Elofsson,et al.  MaxSub: an automated measure for the assessment of protein structure prediction quality , 2000, Bioinform..

[7]  E. Westhof,et al.  Analysis of RNA motifs. , 2003, Current opinion in structural biology.

[8]  A M Lesk,et al.  The evolution of protein structures. , 1987, Cold Spring Harbor symposia on quantitative biology.

[9]  Makio Tamura,et al.  Sequence and structural conservation in RNA ribose zippers. , 2002, Journal of molecular biology.

[10]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[11]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[12]  Quincy Teng,et al.  Structural Biology , 2013, Springer US.

[13]  I. Tinoco,et al.  RNA folding and unfolding. , 2004, Current opinion in structural biology.

[14]  Eric Westhof,et al.  Evolution of RNA Architecture , 2004, Science.

[15]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[16]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[17]  François Major,et al.  A comparative analysis of the triloops in all high-resolution RNA structures reveals sequence structure relationships. , 2007, RNA.

[18]  Osvaldo Olmea,et al.  MAMMOTH (Matching molecular models obtained from theory): An automated method for model comparison , 2002, Protein science : a publication of the Protein Society.

[19]  A. Godzik,et al.  Computational protein function prediction: Are we making progress? , 2007, Cellular and Molecular Life Sciences.

[20]  Ruth Nussinov,et al.  ARTS: alignment of RNA tertiary structures , 2005, ECCB/JBI.

[21]  Sean R. Eddy,et al.  Infernal 1.0: inference of RNA alignments , 2009, Bioinform..

[22]  D. W. Staple,et al.  Open access, freely available online Primer Pseudoknots: RNA Structures with Diverse Functions , 2022 .

[23]  Scott M Stagg,et al.  Modeling a minimal ribosome based on comparative sequence analysis. , 2002, Journal of molecular biology.

[24]  Y Van de Peer,et al.  Distribution of substitution rates and location of insertion sites in the tertiary structure of ribosomal RNA. , 2001, Nucleic acids research.

[25]  M. Levitt Detailed Molecular Model for Transfer Ribonucleic Acid , 1969, Nature.

[26]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[27]  H. Wolfson,et al.  Analysis and classification of RNA tertiary structures. , 2008, RNA.

[28]  Peter Clote,et al.  DIAL: a web server for the pairwise alignment of two RNA three-dimensional structures using nucleotide, dihedral angle and base-pairing similarities , 2007, Nucleic Acids Res..

[29]  Marc A. Martí-Renom,et al.  RNA structure alignment by a unit-vector approach , 2008, ECCB.

[30]  Dirk Walther,et al.  Sequence–structure relationships in RNA loops: establishing the basis for loop homology modeling , 2009, Nucleic acids research.

[31]  Robert D. Finn,et al.  Rfam: updates to the RNA families database , 2008, Nucleic Acids Res..

[32]  Jon M. Kleinberg,et al.  Fast Detection of Common Geometric Substructure in Proteins , 1999, J. Comput. Biol..

[33]  Magdalena A. Jonikas,et al.  Coarse-grained modeling of large RNA molecules with knowledge-based potentials and structural filters. , 2009, RNA.

[34]  Steven E. Brenner,et al.  SCOR: Structural Classification of RNA, version 2.0 , 2004, Nucleic Acids Res..

[35]  S. Brunak,et al.  RNA secondary structure and squence conservation in C1 region of human immunodeficiency virus type 1 env gene. , 2002, AIDS research and human retroviruses.

[36]  Jennifer A. Doudna,et al.  The chemical repertoire of natural ribozymes , 2002, Nature.

[37]  Sean R. Eddy,et al.  Infernal 1.0: inference of RNA alignments , 2009, Bioinform..

[38]  A. Sali,et al.  Comparative protein structure modeling of genes and genomes. , 2000, Annual review of biophysics and biomolecular structure.

[39]  C. Vonrhein,et al.  Structure of the 30S ribosomal subunit , 2000, Nature.

[40]  C. Sander,et al.  Dali: a network tool for protein structure comparison. , 1995, Trends in biochemical sciences.

[41]  Temple F. Smith,et al.  The origin and evolution of the ribosome , 2008, Biology Direct.

[42]  N. Pace,et al.  The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme , 1983, Cell.

[43]  T. Cech,et al.  Self-splicing RNA: Autoexcision and autocyclization of the ribosomal RNA intervening sequence of tetrahymena , 1982, Cell.

[44]  S. Holbrook Structural principles from large RNAs. , 2008, Annual review of biophysics.

[45]  G. Caetano-Anollés Tracing the evolution of RNA structure in ribosomes. , 2002, Nucleic acids research.

[46]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[47]  Chris Sander,et al.  Completeness in structural genomics , 2001, Nature Structural Biology.

[48]  Anna Marie Pyle,et al.  Evaluating and learning from RNA pseudotorsional space: quantitative validation of a reduced representation for RNA structure. , 2007, Journal of molecular biology.

[49]  Lior Pachter,et al.  Specific alignment of structured RNA: stochastic grammars and sequence annealing , 2008, Bioinform..

[50]  Eric Westhof,et al.  Structural biology. Evolution of RNA architecture. , 2004, Science.

[51]  Harry F Noller,et al.  RNA Structure: Reading the Ribosome , 2005, Science.

[52]  V. Ramakrishnan,et al.  Structure of the 30 S ribosomal subunit , 2022 .

[53]  Feng Ding,et al.  iFoldRNA: three-dimensional RNA structure prediction and folding , 2008, Bioinform..

[54]  Tao Pan,et al.  RNA folding: models and perspectives. , 2003, Current opinion in structural biology.

[55]  G. Rose,et al.  RNABase: an annotated database of RNA structures , 2003, Nucleic Acids Res..

[56]  W. B. Arendall,et al.  RNA backbone is rotameric , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[57]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[58]  John P. Overington,et al.  Derivation of rules for comparative protein modeling from a database of protein structure alignments , 1994, Protein science : a publication of the Protein Society.

[59]  L. Chew,et al.  Unit‐vector RMS (URMS) as a tool to analyze molecular dynamics trajectories , 1999, Proteins.

[60]  Emidio Capriotti,et al.  Computational RNA Structure Prediction , 2008 .

[61]  Haruki Nakamura,et al.  The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data , 2006, Nucleic Acids Res..

[62]  Marc A. Martí-Renom,et al.  SARA: a server for function annotation of RNA structures , 2009, Nucleic Acids Res..

[63]  Eric Westhof,et al.  The Dynamic Landscapes of RNA Architecture , 2009, Cell.

[64]  A. Lesk,et al.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. , 1980, Journal of molecular biology.