Modeling RNA tertiary structure motifs by graph-grammars

A new approach, graph-grammars, to encode RNA tertiary structure patterns is introduced and exemplified with the classical sarcin–ricin motif. The sarcin–ricin motif is found in the stem of the crucial ribosomal loop E (also referred to as the sarcin–ricin loop), which is sensitive to the α-sarcin and ricin toxins. Here, we generate a graph-grammar for the sarcin-ricin motif and apply it to derive putative sequences that would fold in this motif. The biological relevance of the derived sequences is confirmed by a comparison with those found in known sarcin–ricin sites in an alignment of over 800 bacterial 23S ribosomal RNAs. The comparison raised alternative alignments in few sarcin–ricin sites, which were assessed using tertiary structure predictions and 3D modeling. The sarcin–ricin motif graph-grammar was built with indivisible nucleotide interaction cycles that were recently observed in structured RNAs. A comparison of the sequences and 3D structures of each cycle that constitute the sarcin–ricin motif gave us additional insights about RNA sequence–structure relationships. In particular, this analysis revealed the sequence space of an RNA motif depends on a structural context that goes beyond the single base pairing and base-stacking interactions.

[1]  Azriel Rosenfeld,et al.  Graph Grammars and Their Application to Computer Science , 1990, Lecture Notes in Computer Science.

[2]  Tamar Schlick,et al.  Modular RNA architecture revealed by computational analysis of existing pseudoknots and ribosomal RNAs , 2005, Nucleic acids research.

[3]  Anne Condon,et al.  A new algorithm for RNA secondary structure design. , 2004, Journal of molecular biology.

[4]  Philippe Vismara,et al.  Union of all the Minimum Cycle Bases of a Graph , 1997, Electron. J. Comb..

[5]  P. Gendron,et al.  Quantitative analysis of nucleic acid three-dimensional structures. , 2001, Journal of molecular biology.

[6]  F. Major,et al.  RNA canonical and non-canonical base pairing types: a recognition method and complete repertoire. , 2002, Nucleic acids research.

[7]  R. Gutell,et al.  The accuracy of ribosomal RNA comparative structure models. , 2002, Current opinion in structural biology.

[8]  Thomas Lengauer Bioinformatics : from genomes to therapies , 2007 .

[9]  R. Durbin,et al.  RNA sequence analysis using covariance models. , 1994, Nucleic acids research.

[10]  Eric Westhof,et al.  Recurrent structural RNA motifs, Isostericity Matrices and sequence alignments , 2005, Nucleic acids research.

[11]  Daniel Gautheret,et al.  The ERPIN server: an interface to profile-based RNA motif identification , 2004, Nucleic Acids Res..

[12]  I. Wool,et al.  The conformation of the sarcin/ricin loop from 28S ribosomal RNA. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[13]  H. Hansma,et al.  Building Programmable Jigsaw Puzzles with RNA , 2004, Science.

[14]  P. Moore,et al.  The sarcin/ricin loop, a modular RNA. , 1995, Journal of molecular biology.

[15]  Manfred Nagl,et al.  Graph-Grammars and Their Application to Computer Science , 1986, Lecture Notes in Computer Science.

[16]  C. Kundrot,et al.  RNA Tertiary Structure Mediation by Adenosine Platforms , 1996, Science.

[17]  Sean R. Eddy,et al.  Rfam: an RNA family database , 2003, Nucleic Acids Res..

[18]  Manfred Nagl,et al.  Graph-Grammars and Their Application to Computer Science , 1982, Lecture Notes in Computer Science.

[19]  E. Westhof,et al.  Geometric nomenclature and classification of RNA base pairs. , 2001, RNA.

[20]  Joseph Douglas Horton,et al.  A Polynomial-Time Algorithm to Find the Shortest Cycle Basis of a Graph , 1987, SIAM J. Comput..

[21]  Daniel Gautheret,et al.  Pattern searching/alignment with RNA primary and secondary structures: an effective descriptor for tRNA , 1990, Comput. Appl. Biosci..

[22]  D. Gautheret,et al.  Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles. , 2001, Journal of molecular biology.

[23]  E. Westhof,et al.  The building blocks and motifs of RNA architecture. , 2006, Current opinion in structural biology.

[24]  Simon de Givry,et al.  Searching RNA motifs and their intermolecular contacts with constraint networks , 2020 .

[25]  François Major,et al.  Automated extraction and classification of RNA tertiary structure cyclic motifs , 2006, Nucleic acids research.

[26]  Manfred Nagl Set theoretic approaches to graph grammars , 1986, Graph-Grammars and Their Application to Computer Science.

[27]  R. C. Underwood,et al.  Stochastic context-free grammars for tRNA modeling. , 1994, Nucleic acids research.

[28]  Zasha Weinberg,et al.  Exploiting conserved structure for faster annotation of non-coding RNAs without loss of accuracy , 2004, ISMB/ECCB.

[29]  Chris Jones,et al.  An integrated modeling environment based on attributed graphs and graph-grammars , 1993, Decis. Support Syst..

[30]  N. B. Leontisa,et al.  Motif prediction in ribosomal RNAs Lessons and prospects for automated motif prediction in homologous RNA molecules , 2002 .

[31]  Stephen R Holbrook,et al.  RNA structure: the long and the short of it , 2005, Current Opinion in Structural Biology.

[32]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[33]  G Lapalme,et al.  The combination of symbolic and numerical computation for three-dimensional modeling of RNA. , 1991, Science.

[34]  Alfred V. Aho,et al.  The Design and Analysis of Computer Algorithms , 1974 .

[35]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..