Motif prediction in ribosomal RNAs Lessons and prospects for automated motif prediction in homologous RNA molecules

The traditional way to infer RNA secondary structure involves an iterative process of alignment and evaluation of covariation statistics between all positions possibly involved in basepairing. Watson–Crick basepairs typically show covariations that score well when examples of two or more possible basepairs occur. This is not necessarily the case for non-Watson–Crick basepairing geometries. For example, for sheared (trans Hoogsteen/Sugar edge) pairs, one base is highly conserved (always A or mostly A with some C or U), while the other can vary (G or A and sometimes C and U as well). RNA motifs consist of ordered, stacked arrays of non-Watson–Crick basepairs that in the secondary structure representation form hairpin or internal loops, multi-stem junctions, and even pseudoknots. Although RNA motifs occur recurrently and contribute in a modular fashion to RNA architecture, it is usually not apparent which bases interact and whether it is by edge-to-edge H-bonding or solely by stacking interactions. Using a modular sequence-analysis approach, recurrent motifs related to the sarcin–ricin loop of 23S RNA and to loop E from 5S RNA were predicted in universally conserved regions of the large ribosomal RNAs (16Sand 23S-like) before the publication of high-resolution, atomic-level structures of representative examples of 16S and 23S rRNA molecules in their native contexts. This provides the opportunity to evaluate the predictive power of motif-level sequence analysis, with the goal of automating the process for predicting RNA motifs in genomic sequences. The process of inferring structure from sequence by constructing accurate alignments is a circular one. The crucial link that allows a productive iteration of motif modeling and realignment is the comparison of the sequence variations for each putative pair with the corresponding isostericity matrix to determine which basepairs are consistent both with the sequence and the geometrical data. © 2002 Société française de biochimie et biologie moléculaire / Éditions scientifiques et médicales Elsevier SAS. All rights reserved

[1]  E. Westhof,et al.  Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. , 1990, Journal of molecular biology.

[2]  G. Varani,et al.  The conformation of loop E of eukaryotic 5S ribosomal RNA. , 1993, Biochemistry.

[3]  D Gautheret,et al.  A major family of motifs involving G.A mismatches in ribosomal RNA. , 1994, Journal of molecular biology.

[4]  Eric Westhof,et al.  The non-Watson-Crick base pairs and their associated isostericity matrices. , 2002, Nucleic acids research.

[5]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[6]  Frank Schluenzen,et al.  High Resolution Structure of the Large Ribosomal Subunit from a Mesophilic Eubacterium , 2001, Cell.

[7]  E Westhof,et al.  The 5S rRNA loop E: chemical probing and phylogenetic data versus crystal structure. , 1998, RNA.

[8]  E. Westhof,et al.  A common motif organizes the structure of multi-helix loops in 16 S and 23 S ribosomal RNAs. , 1998, Journal of molecular biology.

[9]  V. Ramakrishnan,et al.  Structure of the 30 S ribosomal subunit , 2022 .

[10]  R. Gutell,et al.  A story: unpaired adenosine bases in ribosomal RNAs. , 2000, Journal of molecular biology.

[11]  K. Flaherty,et al.  Three-dimensional structure of a hammerhead ribozyme , 1994, Nature.

[12]  Yves Van de Peer,et al.  The European Large Subunit Ribosomal RNA database , 2000, Nucleic Acids Res..

[13]  Jennifer A. Doudna,et al.  A universal mode of helix packing in RNA , 2001, Nature Structural Biology.

[14]  I. Wool,et al.  The conformation of the sarcin/ricin loop from 28S ribosomal RNA. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[15]  C. Kundrot,et al.  RNA Tertiary Structure Mediation by Adenosine Platforms , 1996, Science.

[16]  E. Westhof,et al.  Geometric nomenclature and classification of RNA base pairs. , 2001, RNA.

[17]  C. Vonrhein,et al.  Structure of the 30S ribosomal subunit , 2000, Nature.

[18]  A E Dahlberg,et al.  A conformational switch in Escherichia coli 16S ribosomal RNA during decoding of messenger RNA. , 1997, Science.

[19]  E Westhof,et al.  Conserved geometrical base-pairing patterns in RNA , 1998, Quarterly Reviews of Biophysics.

[20]  J. Frank,et al.  Major rearrangements in the 70S ribosomal 3D structure caused by a conformational switch in 16S ribosomal RNA , 1999, The EMBO journal.

[21]  E. Westhof,et al.  RNA folding: beyond Watson-Crick pairs. , 2000, Structure.

[22]  F. Schluenzen,et al.  Structure of Functionally Activated Small Ribosomal Subunit at 3.3 Å Resolution , 2000, Cell.