Geometric nomenclature and classification of RNA base pairs.

Non-Watson-Crick base pairs mediate specific interactions responsible for RNA-RNA self-assembly and RNA-protein recognition. An unambiguous and descriptive nomenclature with well-defined and nonoverlapping parameters is needed to communicate structural information about RNA base pairs concisely. The definitions should reflect the underlying molecular structures and interactions and thus facilitate automated annotation, classification, and comparison of new RNA structures. We propose a classification based on the observation that the planar edge-to-edge, hydrogen-bonding interactions between RNA bases involve one of three distinct edges: the Watson-Crick edge, the Hoogsteen edge, and the Sugar edge (which includes the 2'-OH and which has also been referred to as the Shallow-groove edge). Bases can interact in either of two orientations with respect to the glycosidic bonds, cis or trans relative to the hydrogen bonds. Combining the six possible pairings of the three edges with these two orientations gives rise to 12 basic geometric types with at least two H bonds connecting the bases. For each geometric type, the relative orientations of the strands can be easily deduced. High-resolution examples of 11 of the 12 geometries are presently available. Bifurcated pairs, in which a single exocyclic carbonyl or amino group of one base directly contacts the edge of a second base, and water-inserted pairs, in which single functional groups on each base interact directly, are intermediate between two of the standard geometries. The nomenclature facilitates the recognition of isosteric relationships among base pairs within each geometry and thus the recognition of recurrent three-dimensional motifs from comparison of homologous sequences. Graphical conventions are proposed for displaying non-Watson-Crick interactions on a secondary structure diagram. The utility of the classification in homology modeling of RNA tertiary motifs is illustrated.
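
The count of 12 geometric types follows from pairing the three edges without regard to order (six combinations) and allowing each pairing in either the cis or the trans orientation. A minimal sketch of that enumeration is given below; the names `EDGES`, `ORIENTATIONS`, and `geometric_types` are illustrative assumptions and are not part of the proposed nomenclature.

```python
from itertools import combinations_with_replacement

# The three interacting edges named in the abstract.
EDGES = ("Watson-Crick", "Hoogsteen", "Sugar")
# The two orientations of the glycosidic bonds relative to the hydrogen bonds.
ORIENTATIONS = ("cis", "trans")


def geometric_types():
    """Enumerate (orientation, edge, edge) triples for all basic geometric types.

    Edge pairings are unordered, so Watson-Crick/Hoogsteen and
    Hoogsteen/Watson-Crick are counted once (hypothetical helper,
    for illustration only).
    """
    return [
        (orientation, edge_a, edge_b)
        for edge_a, edge_b in combinations_with_replacement(EDGES, 2)
        for orientation in ORIENTATIONS
    ]


if __name__ == "__main__":
    for orientation, edge_a, edge_b in geometric_types():
        print(f"{orientation} {edge_a}/{edge_b}")
```

The sketch only enumerates the labels; it does not assign strand orientations or hydrogen-bonding patterns to any type.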
