The non-Watson-Crick base pairs and their associated isostericity matrices.

RNA molecules exhibit complex structures in which a large fraction of the bases engage in non-Watson-Crick base pairing, forming motifs that mediate long-range RNA-RNA interactions and create binding sites for proteins and small molecule ligands. The rapidly growing number of three-dimensional RNA structures at atomic resolution requires that databases contain the annotation of such base pairs. An unambiguous and descriptive nomenclature was proposed recently in which RNA base pairs were classified by the base edges participating in the interaction (Watson-Crick, Hoogsteen/CH or sugar edge) and the orientation of the glycosidic bonds relative to the hydrogen bonds (cis or trans). Twelve basic geometric families were identified and all 12 have been observed in crystal structures. For each base pairing family, we present here the 4 x 4 'isostericity matrices' summarizing the geometric relationships between the 16 pairwise combinations of the four standard bases, A, C, G and U. Whenever available, a representative example of each observed base pair from X-ray crystal structures (3.0 A resolution or better) is provided or, otherwise, theoretically plausible models. This format makes apparent the recurrent geometric patterns that are observed and helps identify isosteric pairs that co-vary or interchange in sequences of homologous molecules while maintaining conserved three-dimensional motifs.

[1]  T. Steitz,et al.  Metals, Motifs, and Recognition in the Crystal Structure of a 5S rRNA Domain , 1997, Cell.

[2]  E Westhof,et al.  Crystal structure of paromomycin docked into the eubacterial ribosomal decoding A site. , 2001, Structure.

[3]  G. Stormo,et al.  Identifying constraints on the higher-order structure of RNA: continued development and application of comparative sequence analysis methods. , 1992, Nucleic acids research.

[4]  K. Hoogsteen,et al.  The crystal and molecular structure of a hydrogen-bonded complex between 1-methylthymine and 9-methyladenine , 1963 .

[5]  J. Šponer,et al.  Molecular dynamics of the frame-shifting pseudoknot from beet western yellows virus: the role of non-Watson-Crick base-pairing, ordered hydration, cation binding and base mutations on stability and unfolding. , 2001, Journal of molecular biology.

[6]  Crystal structure of a 14 bp RNA duplex with non-symmetrical tandem GxU wobble base pairs. , 1998, Nucleic acids research.

[7]  F. Schluenzen,et al.  Structural basis for the interaction of antibiotics with the peptidyl transferase centre in eubacteria , 2001, Nature.

[8]  T. Cech,et al.  A preorganized active site in the crystal structure of the Tetrahymena ribozyme. , 1998, Science.

[9]  M. Sundaralingam,et al.  The structure of r(UUCGCG) has a 5′-UU-overhang exhibiting Hoogsteen-like trans U•U base pairs , 1996, Nature Structural Biology.

[10]  A. Klug,et al.  The crystal structure of an AII-RNAhammerhead ribozyme: A proposed mechanism for RNA catalytic cleavage , 1995, Cell.

[11]  E. Lattman,et al.  Crystal structure of a conserved ribosomal protein-RNA complex. , 1999, Science.

[12]  P. Moore,et al.  The sarcin/ricin loop, a modular RNA. , 1995, Journal of molecular biology.

[13]  S Thirup,et al.  Crystal Structure of the Ternary Complex of Phe-tRNAPhe, EF-Tu, and a GTP Analog , 1995, Science.

[14]  Gautam R. Desiraju,et al.  The C-h···o hydrogen bond:  structural implications and supramolecular design. , 1996, Accounts of chemical research.

[15]  E Westhof,et al.  Restrained refinement of two crystalline forms of yeast aspartic acid and phenylalanine transfer RNA crystals. , 1987, Acta crystallographica. Section A, Foundations of crystallography.

[16]  E. Westhof,et al.  Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. , 1990, Journal of molecular biology.

[17]  Charles Wilson,et al.  The structural basis for molecular recognition by the vitamin B 12 RNA aptamer , 2000, Nature Structural Biology.

[18]  J. Ebel,et al.  Probing the structure of RNAs in solution. , 1987, Nucleic acids research.

[19]  R. Darnell,et al.  Sequence-Specific RNA Binding by a Nova KH Domain Implications for Paraneoplastic Disease and the Fragile X Syndrome , 2000, Cell.

[20]  A'-form RNA double helix in the single crystal structure of r(UGAGCUUCGGCUC). , 1998, Nucleic acids research.

[21]  S R Holbrook,et al.  A curved RNA helix incorporating an internal loop with G.A and A.A non-Watson-Crick base pairing. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[22]  G. Varani,et al.  Structure of an unusually stable RNA hairpin. , 1991, Biochemistry.

[23]  T. Steitz,et al.  Crystal structure of the ribosomal RNA domain essential for binding elongation factors. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[24]  J. Berger,et al.  Minor groove RNA triplex in the crystal structure of a ribosomal frameshifting viral pseudoknot , 1999, Nature Structural Biology.

[25]  D. Ecker,et al.  RNAMotif, an RNA secondary structure definition and search algorithm. , 2001, Nucleic acids research.

[26]  T. Steitz,et al.  Crystal structures of three misacylating mutants of Escherichia coli glutaminyl-tRNA synthetase complexed with tRNA(Gln) and ATP. , 1996, Biochemistry.

[27]  V. Ramakrishnan,et al.  Functional insights from the structure of the 30S ribosomal subunit and its interactions with antibiotics , 2000, Nature.

[28]  S Thirup,et al.  The crystal structure of Cys-tRNACys-EF-Tu-GDPNP reveals general and specific features in the ternary complex and in tRNA. , 1999, Structure.

[29]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[30]  François Michel,et al.  The guanosine binding site of the Tetrahymena ribozyme , 1989, Nature.

[31]  J. SantaLucia,et al.  In vivo determination of RNA structure-function relationships: analysis of the 790 loop in ribosomal RNA. , 1997, Journal of molecular biology.

[32]  E Westhof,et al.  The 5S rRNA loop E: chemical probing and phylogenetic data versus crystal structure. , 1998, RNA.

[33]  G. Varani,et al.  The conformation of loop E of eukaryotic 5S ribosomal RNA. , 1993, Biochemistry.

[34]  E. Westhof,et al.  A common motif organizes the structure of multi-helix loops in 16 S and 23 S ribosomal RNAs. , 1998, Journal of molecular biology.

[35]  J. Doudna,et al.  Crystal structure of the ribonucleoprotein core of the signal recognition particle. , 2000, Science.

[36]  C. Kundrot,et al.  Crystal Structure of a Group I Ribozyme Domain: Principles of RNA Packing , 1996, Science.

[37]  E Westhof,et al.  Hydration of RNA base pairs. , 1998, Journal of biomolecular structure & dynamics.

[38]  E. Westhof,et al.  Geometric nomenclature and classification of RNA base pairs. , 2001, RNA.

[39]  Thomas A. Steitz,et al.  RNA tertiary interactions in the large ribosomal subunit: The A-minor motif , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[40]  E Westhof,et al.  On the wobble GoU and related pairs. , 2000, RNA.

[41]  H. Heus,et al.  Structural features that give rise to the unusual stability of RNA hairpins containing GNRA loops. , 1991, Science.

[42]  J. McCutcheon,et al.  A Detailed View of a Ribosomal Active Site The Structure of the L11–RNA Complex , 1999, Cell.

[43]  S. Lietzke,et al.  The structure of an RNA dodecamer shows how tandem U-U base pairs increase the range of stable RNA structures and the diversity of recognition sites. , 1996, Structure.

[44]  D Gautheret,et al.  Inferring the conformation of RNA base pairs and triples from patterns of sequence variation. , 1997, Nucleic acids research.

[45]  E Westhof,et al.  Conserved geometrical base-pairing patterns in RNA , 1998, Quarterly Reviews of Biophysics.

[46]  C. Wilson,et al.  The 1.3 A crystal structure of a biotin-binding pseudoknot and the basis for RNA molecular recognition. , 2000, Journal of molecular biology.

[47]  J. Wedekind,et al.  Crystal structure of a lead-dependent ribozyme revealing metal binding sites relevant to catalysis , 1999, Nature Structural Biology.

[48]  M. Sundaralingam,et al.  Crystal structure of an RNA 16-mer duplex R(GCAGAGUUAAAUCUGC)2 with nonadjacent G(syn).A+(anti) mispairs. , 1999, Biochemistry.

[49]  D. Turner,et al.  Structure of (rGGCGAGCC)2 in solution from NMR and restrained molecular dynamics. , 1993, Biochemistry.

[50]  E. Westhof Westhof's rule , 1992, Nature.

[51]  E. L. Holbrook,et al.  Structure of an RNA internal loop consisting of tandem C-A+ base pairs. , 1998, Biochemistry.

[52]  Frédéric H.-T. Allain,et al.  Solution structure of the loop B domain from the hairpin ribozyme , 1999, Nature Structural Biology.

[53]  P. Moore,et al.  Structural motifs in RNA. , 1999, Annual review of biochemistry.

[54]  D Gautheret,et al.  Identification of base-triples in RNA using comparative sequence analysis. , 1995, Journal of molecular biology.

[55]  S. Strobel,et al.  Chemical probing of RNA by nucleotide analog interference mapping. , 2000, Methods in enzymology.

[56]  G. Desiraju The C-H×××O Hydrogen Bond: Structural Implications and Supramolecular Design , 1996 .

[57]  T. Steitz,et al.  The complete atomic structure of the large ribosomal subunit at 2.4 A resolution. , 2000, Science.

[58]  A. Ferré-D’Amaré,et al.  Crystal structure of a hepatitis delta virus ribozyme , 1998, Nature.

[59]  C R Woese,et al.  Architecture of ribosomal RNA: constraints on the sequence of "tetra-loops". , 1990, Proceedings of the National Academy of Sciences of the United States of America.