Analysis and classification of RNA tertiary structures.

There is a fast-growing interest in noncoding RNA transcripts. These transcripts are not translated into proteins, but play essential roles in many cellular and pathological processes. Recent efforts toward comprehension of their function have led to a substantial increase in both the number and the size of solved RNA structures. With the aim of addressing questions relating to RNA structural diversity, we examined RNA conservation at three structural levels: primary, secondary, and tertiary structure. Additionally, we developed an automated method for classifying RNA structures based on spatial (three-dimensional [3D]) similarity. Applying the method to all solved RNA structures resulted in a classified database of RNA tertiary structures (DARTS). DARTS comprises 1333 solved RNA structures classified into 94 clusters. The classification is hierarchical, reflecting the structural relationships between and within clusters. We also developed an application for searching DARTS with a new structure. The search is fast, and its performance was successfully tested on all RNA structures solved since the creation of DARTS. A user-friendly interface for both the database and the search application is available online. We show intracluster and intercluster similarities in DARTS and demonstrate the usefulness of the search application. The analysis reveals the current structural repertoire of RNA and exposes common global folds and local tertiary motifs. Further study of these conserved substructures may suggest possible RNA domains and building blocks. This should be beneficial for structure prediction and for gaining insights into structure-function relationships.
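
As a rough illustration of what a hierarchical, similarity-based classification can look like, the sketch below groups structures by single-linkage clustering at two dissimilarity cutoffs. It is not the DARTS procedure, which the abstract does not specify: the structure names, the toy scores, the cutoffs, and the functions `single_linkage_clusters` and `hierarchy` are all hypothetical, and a real pipeline would derive the scores from pairwise 3D structural alignments.

```python
from itertools import combinations

def single_linkage_clusters(names, distance, threshold):
    """Group items connected by pairwise distances <= threshold (single linkage)."""
    parent = {n: n for n in names}

    def find(x):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(names, 2):
        if distance(a, b) <= threshold:
            parent[find(a)] = find(b)  # merge the two clusters

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())

def hierarchy(names, distance, thresholds):
    """Cluster at several cutoffs; looser cutoffs merge the tighter clusters."""
    return {t: single_linkage_clusters(names, distance, t) for t in thresholds}

# Hypothetical structural dissimilarity scores (0 = identical), for illustration only.
NAMES = ["tRNA_a", "tRNA_b", "ribozyme_a", "ribozyme_b", "hairpin"]
TOY_SCORES = {
    frozenset(("tRNA_a", "tRNA_b")): 0.1,
    frozenset(("ribozyme_a", "ribozyme_b")): 0.2,
    frozenset(("tRNA_a", "ribozyme_a")): 0.6,
    frozenset(("tRNA_a", "ribozyme_b")): 0.6,
    frozenset(("tRNA_b", "ribozyme_a")): 0.6,
    frozenset(("tRNA_b", "ribozyme_b")): 0.6,
}

def toy_distance(a, b):
    # Pairs not listed above (all pairs involving "hairpin") are treated as dissimilar.
    return TOY_SCORES.get(frozenset((a, b)), 0.9)

levels = hierarchy(NAMES, toy_distance, thresholds=(0.25, 0.7))
for cutoff, clusters in levels.items():
    print(cutoff, clusters)
```

Because single linkage only ever merges clusters as the cutoff loosens, the clusters found at the tighter cutoff are nested inside those found at the looser one, which is the sense in which such a classification is hierarchical.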
