ARTS: alignment of RNA tertiary structures

MOTIVATION A fast growing number of non-coding RNAs have recently been discovered to play essential roles in many cellular processes. Similar to proteins, understanding the functions of these active RNAs requires methods for analyzing their tertiary structures. However, in contrast to the wide range of structure-based approaches available for proteins, there is still a lack of methods for studying RNA structures. RESULTS We present a new computational method named ARTS (alignment of RNA tertiary structures). The method compares two nucleic acid structures (RNAs or DNAs) and detects a-priori unknown common substructures. These substructures can be either large global folds containing hundreds and even thousands of nucleotides or small local tertiary motifs with at least two successive base pairs. To the best of our knowledge, this is the first method of this type. The method is highly-efficient and was used to conduct an all-against-all comparison of all the RNA structures currently available in the Protein Data Bank. AVAILABILITY The program, a web-server and supplementary information are available on http://bioinfo3d.cs.tau.ac.il/ARTS

[1]  John D. Westbrook,et al.  Tools for the automatic identification and classification of RNA base pairs , 2003, Nucleic Acids Res..

[2]  A. Pyle,et al.  Stepping through an RNA structure: A novel approach to conformational analysis. , 1998, Journal of molecular biology.

[3]  Michael Zuker,et al.  Algorithms and Thermodynamics for RNA Secondary Structure Prediction: A Practical Guide , 1999 .

[4]  Batey,et al.  Tertiary Motifs in RNA Structure and Folding. , 1999, Angewandte Chemie.

[5]  Samarjit Chakraborty,et al.  Computing Largest Common Point Sets under Approximate Congruence , 2000, ESA.

[6]  T. Pan,et al.  Domain structure of the ribozyme from eubacterial ribonuclease P. , 1996, RNA.

[7]  T. Cech,et al.  Structure of the Tetrahymena ribozyme: base triple sandwich and metal ion at the active site. , 2004, Molecular cell.

[8]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[9]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[10]  G. Storz An Expanding Universe of Noncoding RNAs , 2002, Science.

[11]  Kurt Mehlhorn,et al.  The LEDA Platform of Combinatorial and Geometric Computing , 1997, ICALP.

[12]  N. Pace,et al.  Identification of the universally conserved core of ribonuclease P RNA. , 1997, RNA.

[13]  Tatsuya Akutsu,et al.  Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots , 2000, Discret. Appl. Math..

[14]  M. Nomura,et al.  Translational regulation of the spc operon in Escherichia coli. Identification and structural analysis of the target site for S8 repressor protein. , 1988, Journal of molecular biology.

[15]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[16]  T. Schlick,et al.  Exploring the repertoire of RNA secondary motifs using graph theory; implications for RNA design. , 2003, Nucleic acids research.

[17]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[18]  N. Pace,et al.  Evolutionary perspective on the structure and function of ribonuclease P, a ribozyme , 1995, Journal of bacteriology.

[19]  A. S. Krasilnikov,et al.  Basis for Structural Diversity in Homologous RNAs , 2004, Science.

[20]  A. S. Krasilnikov,et al.  Crystal structure of the specificity domain of ribonuclease P , 2003, Nature.

[21]  S. Eddy Non–coding RNA genes and the modern RNA world , 2001, Nature Reviews Genetics.

[22]  Tatsuya Akutsu,et al.  Protein Structure Alignment Using Dynamic Programing and Iterative Improvement , 1996 .

[23]  Joseph S. B. Mitchell,et al.  Practical methods for approximate geometric pattern matching under rigid motions: (preliminary version) , 1994, SCG '94.

[24]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[25]  P. Moore,et al.  Structural motifs in RNA. , 1999, Annual review of biochemistry.

[26]  Anna Marie Pyle,et al.  RNA structure comparison, motif search and discovery using a reduced representation of RNA conformational space. , 2003, Nucleic acids research.

[27]  P. Gendron,et al.  Quantitative analysis of nucleic acid three-dimensional structures. , 2001, Journal of molecular biology.

[28]  Steven E. Brenner,et al.  SCOR: Structural Classification of RNA, version 2.0 , 2004, Nucleic Acids Res..

[29]  Jennifer A. Doudna,et al.  The chemical repertoire of natural ribozymes , 2002, Nature.

[30]  Jimin Wang,et al.  The structure of a ribosomal protein S8/spc operon mRNA complex. , 2004, RNA.

[31]  Boris Steipe,et al.  Metric properties of the root-mean-square deviation of vector sets , 1997 .

[32]  Anna Marie Pyle,et al.  The identification of novel RNA structural motifs using COMPADRES: an automated approach to structural discovery. , 2004, Nucleic acids research.

[33]  W. Olson,et al.  Overview of nucleic acid analysis programs. , 1999, Journal of biomolecular structure & dynamics.

[34]  Scott A. Strobel,et al.  Crystal structure of a self-splicing group I intron with both exons , 2004, Nature.

[35]  Peter Willett,et al.  Representation, searching and discovery of patterns of bases in complex RNA structures , 2003, J. Comput. Aided Mol. Des..

[36]  J. Doudna Structural genomics of RNA , 2000, Nature Structural Biology.

[37]  B. Golden,et al.  Crystal structure of a phage Twort group I ribozyme–product complex , 2005, Nature Structural &Molecular Biology.

[38]  Miron Livny,et al.  Condor: a distributed job scheduler , 2001 .

[39]  W. Olson,et al.  Resolving the discrepancies among nucleic acid conformational analyses. , 1999, Journal of molecular biology.

[40]  Alon Itai,et al.  Geometry Helps in Bottleneck Matching and Related Problems , 2001, Algorithmica.

[41]  Leonidas J. Guibas,et al.  Discrete Geometric Shapes: Matching, Interpolation, and Approximation , 2000, Handbook of Computational Geometry.