RNA-Puzzles toolkit: a computational resource of RNA 3D structure benchmark datasets, structure manipulation, and evaluation tools

Abstract Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods during the succeeding challenges of RNA-Puzzles, a community-wide effort on the assessment of blind prediction of RNA tertiary structures. The RNA-Puzzles contest has shown, among others, that the development and validation of computational methods for RNA fold prediction strongly depend on the benchmark datasets and the structure comparison algorithms. Yet, there has been no systematic benchmark set or decoy structures available for the 3D structure prediction of RNA, hindering the standardization of comparative tests in the modeling of RNA structure. Furthermore, there has not been a unified set of tools that allows deep and complete RNA structure analysis, and at the same time, that is easy to use. Here, we present RNA-Puzzles toolkit, a computational resource including (i) decoy sets generated by different RNA 3D structure prediction methods (raw, for-evaluation and standardized datasets), (ii) 3D structure normalization, analysis, manipulation, visualization tools (RNA_format, RNA_normalizer, rna-tools) and (iii) 3D structure comparison metric tools (RNAQUA, MCQ4Structures). This resource provides a full list of computational tools as well as a standard RNA 3D structure prediction assessment protocol for the community.

[1]  Pu Gao,et al.  Pistol ribozyme adopts a pseudoknot fold facilitating site-specific in-line cleavage , 2016, Nature chemical biology.

[2]  Eric Westhof,et al.  RNA Structure: Advances and Assessment of 3D Structure Prediction. , 2017, Annual review of biophysics.

[3]  Janusz M. Bujnicki,et al.  RNA 3D structure prediction guided by independent folding of homologous sequences , 2019, BMC Bioinformatics.

[4]  Travis E. Oliphant,et al.  Guide to NumPy , 2015 .

[5]  Adam J. Riesselman,et al.  3D RNA and Functional Interactions from Evolutionary Couplings , 2015, Cell.

[6]  E. Westhof,et al.  Three-dimensional model of Escherichia coli ribosomal 5 S RNA as deduced from structure probing in solution and computer modeling. , 1991, Journal of molecular biology.

[7]  Yi Xue,et al.  Structural and Dynamic Basis for Low-Affinity, High-Selectivity Binding of L-Glutamine by the Glutamine Riboswitch. , 2015, Cell reports.

[8]  Tatsuya Akutsu,et al.  IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming , 2011, Bioinform..

[9]  Eric Westhof,et al.  Sequence-based identification of 3D structural modules in RNA with RMDetect , 2011, Nature Methods.

[10]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[11]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..

[12]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[13]  H. Noller,et al.  Secondary structure of 16S ribosomal RNA. , 1981, Science.

[14]  Adelene Y. L. Sim,et al.  Fully differentiable coarse-grained and all-atom knowledge-based potentials for RNA structure evaluation. , 2011, RNA.

[15]  Tomasz Zok,et al.  LCS-TA to identify similar fragments in RNA 3D structures , 2017, BMC Bioinformatics.

[16]  J. Bujnicki,et al.  ModeRNA: a tool for comparative modeling of RNA 3D structure , 2011, Nucleic acids research.

[17]  Arindam Basu Reproducible research with jupyter notebooks , 2017 .

[18]  Janusz M Bujnicki,et al.  Computational modeling of RNA 3D structures and interactions. , 2016, Current opinion in structural biology.

[19]  Katarzyna J Purzycka,et al.  RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme. , 2017, RNA.

[20]  J. Sarzynska,et al.  Bioinformatics Study of Structural Patterns in Plant MicroRNA Precursors , 2017, BioMed research international.

[21]  Janusz M Bujnicki,et al.  ClaRNA: a classifier of contacts in RNA 3D structures based on a comparative analysis of various classification schemes , 2014, Nucleic acids research.

[22]  Feng Ding,et al.  On the significance of an RNA tertiary structure prediction. , 2010, RNA.

[23]  Tomasz Zok,et al.  RNAfitme: a webserver for modeling nucleobase and nucleoside residue conformation in fixed-backbone RNA structures , 2018, BMC Bioinformatics.

[24]  Rhiju Das,et al.  FARFAR2: Improved De Novo Rosetta Prediction of Complex Global RNA Folds. , 2020, Structure.

[25]  Jinwei Zhang,et al.  Co-crystal structure of a T-box riboswitch stem I domain in complex with its cognate tRNA , 2013, Nature.

[26]  A. Serganov,et al.  Structural insights into ligand binding and gene expression control by an adenosylcobalamin riboswitch , 2012, Nature Structural &Molecular Biology.

[27]  Robert D. Finn,et al.  Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families , 2017, Nucleic Acids Res..

[28]  Jeffrey Wilusz,et al.  The Structural Basis of Pathogenic Subgenomic Flavivirus RNA (sfRNA) Production , 2014, Science.

[29]  J. Bujnicki,et al.  SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction , 2015, Nucleic acids research.

[30]  Jinwei Zhang,et al.  YbxF and YlxQ are bacterial homologs of L7Ae and bind K-turns but not K-loops. , 2012, RNA.

[31]  Jacek Blazewicz,et al.  RNAssess—a web server for quality assessment of RNA 3D structures , 2015, Nucleic Acids Res..

[32]  Eric Westhof,et al.  New metrics for comparing and assessing discrepancies between RNA 3D structures and models. , 2009, RNA.

[33]  F. Major,et al.  The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data , 2008, Nature.

[34]  Peter F. Stadler,et al.  Pseudoknots in RNA folding landscapes , 2015, Bioinform..

[35]  Feng Ding,et al.  iFoldRNA: three-dimensional RNA structure prediction and folding , 2008, Bioinform..

[36]  Jack Snoeyink,et al.  Nucleic Acids Research Advance Access published April 22, 2007 MolProbity: all-atom contacts and structure validation for proteins and nucleic acids , 2007 .

[37]  Rachel E Rigsby,et al.  Using the PyMOL application to reinforce visual understanding of protein structure , 2016, Biochemistry and molecular biology education : a bimonthly publication of the International Union of Biochemistry and Molecular Biology.

[38]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—Round XII , 2018, Proteins.

[39]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[40]  Janusz M. Bujnicki,et al.  RNArchitecture: a database and a classification system of RNA families, with a focus on structural information , 2017, Nucleic Acids Res..

[41]  Xiaojun Xu,et al.  Predicting RNA Structure with Vfold. , 2017, Methods in molecular biology.

[42]  E. Westhof,et al.  Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. , 1990, Journal of molecular biology.

[43]  Marta Szachniuk,et al.  RNApolis: Computational Platform for RNA Structure Analysis , 2019, Foundations of Computing and Decision Sciences.

[44]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[45]  Jun Li,et al.  RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks , 2018, PLoS Comput. Biol..

[46]  Tomasz Zok,et al.  MCQ4Structures to compute similarity of molecule structures , 2013, Central European Journal of Operations Research.

[47]  Craig L. Zirbel,et al.  Nonredundant 3D Structure Datasets for RNA Knowledge Extraction and Benchmarking , 2012 .

[48]  A. I. Yakimchik Jupyter Notebook: a system for interactive scientific computing , 2019 .

[49]  D. Lilley,et al.  The structure of a nucleolytic ribozyme that employs a catalytic metal ion , 2017, Nature chemical biology.

[50]  M. Zalis,et al.  Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. , 1999, Journal of molecular biology.

[51]  Janusz M. Bujnicki,et al.  QRNAS: software tool for refinement of nucleic acid structures , 2019, BMC Structural Biology.

[52]  P. Gendron,et al.  Quantitative analysis of nucleic acid three-dimensional structures. , 2001, Journal of molecular biology.

[53]  Jacek Blazewicz,et al.  RNAlyzer—novel approach for quality analysis of RNA structural models , 2013, Nucleic acids research.

[54]  James W. Brown,et al.  Long-range structure in ribonuclease P RNA. , 1991, Science.

[55]  Feng Ding,et al.  RNA-Puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction. , 2012, RNA.

[56]  Luc Jaeger,et al.  Blind prediction of noncanonical RNA structure at atomic accuracy , 2017, Science Advances.

[57]  Thomas Villmann,et al.  Building the library of RNA 3D nucleotide conformations using the clustering approach , 2015, Int. J. Appl. Math. Comput. Sci..

[58]  Leszek Rychlewski,et al.  LiveBench‐8: The large‐scale, continuous assessment of automated protein structure prediction , 2005, Protein science : a publication of the Protein Society.

[59]  Phoebe A Rice,et al.  Crystal Structure of the VS ribozyme , 2015, Nature chemical biology.

[60]  M. Levitt Detailed Molecular Model for Transfer Ribonucleic Acid , 1969, Nature.

[61]  Adam Zemla,et al.  Critical assessment of methods of protein structure prediction (CASP)‐round V , 2005, Proteins.

[62]  Janusz M. Bujnicki,et al.  SimRNAweb: a web server for RNA 3D structure modeling with optional restraints , 2016, Nucleic Acids Res..

[63]  T. Hermann,et al.  Self-assembling RNA square , 2011, Proceedings of the National Academy of Sciences.

[64]  Robert M. Hanson,et al.  DSSR-enhanced visualization of nucleic acid structures in Jmol , 2017, Nucleic Acids Res..

[65]  Namhee Kim,et al.  Network Theory Tools for RNA Modeling. , 2013, WSEAS transactions on mathematics.

[66]  Janusz M. Bujnicki,et al.  NPDock: a web server for protein–nucleic acid docking , 2015, Nucleic Acids Res..

[67]  E. Westhof,et al.  Geometric nomenclature and classification of RNA base pairs. , 2001, RNA.

[68]  Feng Ding,et al.  RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures , 2015, RNA.

[69]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[70]  Marc A. Martí-Renom,et al.  All-atom knowledge-based potential for RNA structure prediction and assessment , 2011, Bioinform..

[71]  I. Hofacker,et al.  Predicting RNA 3D structure using a coarse-grain helix-centered model , 2015, RNA.

[72]  Daniel Lai,et al.  R-chie: a web server and R package for visualizing RNA secondary structures , 2012, Nucleic acids research.

[73]  Rhiju Das,et al.  Modeling complex RNA tertiary folds with Rosetta. , 2015, Methods in enzymology.

[74]  Tomasz Zok,et al.  New functionality of RNAComposer: an application to shape the axis of miR160 precursor structure. , 2016, Acta biochimica Polonica.

[75]  E Westhof,et al.  Computer modeling from solution data of spinach chloroplast and of Xenopus laevis somatic and oocyte 5 S rRNAs. , 1989, Journal of molecular biology.

[76]  Sean R. Eddy,et al.  Infernal 1.1: 100-fold faster RNA homology searches , 2013, Bioinform..

[77]  Marcin Magnus,et al.  Rearrangements within the U6 snRNA Core during the Transition between the Two Catalytic Steps of Splicing. , 2019, Molecular cell.

[78]  Andrew M. Watkins,et al.  FARFAR2: Improved de novo Rosetta prediction of complex global RNA folds , 2019, bioRxiv.