RNA folding with hard and soft constraints

Background: A large class of RNA secondary structure prediction programs combines an elaborate energy model grounded in extensive thermodynamic measurements with exact dynamic programming algorithms. External experimental evidence can in principle be incorporated by means of hard constraints that restrict the search space or by means of soft constraints that distort the energy model. In particular, recent advances in coupling chemical and enzymatic probing with sequencing techniques, as well as comparative approaches, provide an increasing amount of experimental data to be combined with secondary structure prediction.

Results: In response to the growing need for a versatile and user-friendly way to include external evidence in diverse flavors of RNA secondary structure prediction tools, we implemented a generic layer of constraint handling in the ViennaRNA Package. It makes explicit use of the conceptual separation between the “folding grammar”, which defines the search space, and the actual energy evaluation. This separation allows constraints to be interleaved in a natural way between the recursion steps and the evaluation of the standard energy function.

Conclusions: The extension of the ViennaRNA Package provides a generic way to include diverse types of constraints in RNA folding algorithms. The computational overhead incurred is negligible in practice. A wide variety of application scenarios can be accommodated by the new framework, including the incorporation of structure probing data, non-standard base pairs and chemical modifications, as well as structure-dependent ligand binding.
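The distinction between hard and soft constraints can be illustrated with a toy example. The sketch below is not the ViennaRNA implementation or its API. It assumes a simplified Nussinov-style recursion with invented pair energies, and the names `fold`, `must_be_unpaired` and `unpaired_pseudo_energy` are hypothetical. A hard constraint removes candidate pairs from the search space before any energy is evaluated, while a soft constraint adds a pseudo-energy term to the score of the structures that remain.

```python
from functools import lru_cache

# Invented stacking-free pair energies (arbitrary units), for illustration only.
PAIR_ENERGY = {
    ("G", "C"): -3.0, ("C", "G"): -3.0,
    ("A", "U"): -2.0, ("U", "A"): -2.0,
    ("G", "U"): -1.0, ("U", "G"): -1.0,
}
MIN_LOOP = 3  # minimum number of unpaired bases enclosed by a pair


def fold(seq, must_be_unpaired=frozenset(), unpaired_pseudo_energy=None):
    """Return the minimum energy of seq in the toy model.

    must_be_unpaired: hard constraint, positions that may not pair.
    unpaired_pseudo_energy: soft constraint, energy added per position
        whenever that position is left unpaired.
    """
    n = len(seq)
    pseudo = unpaired_pseudo_energy or [0.0] * n

    @lru_cache(maxsize=None)
    def best(i, j):
        if i > j:
            return 0.0
        # Grammar case 1: position i is unpaired (soft term applied here).
        result = best(i + 1, j) + pseudo[i]
        # Grammar case 2: position i pairs with some k (hard check first).
        if i not in must_be_unpaired:
            for k in range(i + MIN_LOOP + 1, j + 1):
                if k in must_be_unpaired:
                    continue  # pair excluded from the search space
                energy = PAIR_ENERGY.get((seq[i], seq[k]))
                if energy is None:
                    continue  # not a canonical pair in this toy model
                result = min(result, energy + best(i + 1, k - 1) + best(k + 1, j))
        return result

    return best(0, n - 1)


if __name__ == "__main__":
    seq = "GGGAAACCC"
    print(fold(seq))                                  # unconstrained
    print(fold(seq, must_be_unpaired=frozenset({0})))  # hard constraint
    bonus = [-4.0] + [0.0] * (len(seq) - 1)
    print(fold(seq, unpaired_pseudo_energy=bonus))    # soft constraint
```

In this sketch the hard check sits in front of the energy lookup and the soft term is added alongside it, which mirrors the separation between search space and energy evaluation described above. A small bonus for leaving position 0 unpaired does not change the optimum, whereas a large one does, so the soft constraint shifts preferences without forbidding any structure.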
