The impact of various seed, accessibility and interaction constraints on sRNA target prediction: a systematic assessment

Background: Seed and accessibility constraints are core features that enable highly accurate sRNA target screens based on RNA-RNA interaction prediction. Currently available tools provide different (sets of) constraints and default parameter sets. It is therefore difficult, if not impossible, for users to estimate the influence of individual restrictions on the prediction results.

Results: Here, we present a systematic assessment of the impact of established and new constraints on sRNA target prediction, in terms of both prediction quality and computational cost. The assessment is carried out exemplarily on the performance of IntaRNA, one of the most accurate sRNA target prediction tools. IntaRNA provides various ways to constrain the seed interactions it considers, e.g. based on seed length, seed accessibility, minimal unpaired probabilities, or energy thresholds, alongside analogous constraints for the overall interaction. Thus, our results reveal the impact of individual constraints and their combinations.

Conclusions: This provides both a guide for users on which constraints matter and recommendations for existing and upcoming sRNA target prediction approaches. We show on a large sRNA target screen benchmark data set that, solely by altering the parameter set, IntaRNA recovers 30% more verified interactions while becoming five times faster. This exemplifies the potential of seed, accessibility and interaction constraints for sRNA target prediction.
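The seed constraints listed in the abstract (seed length, minimal unpaired probabilities, energy thresholds) can be read as independent filters applied to each candidate seed. The following is a minimal sketch of that idea only. It is not IntaRNA's implementation or API: the `SeedCandidate` record, the function names, and all threshold defaults are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SeedCandidate:
    """Hypothetical record describing one candidate seed interaction."""
    base_pairs: int    # number of inter-molecular base pairs in the seed
    energy: float      # seed energy in kcal/mol (more negative = more stable)
    pu_query: float    # probability that the sRNA seed region is unpaired
    pu_target: float   # probability that the target seed region is unpaired


def passes_seed_constraints(seed, min_base_pairs=7, max_energy=0.0,
                            min_unpaired_prob=0.0):
    """Return True if the seed satisfies all three illustrative constraints.

    The default thresholds are assumptions for this sketch, not the
    defaults of any particular tool.
    """
    return (seed.base_pairs >= min_base_pairs
            and seed.energy <= max_energy
            and min(seed.pu_query, seed.pu_target) >= min_unpaired_prob)


def filter_seeds(seeds, **constraints):
    """Keep only the seeds that pass every constraint, preserving order."""
    return [s for s in seeds if passes_seed_constraints(s, **constraints)]
```

Tightening one threshold at a time (for example, raising `min_unpaired_prob`) and counting the surviving seeds gives a rough feel for how an individual constraint shrinks the search space, which is the kind of per-constraint effect the assessment quantifies.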
