An Adaptive Defect Weighted Sampling Algorithm to Design Pseudoknotted RNA Secondary Structures

Computational design of RNA sequences that fold into targeted secondary structures has many applications in biomedicine, nanotechnology and synthetic biology. An RNA molecule is made of different types of secondary structure elements and an important RNA element named pseudoknot plays a key role in stabilizing the functional form of the molecule. However, due to the computational complexities associated with characterizing pseudoknotted RNA structures, most of the existing RNA sequence designer algorithms generally ignore this important structural element and therefore limit their applications. In this paper we present a new algorithm to design RNA sequences for pseudoknotted secondary structures. We use NUPACK as the folding algorithm to compute the equilibrium characteristics of the pseudoknotted RNAs, and describe a new adaptive defect weighted sampling algorithm named Enzymer to design low ensemble defect RNA sequences for targeted secondary structures including pseudoknots. We used a biological data set of 201 pseudoknotted structures from the Pseudobase library to benchmark the performance of our algorithm. We compared the quality characteristics of the RNA sequences we designed by Enzymer with the results obtained from the state of the art MODENA and antaRNA. Our results show our method succeeds more frequently than MODENA and antaRNA do, and generates sequences that have lower ensemble defect, lower probability defect and higher thermostability. Finally by using Enzymer and by constraining the design to a naturally occurring and highly conserved Hammerhead motif, we designed 8 sequences for a pseudoknotted cis-acting Hammerhead ribozyme. Enzymer is available for download at https://bitbucket.org/casraz/enzymer.

[1]  M. Singer,et al.  Messenger RNA: an evaluation. , 1966, Annual review of biochemistry.

[2]  M. P. Smith,et al.  The New Frontier. , 1984, RNABC news.

[3]  J. McCaskill The equilibrium partition function and base pair binding probabilities for RNA secondary structure , 1990, Biopolymers.

[4]  C. Florentz,et al.  tRNA structure and aminoacylation efficiency. , 1993, Progress in nucleic acid research and molecular biology.

[5]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[6]  J. Ng,et al.  PseudoBase: a database with RNA pseudoknots , 2000, Nucleic Acids Res..

[7]  G. Hannon RNA interference : RNA , 2002 .

[8]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[9]  Niles A. Pierce,et al.  A partition function algorithm for nucleic acid secondary structure including pseudoknots , 2003, J. Comput. Chem..

[10]  E. Westhof,et al.  Analysis of RNA motifs. , 2003, Current opinion in structural biology.

[11]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[12]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[13]  Anne Condon,et al.  A new algorithm for RNA secondary structure design. , 2004, Journal of molecular biology.

[14]  Robert M. Dirks,et al.  Paradigms for computational nucleic acid design. , 2004, Nucleic acids research.

[15]  R. Breaker,et al.  Computational design and experimental validation of oligonucleotide-sensing allosteric ribozymes , 2005, Nature Biotechnology.

[16]  H. Hoos,et al.  HotKnots: heuristic prediction of RNA secondary structures including pseudoknots. , 2005, RNA.

[17]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[18]  Thomas Stützle,et al.  Ant colony optimization: artificial ants as a computational intelligence technique , 2006 .

[19]  Rolf Backofen,et al.  INFO-RNA - a fast approach to inverse RNA folding , 2006, Bioinform..

[20]  A. Ferré-D’Amaré,et al.  Structural Basis of glmS Ribozyme Activation by Glucosamine-6-Phosphate , 2006, Science.

[21]  G. Soukup Core requirements for glmS ribozyme self-cleavage reveal a putative pseudoknot structure , 2006, Nucleic acids research.

[22]  J. Mattick,et al.  Non-coding RNA. , 2006, Human molecular genetics.

[23]  Colin N. Dewey,et al.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures , 2007, Nature.

[24]  A novel structural rearrangement of hepatitis delta virus antigenomic ribozyme , 2007, Nucleic acids research.

[25]  Wojciech Kasprzak,et al.  Bridging the gap in RNA structure prediction. , 2007, Current opinion in structural biology.

[26]  R. Batey,et al.  Structure of the SAM-II riboswitch bound to S-adenosylmethionine , 2008, Nature Structural &Molecular Biology.

[27]  Bonnie Berger,et al.  Inverting the Viterbi algorithm: an abstract framework for structure design , 2008, ICML '08.

[28]  F. Slack,et al.  Small non-coding RNAs in animal development , 2008, Nature Reviews Molecular Cell Biology.

[29]  Christian M. Reidys,et al.  Inverse folding of RNA pseudoknot structures , 2009, Algorithms for Molecular Biology.

[30]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[31]  Kiyoshi Asai,et al.  Prediction of RNA secondary structure using generalized centroid estimators , 2009, Bioinform..

[32]  Yanga Byun,et al.  PseudoViewer3: generating planar drawings of large-scale RNA structures with pseudoknots , 2009, Bioinform..

[33]  D. Mathews,et al.  Improved RNA secondary structure prediction by maximizing expected pair accuracy. , 2009, RNA.

[34]  T. Henkin,et al.  Riboswitch RNAs: Regulation of gene expression by direct monitoring of a physiological signal , 2010, RNA biology.

[35]  John F. G. Atack,et al.  RNA Interference , 2010, Methods in Molecular Biology.

[36]  D. Mathews,et al.  ProbKnot: fast prediction of RNA secondary structure including pseudoknots. , 2010, RNA.

[37]  Ahmad S. Khalil,et al.  Synthetic biology: applications come of age , 2010, Nature Reviews Genetics.

[38]  Andrea L Edwards,et al.  Structural basis for recognition of S-adenosylhomocysteine by riboswitches. , 2010, RNA.

[39]  Yann Ponty,et al.  A Combinatorial Framework for Designing (Pseudoknotted) RNA Algorithms , 2011, WABI.

[40]  Tatsuya Akutsu,et al.  IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming , 2011, Bioinform..

[41]  J. Perreault,et al.  In vitro and in vivo cleavage of HIV-1 RNA by new SOFA-HDV ribozymes and their potential to inhibit viral replication , 2011, RNA biology.

[42]  T. Schlick,et al.  Computational approaches to RNA structure prediction, analysis, and design. , 2011, Current opinion in structural biology.

[43]  Zasha Weinberg,et al.  Identification of Hammerhead Ribozymes in All Domains of Life Reveals Novel Structural Variations , 2011, PLoS Comput. Biol..

[44]  Niles A. Pierce,et al.  Nucleic acid sequence design via efficient ensemble defect optimization , 2011, J. Comput. Chem..

[45]  L. Jaeger,et al.  Multistrand RNA secondary structure prediction and nanostructure design including pseudoknots. , 2011, ACS nano.

[46]  Joe C. Liang,et al.  Engineering biological systems with synthetic RNA molecules. , 2011, Molecular cell.

[47]  Rune B. Lyngsø,et al.  Inverse folding of RNA , 2011 .

[48]  Alexander Churkin,et al.  RNAexinv: An extended inverse RNA folding from shape and physical attributes to sequences , 2011, BMC Bioinformatics.

[49]  Akito Taneda,et al.  Multi-Objective Genetic Algorithm for Pseudoknotted RNA Sequence Design , 2012, Front. Gene..

[50]  B. Berger,et al.  A global sampling approach to designing and reengineering RNA secondary structures , 2012, Nucleic acids research.

[51]  Jotun Hein,et al.  Frnakenstein: multiple target inverse RNA folding , 2012, BMC Bioinformatics.

[52]  J. Burnett,et al.  RNA-based therapeutics: current progress and future prospects. , 2012, Chemistry & biology.

[53]  Yann Ponty,et al.  A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution , 2013, Bioinform..

[54]  J. Rossi,et al.  RNA Nanotechnology Approach for Targeted Delivery of RNA Therapeutics Using Cell-Internalizing Aptamers , 2013 .

[55]  D. Mathews,et al.  31 Discovery of novel ncRNA by scanning multiple genome alignments , 2013 .

[56]  Thomas E. Landrain,et al.  A new frontier in synthetic biology: automated design of small RNA devices in bacteria. , 2013, Trends in genetics : TIG.

[57]  Peter Clote,et al.  Rnaifold: a Constraint Programming Algorithm for RNA inverse Folding and molecular Design , 2013, J. Bioinform. Comput. Biol..

[58]  Christoph Dieterich,et al.  Computational biology of RNA interactions , 2013, Wiley interdisciplinary reviews. RNA.

[59]  J. Perreault,et al.  Design and evaluation of clinically relevant SOFA-HDV ribozymes targeting HIV RNA. , 2014, Methods in molecular biology.

[60]  R. Breaker,et al.  A widespread self-cleaving ribozyme class is revealed by bioinformatics , 2013, Nature chemical biology.

[61]  Cody W. Geary,et al.  A single-stranded architecture for cotranscriptional folding of RNA nanostructures , 2014, Science.

[62]  B. Rogelj,et al.  The many faces of small nucleolar RNAs. , 2014, Biochimica et biophysica acta.

[63]  A. Gregory Matera,et al.  A day in the life of the spliceosome , 2014, Nature Reviews Molecular Cell Biology.

[64]  Rolf Backofen,et al.  antaRNA – Multi-objective inverse folding of pseudoknot RNA using ant-colony optimization , 2015, BMC Bioinformatics.

[65]  B. Shapiro,et al.  Engineered RNA Nanodesigns for Applications in RNA Nanotechnology , 2013, DNA and RNA nanotechnology.

[66]  Ján Manuch,et al.  Combinatorial RNA Design: Designability and Structure-Approximating Algorithm , 2015, CPM.

[67]  Niles A Pierce,et al.  Sequence Design for a Test Tube of Interacting Nucleic Acid Strands. , 2015, ACS synthetic biology.

[68]  Robert Giegerich,et al.  The RNA shapes studio , 2014, Bioinform..