RNAblueprint: flexible multiple target nucleic acid sequence design

Motivation: Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. Results: We developed a C ++ library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows to specify and explore the solution space in a well defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re‐evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaption of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. Availability and implementation: RNAblueprint, Python implementations and benchmark datasets are available at github: https://github.com/ViennaRNA. Contact: s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  B. Berger,et al.  A global sampling approach to designing and reengineering RNA secondary structures , 2012, Nucleic acids research.

[2]  Niles A Pierce,et al.  Sequence Design for a Test Tube of Interacting Nucleic Acid Strands. , 2015, ACS synthetic biology.

[3]  Jotun Hein,et al.  Frnakenstein: multiple target inverse RNA folding , 2012, BMC Bioinformatics.

[4]  Julius B. Lucks,et al.  Engineering naturally occurring trans-acting non-coding RNAs to sense molecular signals , 2012, Nucleic acids research.

[5]  Peter Clote,et al.  Rnaifold: a Constraint Programming Algorithm for RNA inverse Folding and molecular Design , 2013, J. Bioinform. Comput. Biol..

[6]  Akito Taneda,et al.  MODENA: a multi-objective RNA inverse folding , 2010, Advances and applications in bioinformatics and chemistry : AABC.

[7]  Ingrid G. Abfalter,et al.  Computational design of RNAs with complex energy landscapes , 2013, Biopolymers.

[8]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..

[9]  Rolf Backofen,et al.  antaRNA: ant colony-based RNA sequence design , 2015, Bioinform..

[10]  Mohammad Ganjtabesh,et al.  Evolutionary solution for the RNA design problem , 2014, Bioinform..

[11]  Hebing Chen,et al.  ARDesigner: a web-based system for allosteric RNA design. , 2010, Journal of biotechnology.

[12]  Minjae Lee,et al.  RNA design rules from a massive open laboratory , 2014, Proceedings of the National Academy of Sciences.

[13]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[14]  Alfonso Jaramillo,et al.  RiboMaker: computational design of conformation-based riboregulation , 2014, Bioinform..

[15]  Anne Condon,et al.  A new algorithm for RNA secondary structure design. , 2004, Journal of molecular biology.

[16]  Niles A. Pierce,et al.  Nucleic acid sequence design via efficient ensemble defect optimization , 2011, J. Comput. Chem..

[17]  Rolf Backofen,et al.  antaRNA – Multi-objective inverse folding of pseudoknot RNA using ant-colony optimization , 2015, BMC Bioinformatics.

[18]  Yann Ponty,et al.  A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution , 2013, Bioinform..

[19]  Robert Giegerich,et al.  The RNA shapes studio , 2014, Bioinform..

[20]  James Chappell,et al.  Creating small transcription activating RNAs. , 2015, Nature chemical biology.

[21]  J. Collins,et al.  Toehold Switches: De-Novo-Designed Regulators of Gene Expression , 2014, Cell.

[22]  Eli Upfal,et al.  Probability and Computing: Randomized Algorithms and Probabilistic Analysis , 2005 .

[23]  A. Torda,et al.  Dynamics in Sequence Space for RNA Secondary Structure Design. , 2012, Journal of chemical theory and computation.

[24]  Beatrix Suess,et al.  Riboswitch engineering - making the all-important second and third steps. , 2015, Current opinion in biotechnology.

[25]  Peter Clote,et al.  RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules , 2015, Nucleic Acids Res..

[26]  Robert M. Dirks,et al.  Paradigms for computational nucleic acid design. , 2004, Nucleic acids research.

[27]  Akito Taneda Multi-objective optimization for RNA design with multiple target secondary structures , 2015, BMC Bioinformatics.

[28]  P. Stadler,et al.  De novo design of a synthetic riboswitch that regulates transcription termination , 2012, Nucleic acids research.

[29]  H. Salis,et al.  Automated physics-based design of synthetic riboswitches from diverse RNA aptamers , 2015, Nucleic acids research.

[30]  Conrad Steenberg,et al.  NUPACK: Analysis and design of nucleic acid systems , 2011, J. Comput. Chem..

[31]  P. Stadler,et al.  Design of multistable RNA molecules. , 2001, RNA.

[32]  Farren J. Isaacs,et al.  Engineered riboregulators enable post-transcriptional control of gene expression , 2004, Nature Biotechnology.

[33]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[34]  Peter Clote,et al.  RNAiFold: a web server for RNA inverse folding and molecular design , 2013, Nucleic Acids Res..

[35]  Juliane Neupert,et al.  Design of simple synthetic RNA thermometers for temperature-controlled gene expression in Escherichia coli , 2008, Nucleic acids research.

[36]  Uzi Vishkin,et al.  Parallel Ear Decomposition Search (EDS) and st-Numbering in Graphs , 1986, Theor. Comput. Sci..

[37]  Peter F. Stadler,et al.  RNA folding with hard and soft constraints , 2016, Algorithms for Molecular Biology.

[38]  Rolf Backofen,et al.  INFO-RNA - a fast approach to inverse RNA folding , 2006, Bioinform..

[39]  Thomas E. Landrain,et al.  De novo automated design of small RNA circuits for engineering synthetic riboregulation in living cells , 2012, Proceedings of the National Academy of Sciences.

[40]  Alexander Churkin,et al.  RNAexinv: An extended inverse RNA folding from shape and physical attributes to sequences , 2011, BMC Bioinformatics.