Multiobjective Metaheuristic to Design RNA Sequences

RNA inverse folding problem is a bioinformatics problem where the objective is to find an RNA sequence that folds into a given target secondary structure. In this paper, we use evolutionary computation to solve a new and innovative multiobjective definition of this problem. In this new multiobjective definition of the problem, we have considered the similarity between target and predicted structures as a constraint, and three objective functions: 1) partition function (free energy of the ensemble); 2) ensemble diversity; and 3) nucleotides composition. The multiobjective metaheuristic to design RNA sequences (m2dRNAs) proposed in this paper is compared against other RNA inverse folding methods published in the literature, such as RNAinverse, RNA secondary structure designer, inverse folding of RNA, MODENA, NUPACK, fRNAkenstein, dynamics in sequence space optimization, RNAiFOLD, antaRNA, evolutionary RNA design, and Eterna players. After a comprehensive comparative study on two well-known benchmarks (Rfam and Eterna100), we conclude that m2dRNAs is capable of obtaining very promising results in terms of both quality of RNA designs and required runtime. The source code of m2dRNAs is available at http://arco.unex.es/arl/m2dRNAs-source_code.zip.

[1]  Kalyanmoy Deb,et al.  An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints , 2014, IEEE Transactions on Evolutionary Computation.

[2]  Yann Ponty,et al.  Design of RNAs: comparing programs for inverse RNA folding , 2017, Briefings Bioinform..

[3]  Minjae Lee,et al.  RNA design rules from a massive open laboratory , 2014, Proceedings of the National Academy of Sciences.

[4]  T. Cech RNA finds a simpler way , 2004, Nature.

[5]  Douglas B. Kell,et al.  Multiobjective Optimization in Bioinformatics and Computational Biology , 2007, IEEE ACM Trans. Comput. Biol. Bioinform..

[6]  Tatsuya Akutsu,et al.  IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming , 2011, Bioinform..

[7]  Ronny Lorenz,et al.  The Vienna RNA Websuite , 2008, Nucleic Acids Res..

[8]  Peter Clote,et al.  Rnaifold: a Constraint Programming Algorithm for RNA inverse Folding and molecular Design , 2013, J. Bioinform. Comput. Biol..

[9]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[10]  Akito Taneda Multi-objective optimization for RNA design with multiple target secondary structures , 2015, BMC Bioinformatics.

[11]  Bonnie Berger,et al.  Inverting the Viterbi algorithm: an abstract framework for structure design , 2008, ICML '08.

[12]  Rolf Backofen,et al.  antaRNA: ant colony-based RNA sequence design , 2015, Bioinform..

[13]  Miguel A. Vega-Rodríguez,et al.  A Hybrid Multiobjective Memetic Metaheuristic for Multiple Sequence Alignment , 2016, IEEE Transactions on Evolutionary Computation.

[14]  Mohammad Ganjtabesh,et al.  Evolutionary solution for the RNA design problem , 2014, Bioinform..

[15]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[16]  Jotun Hein,et al.  Frnakenstein: multiple target inverse RNA folding , 2012, BMC Bioinformatics.

[17]  Lothar Thiele,et al.  Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..

[18]  Anne Condon,et al.  A new algorithm for RNA secondary structure design. , 2004, Journal of molecular biology.

[19]  Richard A. Watson,et al.  Reducing Local Optima in Single-Objective Problems by Multi-objectivization , 2001, EMO.

[20]  Niles A. Pierce,et al.  Nucleic acid sequence design via efficient ensemble defect optimization , 2011, J. Comput. Chem..

[21]  Yaochu Jin,et al.  Single and Multi-objective in Silico Evolution of Tunable Genetic Oscillators , 2013, EMO.

[22]  Meikang Qiu,et al.  RNA nanotechnology for computer design and in vivo computation , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[23]  Lothar Thiele,et al.  Comparison of Multiobjective Evolutionary Algorithms: Empirical Results , 2000, Evolutionary Computation.

[24]  D. Higgins,et al.  R-Coffee: a method for multiple alignment of non-coding RNA , 2008, Nucleic acids research.

[25]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[26]  A. Torda,et al.  Dynamics in Sequence Space for RNA Secondary Structure Design. , 2012, Journal of chemical theory and computation.

[27]  Rune B. Lyngsø RNA Secondary Structure Prediction by Minimum Free Energy , 2008, Encyclopedia of Algorithms.

[28]  Guillermo Sapiro,et al.  An image processing approach to computing distances between RNA secondary structures dot plots , 2009, Algorithms for Molecular Biology.

[29]  Rune B. Lyngsø RNA Secondary Structure Boltzmann Distribution , 2016, Encyclopedia of Algorithms.

[30]  Andreas Wagner,et al.  Neutral network sizes of biological RNA molecules can be computed and are not atypically small , 2008, BMC Bioinformatics.

[31]  Mohammad Ganjtabesh,et al.  ERD: a fast and reliable tool for RNA design including constraints , 2015, BMC Bioinformatics.

[32]  Michael T. Wolfinger,et al.  Predicting RNA secondary structures from sequence and probing data. , 2016, Methods.

[33]  Rolf Backofen,et al.  antaRNA – Multi-objective inverse folding of pseudoknot RNA using ant-colony optimization , 2015, BMC Bioinformatics.

[34]  Rolf Backofen,et al.  INFO-RNA - a fast approach to inverse RNA folding , 2006, Bioinform..

[35]  Hisao Ishibuchi,et al.  Performance comparison of NSGA-II and NSGA-III on various many-objective test problems , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[36]  Peter Clote,et al.  RNAiFold 2.0: a web server and software to design custom and Rfam-based RNA molecules , 2015, Nucleic Acids Res..

[37]  Robert M. Dirks,et al.  Paradigms for computational nucleic acid design. , 2004, Nucleic acids research.

[38]  J. McCaskill The equilibrium partition function and base pair binding probabilities for RNA secondary structure , 1990, Biopolymers.

[39]  Akito Taneda,et al.  Multi-Objective Genetic Algorithm for Pseudoknotted RNA Sequence Design , 2012, Front. Gene..

[40]  Akito Taneda,et al.  MODENA: a multi-objective RNA inverse folding , 2010, Advances and applications in bioinformatics and chemistry : AABC.

[41]  Anne Condon,et al.  Computational RNA secondary structure design: empirical complexity and improved methods , 2007, BMC Bioinformatics.

[42]  H. Hoos,et al.  HotKnots: heuristic prediction of RNA secondary structures including pseudoknots. , 2005, RNA.

[43]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..

[44]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[45]  Miguel A. Vega-Rodríguez,et al.  Hybrid multiobjective artificial bee colony for multiple sequence alignment , 2016, Appl. Soft Comput..

[46]  P. Siarry,et al.  Multiobjective Optimization: Principles and Case Studies , 2004 .

[47]  Jamie J. Cannone,et al.  Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction , 2004, BMC Bioinformatics.

[48]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[49]  Michelle J. Wu,et al.  Principles for Predicting RNA Secondary Structure Design Difficulty. , 2016, Journal of molecular biology.

[50]  Sebastian E. Ahnert,et al.  Genetic Correlations Greatly Increase Mutational Robustness and Can Both Reduce and Enhance Evolvability , 2015, PLoS Comput. Biol..

[51]  Eckart Zitzler,et al.  Reducing Bloat in GP with Multiple Objectives , 2008, Multiobjective Problem Solving from Nature.

[52]  Joshua D. Knowles,et al.  Investigations into the Effect of Multiobjectivization in Protein Structure Prediction , 2008, PPSN.

[53]  Miguel A. Vega-Rodríguez,et al.  A multiobjective approach based on artificial bee colony for the static routing and wavelength assignment problem , 2013, Soft Comput..