Multi-Objective Genetic Algorithm for Pseudoknotted RNA Sequence Design

RNA inverse folding is a computational technology for designing RNA sequences which fold into a user-specified secondary structure. Although pseudoknots are functionally important motifs in RNA structures, less reports concerning the inverse folding of pseudoknotted RNAs have been done compared to those for pseudoknot-free RNA design. In this paper, we present a new version of our multi-objective genetic algorithm (MOGA), MODENA, which we have previously proposed for pseudoknot-free RNA inverse folding. In the new version of MODENA, (i) a new crossover operator is implemented and (ii) pseudoknot prediction methods, IPknot and HotKnots, are used to evaluate the designed RNA sequences, allowing us to perform the inverse folding of pseudoknotted RNAs. The new version of MODENA with the new crossover operator was benchmarked with a dataset composed of natural pseudoknotted RNA secondary structures, and we found that MODENA can successfully design more pseudoknotted RNAs compared to the other pseudoknot design algorithm. In addition, a sequence constraint function newly implemented in the new version of MODENA was tested by designing RNA sequences which fold into the pseudoknotted structure of a hepatitis delta virus ribozyme; as a result, we successfully designed eight RNA sequences. The new version of MODENA is downloadable from http://rna.eit.hirosaki-u.ac.jp/modena/.

[1]  H. Hoos,et al.  HotKnots: heuristic prediction of RNA secondary structures including pseudoknots. , 2005, RNA.

[2]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[3]  Akito Taneda,et al.  MODENA: a multi-objective RNA inverse folding , 2010, Advances and applications in bioinformatics and chemistry : AABC.

[4]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[5]  E. Westhof,et al.  TectoRNA: modular assembly units for the construction of RNA nano-objects. , 2001, Nucleic acids research.

[6]  D. W. Staple,et al.  Open access, freely available online Primer Pseudoknots: RNA Structures with Diverse Functions , 2022 .

[7]  R. K. Ursem Multi-objective Optimization using Evolutionary Algorithms , 2009 .

[8]  Ronald R. Breaker,et al.  Natural and engineered nucleic acids as tools to explore biology , 2004, Nature.

[9]  Christian M. Reidys,et al.  Inverse folding of RNA pseudoknot structures , 2009, Algorithms for Molecular Biology.

[11]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[12]  Michael Zuker,et al.  UNAFold: software for nucleic acid folding and hybridization. , 2008, Methods in molecular biology.

[13]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[14]  Rolf Backofen,et al.  INFO-RNA - a fast approach to inverse RNA folding , 2006, Bioinform..

[15]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[16]  J. Ng,et al.  PseudoBase: a database with RNA pseudoknots , 2000, Nucleic Acids Res..

[17]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[18]  Kiyoshi Asai,et al.  Prediction of RNA secondary structure using generalized centroid estimators , 2009, Bioinform..

[19]  Thomas Stützle,et al.  Stochastic Local Search: Foundations & Applications , 2004 .

[20]  Detlef Weigel,et al.  Highly Specific Gene Silencing by Artificial MicroRNAs in Arabidopsis[W][OA] , 2006, The Plant Cell Online.

[21]  Tatsuya Akutsu,et al.  IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming , 2011, Bioinform..

[22]  Anne Condon,et al.  A new algorithm for RNA secondary structure design. , 2004, Journal of molecular biology.

[23]  D. Bartel,et al.  One sequence, two ribozymes: implications for the emergence of new ribozyme folds. , 2000, Science.

[24]  Conrad Steenberg,et al.  NUPACK: Analysis and design of nucleic acid systems , 2011, J. Comput. Chem..

[25]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[26]  Anne Condon,et al.  Classifying RNA pseudoknotted structures , 2004, Theor. Comput. Sci..

[27]  F. Major,et al.  The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data , 2008, Nature.