RNA Secondary Structure Prediction Using Soft Computing

Predicting RNA structure is invaluable for designing new drugs and understanding genetic diseases. Over more than a decade, several deterministic algorithms and soft computing-based techniques have been developed to determine the structure from a known RNA sequence. Soft computing gained importance with the need for approximate solutions that account for kinetic effects, cotranscriptional folding, and the estimation of certain energy parameters. Some of the soft computing-based techniques developed for RNA secondary structure prediction are briefly described, along with their relevance. The basic concepts of RNA and its structural elements, such as the helix, bulge, hairpin loop, internal loop, and multiloop, are described first. These are followed by methodologies that employ genetic algorithms, artificial neural networks, and fuzzy logic. The role of metaheuristics such as simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. As an example, the techniques are compared on the prediction of 12 known RNA secondary structures. Open challenges for future work are then outlined.
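To make the metaheuristic idea concrete, the following is a minimal sketch of simulated annealing applied to secondary structure search. It is not any of the surveyed methods. The energy function is a toy assumption (each canonical or G-U wobble pair scores -1) rather than a nearest-neighbor thermodynamic model, pseudoknots are disallowed, and every name and parameter (`anneal`, `MIN_LOOP`, the cooling schedule, the example sequence) is hypothetical and chosen only for illustration.

```python
import math
import random

# Watson-Crick pairs plus the G-U wobble pair.
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3  # assumed minimum number of unpaired bases enclosed by a hairpin


def can_pair(seq, i, j):
    """True if positions i < j may pair under the toy rules."""
    return j - i > MIN_LOOP and (seq[i], seq[j]) in CANONICAL


def conflicts(pairs, i, j):
    """True if (i, j) reuses a base or crosses (pseudoknots) an existing pair."""
    for k, l in pairs:
        if len({i, j, k, l}) < 4:
            return True
        if k < i < l < j or i < k < j < l:
            return True
    return False


def energy(pairs):
    """Toy energy (assumption): every base pair contributes -1."""
    return -len(pairs)


def anneal(seq, steps=5000, t_start=2.0, t_end=0.05, seed=0):
    """Search for a low-energy set of non-crossing base pairs."""
    rng = random.Random(seed)
    n = len(seq)
    candidates = [(i, j) for i in range(n) for j in range(i + 1, n)
                  if can_pair(seq, i, j)]
    current, best = set(), set()
    if not candidates:
        return best
    for step in range(steps):
        # Geometric cooling from t_start towards t_end.
        t = t_start * (t_end / t_start) ** (step / steps)
        move = rng.choice(candidates)
        if move in current:
            proposal = current - {move}      # remove an existing pair
        elif not conflicts(current, *move):
            proposal = current | {move}      # add a compatible pair
        else:
            continue                         # incompatible move, skip
        delta = energy(proposal) - energy(current)
        # Metropolis criterion: always accept improvements,
        # accept worse states with probability exp(-delta / t).
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            current = proposal
            if energy(current) < energy(best):
                best = set(current)
    return best


def to_dot_bracket(n, pairs):
    """Render a pair set in dot-bracket notation."""
    chars = ["."] * n
    for i, j in pairs:
        chars[i], chars[j] = "(", ")"
    return "".join(chars)


seq = "GGGAAAUCC"
pairs = anneal(seq)
print(to_dot_bracket(len(seq), pairs))
```

Replacing `energy` with a thermodynamic model and the single-pair move with helix-level moves would bring the sketch closer to the permutation-based and helix-based formulations, at the cost of considerably more code.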
