RIPGA: RNA-RNA interaction prediction using genetic algorithm

Non-coding RNAs are RNA molecules that do not translate into proteins. These RNAs are functional important in many biological processes. Their biological functions are highly related to their interaction partners. RNA-RNA interactions are one of the possibilities. It is desired to use computational methods to study and predict the interaction partners of non-coding RNAs. Most recent programs for RNA-RNA interaction prediction programs are based on Turner Energy Model. Some papers show that the free energy of RNA-structures is lower than random sequences but it is not statistically significant. It shows that we may need to modify the energy model for different RNA structures applications. In this paper, we first study the RNA-RNA interaction pattern using experimental validated RNA-RNA interaction data, which are extracted from sRNATarBase. We study the sRNA-mRNA interaction data and extract some features of the RNA-RNA interaction patterns. Then we combine these features about interaction sites into the Turner Energy Model. We develop a genetic algorithm based program RIPGA to solve the RNA-RNA interaction prediction problem. We use genetic algorithm because the RNA-RNA interaction prediction is NP-hard and we are interested to find out good suboptimal solutions. We use an sRNA-mRNA interaction dataset to evaluate the performance of the modified energy model and compare the results with two state-of-the-art programs. The comparison of the original model and the modified model shows that the modified energy model has better performance in both sensitivity and positive predictive value (PPV). Comparing RIPGA with state-of-the-art programs, RIPGA have better sensitivity and comparable positive predictive value.

[1]  Elena Rivas,et al.  Secondary structure alone is generally not statistically significant for the detection of noncoding RNAs , 2000, Bioinform..

[2]  Paulo P. Amaral,et al.  The Eukaryotic Genome as an RNA Machine , 2008, Science.

[3]  M. Zuker,et al.  Prediction of hybridization and melting for double-stranded nucleic acids. , 2004, Biophysical journal.

[4]  Tatsuya Akutsu,et al.  RactIP: fast and accurate prediction of RNA-RNA interaction using integer programming , 2010, Bioinform..

[5]  Kaizhong Zhang,et al.  RNA-RNA Interaction Prediction and Antisense RNA Target Search , 2006, J. Comput. Biol..

[6]  Rolf Backofen,et al.  IntaRNA: efficient prediction of bacterial sRNA targets incorporating target site accessibility and seed regions , 2008, Bioinform..

[7]  D. Turner,et al.  Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[8]  R. Giegerich,et al.  Fast and effective prediction of microRNA/target duplexes. , 2004, RNA.

[9]  Christian M. Reidys,et al.  Target prediction and a statistical sampling algorithm for RNA–RNA interaction , 2009, Bioinform..

[10]  Tatsuya Akutsu,et al.  Dynamic Programming Algorithms for RNA Structure Prediction with Binding Sites , 2010, Pacific Symposium on Biocomputing.

[11]  Hamidreza Chitsaz,et al.  A partition function algorithm for interacting nucleic acid strands , 2009, Bioinform..

[12]  D. Pervouchine IRIS: intermolecular RNA interaction search. , 2004, Genome informatics. International Conference on Genome Informatics.

[13]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.

[14]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[15]  Yuan Cao,et al.  sRNATarBase: a comprehensive database of bacterial sRNA targets verified by experiments. , 2010, RNA.

[16]  Peter F. Stadler,et al.  Partition function and base pairing probabilities of RNA heterodimers , 2006, Algorithms for Molecular Biology.

[17]  Tatsuya Akutsu,et al.  A grammatical approach to RNA-RNA interaction prediction , 2009, Pattern Recognit..

[18]  J. Bonfield,et al.  Finishing the euchromatic sequence of the human genome , 2004, Nature.

[19]  P. Clote,et al.  Structural RNA has lower folding energy than random RNA of the same dinucleotide frequency. , 2005, RNA.

[20]  Peter F. Stadler,et al.  Thermodynamics of RNA-RNA Binding , 2006, German Conference on Bioinformatics.

[21]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[22]  Klas Flärdh,et al.  Antisense RNAs everywhere? , 2002, Trends in genetics : TIG.

[23]  S. Eddy Non–coding RNA genes and the modern RNA world , 2001, Nature Reviews Genetics.

[24]  A. Condon,et al.  Secondary structure prediction of interacting RNA molecules. , 2005, Journal of molecular biology.

[25]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.

[26]  Christian M. Reidys,et al.  Partition function and base pairing probabilities for RNA-RNA interaction prediction , 2009, Bioinform..

[27]  Rolf Backofen,et al.  Fast prediction of RNA-RNA interaction , 2009, Algorithms for Molecular Biology.

[28]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..