Prediction of common secondary structures of RNAs: a genetic algorithm approach.

In this study we apply a genetic algorithm to a set of RNA sequences to find common RNA secondary structures. Our method is a three-step procedure. At the first stage of the procedure for each sequence, a genetic algorithm is used to optimize the structures in a population to a certain degree of stability. In this step, the free energy of a structure is the fitness criterion for the algorithm. Next, for each structure, we define a measure of structural conservation with respect to those in other sequences. We use this measure in a genetic algorithm to improve the structural similarity among sequences for the structures in the population of a sequence. Finally, we select those structures satisfying certain conditions of structural stability and similarity as predicted common structures for a set of RNA sequences. We have obtained satisfactory results from a set of tRNA, 5S rRNA, rev response elements (RRE) of HIV-1 and RRE of HIV-2/SIV, respectively.

[1]  David Sankoff,et al.  RNA secondary structures and their prediction , 1984 .

[2]  B. Ganem RNA world , 1987, Nature.

[3]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[4]  M. Waterman [52] Computer analysis of nucleic acid sequences , 1988 .

[5]  S. Le,et al.  The HIV-1 rev trans-activator acts through a structured target sequence to activate nuclear export of unspliced viral mRNA , 1989, Nature.

[6]  M. Zuker On finding all suboptimal foldings of an RNA molecule. , 1989, Science.

[7]  S. Le,et al.  A highly conserved RNA folding region coincident with the Rev response element of primate immunodeficiency viruses. , 1990, Nucleic acids research.

[8]  M. Zuker,et al.  Common structures of the 5' non-coding RNA in enteroviruses and rhinoviruses. Thermodynamical stability and statistical significance. , 1990, Journal of molecular biology.

[9]  B. Berkhout,et al.  Functional roles for the TATA promoter and enhancers in basal and Tat-induced expression of the human immunodeficiency virus type 1 long terminal repeat , 1992, Journal of virology.

[10]  R. Durbin,et al.  RNA sequence analysis using covariance models. , 1994, Nucleic acids research.

[11]  M. Zuker,et al.  "Well-determined" regions in RNA secondary structure prediction: analysis of small subunit ribosomal RNA. , 1995, Nucleic acids research.

[12]  G Benedetti,et al.  A genetic algorithm to search for optimal and suboptimal RNA secondary structures. , 1995, Biophysical chemistry.

[13]  C. Pleij,et al.  The computer simulation of RNA folding pathways using a genetic algorithm. , 1995, Journal of molecular biology.

[14]  R. Lück,et al.  Thermodynamic prediction of conserved secondary structure: application to the RRE element of HIV, the tRNA-like element of CMV and the mRNA of prion protein. , 1996, Journal of molecular biology.

[15]  D. Higgins,et al.  RAGA: RNA sequence alignment by genetic algorithm. , 1997, Nucleic acids research.

[16]  Laurie J. Heyer,et al.  Finding the most significant common sequence and structure motifs in a set of RNA sequences. , 1997, Nucleic acids research.

[17]  M. Huynen,et al.  Automatic detection of conserved RNA structure elements in complete RNA virus genomes. , 1998, Nucleic acids research.

[18]  J. Sabina,et al.  Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. , 1999, Journal of molecular biology.