ConStruct: a tool for thermodynamic controlled prediction of conserved secondary structure.

A tool for predicting the conserved secondary structure of a set of homologous single-stranded RNAs is presented. For each RNA in the set, the structure distribution is calculated and stored in a base-pair probability matrix. Gaps resulting from a multiple sequence alignment of the RNA set are then introduced into the individual probability matrices. These 'aligned' probability matrices are summed to give a consensus probability matrix that emphasizes the conserved structural elements of the RNA set. Because the multiple sequence alignment is computed without any structural constraints, it may introduce gaps into the individual probability matrices at positions that disrupt a common consensus structure. Such misalignments are easily recognized, and the tool's graphical user interface allows them to be removed from the individual probability matrices by optimizing the sequence alignment with respect to a structural alignment. A consensus structure is then extracted from the consensus probability matrix and can be viewed in three different graphical representations. The functionality of the tool is demonstrated using a small set of U7 RNAs, which are involved in 3'-end processing of histone mRNA precursors. Further results are listed in the Supplementary Material. Advantages and drawbacks of the tool are discussed in comparison with several other algorithms.
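
The gap-insertion and summation steps described above can be illustrated with a minimal Python sketch. It is not ConStruct's implementation: the function names `gap_matrix` and `consensus_matrix`, the use of `-` as the gap character, and the toy probabilities are all assumptions made for illustration. In practice the input matrices would come from a partition-function folding program.

```python
def gap_matrix(probs, aligned_seq, gap="-"):
    """Expand an n x n base-pair probability matrix to alignment coordinates.

    probs       -- square matrix indexed by ungapped sequence position
    aligned_seq -- the same sequence as it appears in the alignment
    Rows and columns at gap positions are left at probability 0.
    """
    # Alignment columns that hold a real nucleotide, in sequence order.
    cols = [i for i, ch in enumerate(aligned_seq) if ch != gap]
    if len(cols) != len(probs):
        raise ValueError("matrix size does not match ungapped sequence length")
    width = len(aligned_seq)
    out = [[0.0] * width for _ in range(width)]
    for i, ci in enumerate(cols):
        for j, cj in enumerate(cols):
            out[ci][cj] = probs[i][j]
    return out


def consensus_matrix(aligned_matrices):
    """Sum equally sized 'aligned' matrices element by element."""
    width = len(aligned_matrices[0])
    total = [[0.0] * width for _ in range(width)]
    for matrix in aligned_matrices:
        for i in range(width):
            for j in range(width):
                total[i][j] += matrix[i][j]
    return total


# Toy input (hypothetical values): a 3-nt RNA pairing its first and last
# base with probability 0.5, and a 4-nt homologue doing so with 0.25.
p1 = [[0.0, 0.0, 0.5],
      [0.0, 0.0, 0.0],
      [0.5, 0.0, 0.0]]
p2 = [[0.0, 0.0, 0.0, 0.25],
      [0.0, 0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0, 0.0],
      [0.25, 0.0, 0.0, 0.0]]

aligned = [gap_matrix(p1, "GA-C"), gap_matrix(p2, "GAAC")]
consensus = consensus_matrix(aligned)
```

In this toy case the gap shifts the pair of the shorter RNA from position (0, 2) to alignment columns (0, 3), where it coincides with the pair of the longer RNA, so the two probabilities add up in the consensus matrix. A misplaced gap would instead spread them over different cells, which is the kind of misalignment the abstract says is corrected interactively.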
