4SALE – A tool for synchronous RNA sequence and secondary structure alignment and editing

Background: In sequence analysis, the multiple alignment forms the foundation of all subsequent analyses. Errors in an alignment can strongly influence every later step and can therefore lead to wrong predictions. Hand-crafted and hand-improved alignments are necessary and have become common practice. For RNA sequences, the primary sequence as well as a consensus secondary structure is often well known, e.g., the cloverleaf structure of tRNA. Recently, some alignment editors have been proposed that are able to include and model both kinds of information. However, with the advent of large numbers of reliable RNA sequences together with their solved secondary structures (available from, e.g., the ITS2 Database), we face the problem of handling sequences and their associated secondary structures synchronously.

Results: 4SALE fills this gap. The application allows fast synchronous alignment of sequences and secondary structures for large data sets and, for the first time, synchronous manual editing of aligned sequences and their secondary structures. This study describes an algorithm for the synchronous alignment of sequences and their associated secondary structures, as well as the main features of 4SALE used for further analyses and editing. 4SALE provides an optimal and unique starting point for every RNA sequence and structure analysis.

Conclusion: 4SALE, which provides a user-friendly and intuitive interface, is a comprehensive toolbox for RNA analysis based on sequence and secondary structure information. The program connects sequence and structure databases such as the ITS2 Database to phylogeny programs such as the CBCAnalyzer. 4SALE is written in Java and is therefore platform independent. The software is freely available and distributed from the website at http://4sale.bioapps.biozentrum.uni-wuerzburg.de
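To illustrate what handling a sequence and its secondary structure synchronously can mean in practice, the following Python sketch merges each nucleotide with its dot-bracket structure state into a single letter, so that any gap inserted into the merged string affects sequence and structure at the same column. This is only an illustration under stated assumptions: the abstract does not specify the encoding, the 12 letter codes `A` to `L` are arbitrary, and the names `encode` and `decode` are hypothetical rather than part of 4SALE.

```python
from itertools import product

NUCLEOTIDES = "ACGU"
STATES = "(.)"          # opening pair, unpaired, closing pair (dot-bracket notation)
CODES = "ABCDEFGHIJKL"  # assumption: 12 arbitrary letters, one per (nucleotide, state) pair

ENCODE = dict(zip(product(NUCLEOTIDES, STATES), CODES))
DECODE = {code: pair for pair, code in ENCODE.items()}


def encode(seq: str, struct: str) -> str:
    """Merge a sequence and its dot-bracket structure into one pseudo-sequence."""
    if len(seq) != len(struct):
        raise ValueError("sequence and structure must have the same length")
    return "".join(ENCODE[(n, s)] for n, s in zip(seq.upper(), struct))


def decode(aligned: str) -> tuple[str, str]:
    """Split an aligned pseudo-sequence back into sequence and structure rows.

    A gap ('-') in the pseudo-sequence becomes a gap in both rows,
    which keeps the two rows column-synchronous.
    """
    seq, struct = [], []
    for ch in aligned:
        n, s = ("-", "-") if ch == "-" else DECODE[ch]
        seq.append(n)
        struct.append(s)
    return "".join(seq), "".join(struct)


if __name__ == "__main__":
    merged = encode("GCAUGC", "((..))")
    # Pretend some alignment or manual edit inserted a gap at column 3.
    edited = merged[:2] + "-" + merged[2:]
    print(decode(edited))
```

Because the merged string is an ordinary one-row sequence over a 12-letter alphabet, it could in principle be passed to a standard alignment program with a suitable 12 × 12 scoring matrix and then decoded back into the two synchronized rows.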
