The ITS2 Database

The internal transcribed spacer 2 (ITS2) has been used as a phylogenetic marker for more than two decades. As ITS2 research mainly focused on the very variable ITS2 sequence, it confined this marker to low-level phylogenetics only. However, the combination of the ITS2 sequence and its highly conserved secondary structure improves the phylogenetic resolution1 and allows phylogenetic inference at multiple taxonomic ranks, including species delimitation2-8. The ITS2 Database9 presents an exhaustive dataset of internal transcribed spacer 2 sequences from NCBI GenBank11 accurately reannotated10. Following an annotation by profile Hidden Markov Models (HMMs), the secondary structure of each sequence is predicted. First, it is tested whether a minimum energy based fold12 (direct fold) results in a correct, four helix conformation. If this is not the case, the structure is predicted by homology modeling13. In homology modeling, an already known secondary structure is transferred to another ITS2 sequence, whose secondary structure was not able to fold correctly in a direct fold. The ITS2 Database is not only a database for storage and retrieval of ITS2 sequence-structures. It also provides several tools to process your own ITS2 sequences, including annotation, structural prediction, motif detection and BLAST14 search on the combined sequence-structure information. Moreover, it integrates trimmed versions of 4SALE15,16 and ProfDistS17 for multiple sequence-structure alignment calculation and Neighbor Joining18 tree reconstruction. Together they form a coherent analysis pipeline from an initial set of sequences to a phylogeny based on sequence and secondary structure. In a nutshell, this workbench simplifies first phylogenetic analyses to only a few mouse-clicks, while additionally providing tools and data for comprehensive large-scale analyses.

[1]  Harald Meier,et al.  46. ARB: A Software Environment for Sequence Data , 2011 .

[2]  B. Merget,et al.  Internal Transcribed Spacer 2 (nu ITS2 rRNA) Sequence-Structure Phylogenetics: Towards an Automated Reconstruction of the Green Algal Tree of Life , 2011, PloS one.

[3]  B. Merget,et al.  A molecular phylogeny of Hypnales (Bryophyta) inferred from ITS2 sequence-structure data , 2010, BMC Research Notes.

[4]  Alexander Keller,et al.  The ITS2 Database III—sequences and structures for phylogeny , 2009, Nucleic Acids Res..

[5]  Frank Förster,et al.  Including RNA secondary structures improves accuracy and robustness in reconstruction of phylogenetic trees , 2010, Biology Direct.

[6]  Pierre Tufféry,et al.  BIOINFORMATICS ORIGINAL PAPER , 2022 .

[7]  J. Schultz,et al.  ITS2 sequence-structure analysis in phylogenetics: a how-to manual for molecular systematics. , 2009, Molecular phylogenetics and evolution.

[8]  Thomas Dandekar,et al.  5.8S-28S rRNA interaction and HMM-based ITS2 annotation. , 2009, Gene.

[9]  Tobias Müller,et al.  ProfDistS: (profile-) distance based phylogeny on sequence - structure alignments , 2008, Bioinform..

[10]  Thomas Dandekar,et al.  Synchronous visual analysis and editing of RNA sequence and secondary structure alignments using 4SALE , 2008, BMC Research Notes.

[11]  Michael Zuker,et al.  UNAFold: software for nucleic acid folding and hybridization. , 2008, Methods in molecular biology.

[12]  Thomas Dandekar,et al.  Distinguishing species. , 2007, RNA.

[13]  Tobias Müller,et al.  4SALE – A tool for synchronous RNA sequence and secondary structure alignment and editing , 2006, BMC Bioinformatics.

[14]  Thomas Dandekar,et al.  Homology modeling revealed more than 20,000 rRNA internal transcribed spacer 2 (ITS2) secondary structures. , 2005, RNA.

[15]  Tobias Müller,et al.  A common core of secondary structure of the internal transcribed spacer 2 (ITS2) throughout the Eukaryota. , 2005, RNA.

[16]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[17]  Sven Rahmann,et al.  Accurate and robust phylogeny estimation based on profile distances: a study of the Chlorophyceae (Chlorophyta) , 2004, BMC Evolutionary Biology.

[18]  K. Schleifer,et al.  ARB: a software environment for sequence data. , 2004, Nucleic acids research.

[19]  A. Coleman,et al.  ITS2 is a double-edged tool for eukaryote evolutionary comparisons. , 2003, Trends in genetics : TIG.

[20]  A. W. Coleman,et al.  The significance of a coincidence between evolutionary landmarks found in mating affinity and a DNA sequence. , 2000, Protist.

[21]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[22]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[23]  R. Sokal,et al.  A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY , 1965 .