ITS2 database IV: interactive taxon sampling for internal transcribed spacer 2 based phylogenies.

The first step of any molecular phylogenetic analysis is the selection of the species and sequences to be included, the taxon sampling. Already here different pitfalls exist. Sequences can contain errors, annotations in databases can be inaccurate and even the taxonomic classification of a species can be wrong. Usually, these artefacts become evident only after calculation of the phylogenetic tree. Following, the taxon sampling has to be corrected iteratively. This can become tedious and time consuming, as in most cases the taxon sampling is de-coupled from the further steps of the phylogenetic analysis. Here, we present the ITS2 Workbench (http://its2.bioapps.biozentrum.uni-wuerzburg.de/), which eliminates this problem by a tight integration of taxon sampling, secondary structure prediction, multiple alignment and phylogenetic tree calculation. The ITS2 Workbench has access to more than 280,000 ITS2 sequences and their structures provided by the ITS2 database enabling sequence-structure based alignment and tree reconstruction. This allows the interactive improvement of the taxon sampling throughout the whole phylogenetic tree reconstruction process. Thus, the ITS2 Workbench enables a fast, interactive and iterative taxon sampling leading to more accurate ITS2 based phylogenies.

[1]  Michael Zuker,et al.  UNAFold: software for nucleic acid folding and hybridization. , 2008, Methods in molecular biology.

[2]  Michael Zuker,et al.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information , 1981, Nucleic Acids Res..

[3]  W. Ludwig,et al.  SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB , 2007, Nucleic acids research.

[4]  A. Coleman,et al.  The Internal Transcribed Spacer 2 Exhibits a Common Secondary Structure in Green Algae and Flowering Plants , 1997, Journal of Molecular Evolution.

[5]  Alexander Keller,et al.  The ITS2 Database III—sequences and structures for phylogeny , 2009, Nucleic Acids Res..

[6]  A. Coleman,et al.  ITS2 is a double-edged tool for eukaryote evolutionary comparisons. , 2003, Trends in genetics : TIG.

[7]  Thomas Dandekar,et al.  Homology modeling revealed more than 20,000 rRNA internal transcribed spacer 2 (ITS2) secondary structures. , 2005, RNA.

[8]  Pierre Tufféry,et al.  BIOINFORMATICS ORIGINAL PAPER , 2022 .

[9]  Tobias Müller,et al.  4SALE – A tool for synchronous RNA sequence and secondary structure alignment and editing , 2006, BMC Bioinformatics.

[10]  R. Raff,et al.  Molecular phylogeny of the animal kingdom. , 1988, Science.

[11]  B. G. Baldwin Phylogenetic utility of the internal transcribed spacers of nuclear ribosomal DNA in plants: an example from the compositae. , 1992, Molecular phylogenetics and evolution.

[12]  Indra Neil Sarkar,et al.  The impact of taxon sampling on phylogenetic inference: a review of two decades of controversy , 2012, Briefings Bioinform..

[13]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[14]  Tobias Müller,et al.  A common core of secondary structure of the internal transcribed spacer 2 (ITS2) throughout the Eukaryota. , 2005, RNA.

[15]  Tobias Müller,et al.  ProfDistS: (profile-) distance based phylogeny on sequence - structure alignments , 2008, Bioinform..

[16]  J. Claverie,et al.  BLAST-EXPLORER helps you building datasets for phylogenetic analysis , 2010, BMC Evolutionary Biology.

[17]  Frank Förster,et al.  Including RNA secondary structures improves accuracy and robustness in reconstruction of phylogenetic trees , 2010, Biology Direct.

[18]  J. Schultz,et al.  ITS2 sequence-structure analysis in phylogenetics: a how-to manual for molecular systematics. , 2009, Molecular phylogenetics and evolution.

[19]  H. Philippe,et al.  Resolving Difficult Phylogenetic Questions: Why More Sequences Are Not Enough , 2011, PLoS biology.

[20]  Rodrigo Lopez,et al.  A new bioinformatics analysis tools framework at EMBL–EBI , 2010, Nucleic Acids Res..

[21]  B. Michot,et al.  Ribosomal internal transcribed spacer 2 (ITS2) exhibits a common core of secondary structure in vertebrates and yeast. , 1999, Nucleic acids research.

[22]  Jean-Michel Claverie,et al.  Phylogeny.fr: robust phylogenetic analysis for the non-specialist , 2008, Nucleic Acids Res..

[23]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[24]  K. Schleifer,et al.  ARB: a software environment for sequence data. , 2004, Nucleic acids research.

[25]  F. Delsuc,et al.  Phylogenomics and the reconstruction of the tree of life , 2005, Nature Reviews Genetics.

[26]  Annette W. Coleman,et al.  Pan-eukaryote ITS2 homologies revealed by RNA secondary structure , 2007, Nucleic acids research.