The ITS2 Database III—sequences and structures for phylogeny

The internal transcribed spacer 2 (ITS2) is a widely used phylogenetic marker. In the past, it was used mainly for species-level classification. Its wider applicability is now becoming apparent, and here the conserved structure of the RNA molecule plays a vital role. We have developed the ITS2 Database (http://its2.bioapps.biozentrum.uni-wuerzburg.de), which holds information about the sequence, structure and taxonomic classification of all ITS2 sequences in GenBank. In the new version, we use hidden Markov models (HMMs) to identify and delineate the ITS2, which led to a major redesign of the annotation pipeline. This allowed the identification of more than 160 000 correct full-length and more than 50 000 partial structures. In the web interface, these can now be searched with a modified BLAST that considers both sequence and structure, enabling rapid taxon sampling. Novel sequences can be annotated using the HMM-based approach and modelled according to multiple template structures. Sequences can also be searched for known and newly identified motifs. Together, the database and the web server form an exhaustive resource for ITS2-based phylogenetic analyses.
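
To illustrate what searching on "both sequence and structure" can mean in practice, the following is a minimal sketch of one possible encoding, not the database's actual implementation. It assumes the structure is given in dot-bracket notation and merges each nucleotide with its structural state into a single symbol, giving a 12-letter alphabet (4 nucleotides x 3 states) that a sequence-comparison tool could then treat like ordinary residues. The letters `A` to `L` and the function names `encode` and `is_balanced` are hypothetical choices made for this example.

```python
from itertools import product

NUCLEOTIDES = "ACGU"
STRUCT_STATES = "(.)"      # paired (opening), unpaired, paired (closing)
LETTERS = "ABCDEFGHIJKL"   # hypothetical labels for the 12 combined symbols

# Map every (nucleotide, structural state) pair to one letter.
ENCODING = dict(zip(product(NUCLEOTIDES, STRUCT_STATES), LETTERS))


def is_balanced(structure: str) -> bool:
    """Return True if the dot-bracket string has properly nested brackets."""
    depth = 0
    for char in structure:
        if char == "(":
            depth += 1
        elif char == ")":
            depth -= 1
            if depth < 0:
                return False
    return depth == 0


def encode(sequence: str, structure: str) -> str:
    """Merge a sequence and its dot-bracket structure into one string."""
    seq = sequence.upper().replace("T", "U")  # accept DNA or RNA input
    if len(seq) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    if not is_balanced(structure):
        raise ValueError("structure is not a balanced dot-bracket string")
    try:
        return "".join(ENCODING[pair] for pair in zip(seq, structure))
    except KeyError as exc:
        raise ValueError(f"unsupported symbol pair: {exc.args[0]}") from None


if __name__ == "__main__":
    print(encode("GCAU", "(..)"))
```

With such an encoding, two ITS2 entries that share a nucleotide at a position but differ in whether it is paired map to different symbols, so a comparison over the encoded strings is sensitive to structure as well as sequence.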
