An outlook on the fungal internal transcribed spacer sequences in GenBank and the introduction of a web-based tool for the exploration of fungal diversity.

The environmental and distributional data associated with fungal internal transcribed spacer (ITS) sequences in GenBank are investigated and a new web-based tool with which these sequences can be explored is introduced. All fungal ITS sequences in GenBank were classified as either identified to species level or insufficiently identified and compared using BLAST. The results are made available as a biweekly updated web service that can be queried to retrieve all insufficiently identified sequences (IIS) associated with any fungal genus. The most commonly available annotation items in GenBank are isolation source (55%); country of origin (50%); and specific host (38%). The molecular sampling of fungi shows a bias towards North America, Europe, China, and Japan whereas vast geographical areas remain effectively unexplored. Mycorrhizal and parasitic genera are on average associated with more IIS than are saprophytic taxa. Glomus, Alternaria, and Tomentella are the genera represented by the highest number of insufficiently identified ITS sequences in GenBank. The web service presented (http://andromeda.botany.gu.se/emerencia.html#genus_search) offers new means, particularly for mycorrhizal and plant pathogenic fungi, to examine the IIS in GenBank in a taxon-oriented framework and to explore their metadata in an easily accessible and time-efficient manner.

[1]  J. Cairney,et al.  Influence of long-term repeated prescribed burning on mycelial communities of ectomycorrhizal fungi. , 2006, The New phytologist.

[2]  Jeremy R. deWaard,et al.  Biological identifications through DNA barcodes , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[3]  R. Henrik Nilsson,et al.  Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification , 2008, Evolutionary bioinformatics online.

[4]  L. Tedersoo,et al.  Molecular and morphological diversity of pezizalean ectomycorrhiza. , 2006, The New phytologist.

[5]  Erik Kristiansson,et al.  Mining metadata from unidentified ITS sequences in GenBank: A case study in Inocybe (Basidiomycota) , 2008, BMC Evolutionary Biology.

[6]  T. Bruns,et al.  The molecular revolution in ectomycorrhizal ecology: peeking into the black‐box , 2001, Molecular ecology.

[7]  F. Richard,et al.  Sebacinales are common mycorrhizal associates of Ericaceae. , 2007, The New phytologist.

[8]  L. Koski,et al.  The Closest BLAST Hit Is Often Not the Nearest Neighbor , 2001, Journal of Molecular Evolution.

[9]  M. C. Aime,et al.  Molecular characterization of fungal endophytic morphospecies isolated from stems and pods of Theobroma cacao , 2006 .

[10]  R. Henrik Nilsson,et al.  Approaching the taxonomic affiliation of unidentified sequences in public databases – an example from the mycorrhizal fungi , 2005, BMC Bioinformatics.

[11]  David L. Hawksworth,et al.  Fungal diversity and its implications for genetic resource collections , 2005 .

[12]  Andrew P. Martin,et al.  Widespread occurrence and phylogenetic placement of a soil clone group adds a prominent new branch to the fungal tree of life. , 2008, Molecular phylogenetics and evolution.

[13]  R. Henrik Nilsson,et al.  Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective , 2006, PloS one.

[14]  P. Hoegger,et al.  Correct identification of wood-inhabiting fungi by ITS analysis , 2007 .

[15]  Nils Hallenberg,et al.  Preserving accuracy in GenBank , 2008 .

[16]  A. Urban,et al.  Sebacinales: a hitherto overlooked cosm of heterobasidiomycetes with a broad mycorrhizal potential. , 2004, Mycological research.

[17]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[18]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[19]  Robin Sen,et al.  UNITE: a database providing web-based methods for the molecular identification of ectomycorrhizal fungi. , 2005, The New phytologist.

[20]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[21]  R. Molina,et al.  Douglas-fir ectomycorrhizae in 40- and 400-year-old stands: mycobiont availability to late successional western hemlock , 2005, Mycorrhiza.

[22]  R. Kjøller Disproportionate abundance between ectomycorrhizal root tips and their associated mycelia. , 2006, FEMS microbiology ecology.

[23]  M. McCormick,et al.  Internal transcribed spacer primers and sequences for improved characterization of basidiomycetous orchid mycorrhizas. , 2008, The New phytologist.