Towards a unified paradigm for sequence‐based identification of fungi

The nuclear ribosomal internal transcribed spacer (ITS) region is the formal fungal barcode and in most cases the marker of choice for the exploration of fungal diversity in environmental samples. Two problems are particularly acute in the pursuit of satisfactory taxonomic assignment of newly generated ITS sequences: (i) the lack of an inclusive, reliable public reference data set and (ii) the lack of means to refer to fungal species, for which no Latin name is available in a standardized stable way. Here, we report on progress in these regards through further development of the UNITE database (http://unite.ut.ee) for molecular identification of fungi. All fungal species represented by at least two ITS sequences in the international nucleotide sequence databases are now given a unique, stable name of the accession number type (e.g. Hymenoscyphus pseudoalbidus|GU586904|SH133781.05FU), and their taxonomic and ecological annotations were corrected as far as possible through a distributed, third‐party annotation effort. We introduce the term ‘species hypothesis’ (SH) for the taxa discovered in clustering on different similarity thresholds (97–99%). An automatically or manually designated sequence is chosen to represent each such SH. These reference sequences are released (http://unite.ut.ee/repository.php) for use by the scientific community in, for example, local sequence similarity searches and in the QIIME pipeline. The system and the data will be updated automatically as the number of public fungal ITS sequences grows. We invite everybody in the position to improve the annotation or metadata associated with their particular fungal lineages of expertise to do so through the new Web‐based sequence management system in UNITE.

[1]  John L. Spouge,et al.  Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi , 2012, Proceedings of the National Academy of Sciences.

[2]  T. Sieber,et al.  Cryptic speciation in Hymenoscyphus albidus , 2011 .

[3]  Erik Kristiansson,et al.  Mining metadata from unidentified ITS sequences in GenBank: A case study in Inocybe (Basidiomycota) , 2008, BMC Evolutionary Biology.

[4]  R. Henrik Nilsson,et al.  Tidying Up International Nucleotide Sequence Databases: Ecological, Geographical and Sequence Quality Annotation of ITS Sequences of Mycorrhizal Fungi , 2011, PloS one.

[5]  Robin Sen,et al.  UNITE: a database providing web-based methods for the molecular identification of ectomycorrhizal fungi. , 2005, The New phytologist.

[6]  R. Henrik Nilsson,et al.  Progress in molecular and morphological taxon discovery in Fungi and options for formal classification of environmental sequences , 2011 .

[7]  T. Glenn Field guide to next‐generation DNA sequencers , 2011, Molecular ecology resources.

[8]  M. Pautasso Fungal under-representation is (indeed) diminishing in the life sciences , 2013 .

[9]  A. Cornish-Bowden Nomenclature for incompletely specified bases in nucleic acid sequences: recommendations 1984. , 1985, Nucleic acids research.

[10]  R. Henrik Nilsson,et al.  Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective , 2006, PloS one.

[11]  M. Blackwell The fungi: 1, 2, 3 ... 5.1 million species? , 2011, American journal of botany.

[12]  K. Hyde,et al.  Epitypification: should we epitypify? , 2008, Journal of Zhejiang University SCIENCE B.

[13]  R. Henrik Nilsson,et al.  Five simple guidelines for establishing basic authenticity and reliability of newly generated fungal ITS sequences. , 2012 .

[14]  H. Ross Principles of Numerical Taxonomy , 1964 .

[15]  Kessy Abarenkov,et al.  Fungal community analysis by high-throughput sequencing of amplified markers – a user's guide , 2013, The New phytologist.

[16]  Mark Blaxter,et al.  Defining operational taxonomic units using DNA barcode data , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[17]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[18]  Andy F. S. Taylor,et al.  The UNITE database for molecular identification of fungi--recent updates and future perspectives. , 2010, The New phytologist.

[19]  Nils Hallenberg,et al.  Preserving accuracy in GenBank , 2008 .

[20]  N. Dam Comment on Fungal under-representation is (slowly) diminishing in the life sciences , 2013 .

[21]  D. Bass,et al.  Three reasons to re-evaluate fungal diversity 'on Earth and in the ocean' , 2011 .

[22]  Mehrdad Hajibabaei,et al.  Next‐generation sequencing technologies for environmental DNA research , 2012, Molecular ecology.

[23]  Rob Knight,et al.  UCHIME improves sensitivity and speed of chimera detection , 2011, Bioinform..

[24]  L. Kedes,et al.  Nomenclature for incompletely specified bases in nucleic acid sequences. Recommendations 1984. Nomenclature Committee of the International Union of Biochemistry (NC-IUB). , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[25]  R. Henrik Nilsson,et al.  PlutoF—a Web Based Workbench for Ecological and Taxonomic Research, with an Online Implementation for Fungal ITS Sequences , 2010, Evolutionary Bioinformatics Online.

[26]  Rob Knight,et al.  Meeting Report: Fungal ITS Workshop (October 2012) , 2013, Standards in genomic sciences.

[27]  Ellen Larsson,et al.  Controversy over Hygrophorus cossus settled using ITS sequence data from 200 year-old type material. , 2004, Mycological research.

[28]  L. Tedersoo,et al.  Evolution of nutritional modes of Ceratobasidiaceae (Cantharellales, Basidiomycota) as revealed from publicly available ITS sequences , 2013 .

[29]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[30]  David S. Hibbett,et al.  Fungal systematics: is a new age of enlightenment at hand? , 2013, Nature Reviews Microbiology.

[31]  R. Vilgalys,et al.  A global meta‐analysis of Tuber ITS rDNA sequences: species diversity, host associations and long‐distance dispersal , 2010, Molecular ecology.