Finding needles in haystacks: linking scientific names, reference specimens and molecular data for Fungi

DNA phylogenetic comparisons have shown that morphology-based species recognition often underestimates fungal diversity. Therefore, the need for accurate DNA sequence data, tied to both correct taxonomic names and clearly annotated specimen data, has never been greater. Furthermore, the growing number of molecular ecology and microbiome projects using high-throughput sequencing require fast and effective methods for en masse species assignments. In this article, we focus on selecting and re-annotating a set of marker reference sequences that represent each currently accepted order of Fungi. The particular focus is on sequences from the internal transcribed spacer region in the nuclear ribosomal cistron, derived from type specimens and/or ex-type cultures. Re-annotated and verified sequences were deposited in a curated public database at the National Center for Biotechnology Information (NCBI), namely the RefSeq Targeted Loci (RTL) database, and will be visible during routine sequence similarity searches with NR_prefixed accession numbers. A set of standards and protocols is proposed to improve the data quality of new sequences, and we suggest how type and other reference sequences can be used to improve identification of Fungi. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353

R. Henrik Nilsson | Kazuaki Tanaka | Qing Cai | Kerstin Voigt | Pedro W. Crous | M. Catherine Aime | Kevin D. Hyde | Andrew N. Miller | Jie Chen | Kessy Abarenkov | Scott Federhen | Vincent Robert | Gianluigi Cardinali | Conrad L. Schoch | Bart Buyck | Duong Vu | Urmas Kõljalg | Wieland Meyer | Barbara Robbertse | Laszlo Irinyi | Paul M. Kirk | Keith A. Seifert | Cécile Gueidan | Soili Stenroos | Teun Boekhout | Tamás Papp | Cobus M. Visagie | Leho Tedersoo | Jos Houbraken | Michael Weiß | Peter M. Letcher | Giuseppina Mulè | Ning Zhang | Michael J. Wingfield | Peter R. Johnston | Marieka Gryzenhout | Gábor M. Kovács | Csaba Vágvölgyi | Gareth W. Griffith | Karen Hughes | Hiran A. Ariyawansa | Martin Bidartondo | Ana Crespo | Ulrike Damm | Z. Wilhelm De Beer | Bryn T. M. Dentinger | Pradeep K. Divakar | Margarita Dueñas | Nicolas Feau | Katerina Fliegerova | Miguel A. García | Zai-Wei Ge | Johannes Z. Groenewald | Marizeth Groenewald | Martin Grube | Liangdong Guo | Sarah Hambleton | Richard Hamelin | Karen Hansen | Valérie Hofstetter | Seung-Beom Hong | Patrik Inderbitzin | Samantha C. Karunarathna | Ekaphan Kraichak | Krisztina Krizsan | Cletus P. Kurtzman | Karl-Henrik Larsson | Steven Leavitt | Kare Liimatainen | Jian-Kui Liu | D. Jean Lodge | Janet Jennifer Luangsa-ard | H. Thorsten Lumbsch | Sajeewa S. N. Maharachchikumbura | Dimuthu Manamgoda | María P. Martín | Andrew M. Minnis | Jean-Marc Moncalvo | Karen K. Nakasone | Tuula Niskanen | Ibai Olariaga | Tamás Petkovits | Raquel Pino-Bodas | Martha J. Powell | Huzefa A. Raja | Dirk Redecker | J. M. Sarmiento-Ramirez | Bhushan Shrestha | Benjamin Stielow | Sung-Oui Suh | M. Teresa Telleria | Dhanushka Udayanga | Wendy A. Untereiner | Javier Diéguez Uribeondo | Krishna V. Subbarao | Donald M. Walker | Bevan S. Weir | Nalin N. Wijayawardene | J. P. Xu | Zhu L. Yang | Wen-Ying Zhuang | D. J. Lodge | S. Federhen | P. Kirk | M. Grube | G. Griffith | R. Hamelin | G. Cardinali | K. Hyde | M. Wingfield | ZHU-LIANG Yang | J. Moncalvo | C. Gueidan | C. Schoch | M. Powell | B. Buyck | S. Karunarathna | P. Crous | P. Johnston | J. Houbraken | K. Liimatainen | L. Tedersoo | S. Maharachchikumbura | M. Dueñas | M. T. Telleria | N. Wijayawardene | A. Miller | V. Hofstetter | C. Kurtzman | H. Lumbsch | J. Z. Groenewald | Kazuaki Tanaka | H. Raja | I. Olariaga | B. Dentinger | S. Hambleton | K. Hansen | M. C. Aime | Z. Ge | K. Hughes | K. Larsson | B. Robbertse | U. Damm | B. Stielow | R. H. Nilsson | D. Manamgoda | T. Niskanen | D. Udayanga | U. Kõljalg | K. Abarenkov | H. Ariyawansa | K. Fliegerová | G. Kovács | T. Petkovits | C. Vágvölgyi | T. Papp | K. Seifert | V. Robert | M. Bidartondo | T. Boekhout | M. Weiß | W. Untereiner | M. Gryzenhout | S. Suh | P. Inderbitzin | S. Stenroos | N. Feau | K. Voigt | M. Groenewald | D. Walker | S. Leavitt | A. Crespo | P. Divakar | K. Subbarao | D. Redecker | L. Irinyi | W. Meyer | D. Vu | Ning Zhang | G. Mulè | K. Krizsan | Seung-Beom Hong | Liangdong Guo | Ekaphan Kraichak | C. Visagie | W. Zhuang | M. García | R. Pino-Bodas | K. Nakasone | J. Luangsa-ard | B. Shrestha | Bevan Simon Weir | A. M. Minnis | J. Xu | Q. Cai | M. Martín | Z. W. Beer | Jie Chen | Jiankui Liu | Zhu-Liang Yang | C. M. Visagie

[1]  R. Henrik Nilsson,et al.  Five simple guidelines for establishing basic authenticity and reliability of newly generated fungal ITS sequences. , 2012 .

[2]  D. Hibbett,et al.  Phylogenetic species recognition and species concepts in fungi. , 2000, Fungal genetics and biology : FG & B.

[3]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[4]  Rob Knight,et al.  Meeting Report: Fungal ITS Workshop (October 2012) , 2013, Standards in genomic sciences.

[5]  Robin Sen,et al.  UNITE: a database providing web-based methods for the molecular identification of ectomycorrhizal fungi. , 2005, The New phytologist.

[6]  A. Cozijnsen,et al.  Organisation of ribosomal DNA in the ascomycete Leptosphaeria maculans. , 1997, Microbiological research.

[7]  J Davison,et al.  The online database MaarjAM reveals global and ecosystemic distribution patterns in arbuscular mycorrhizal fungi (Glomeromycota). , 2010, The New phytologist.

[8]  Prosanta Chakrabarty,et al.  Genetypes: a concept to help integrate molecular phylogenetics and taxonomy , 2010 .

[9]  Scott Federhen,et al.  Comments on the paper by Pleijel et al. (2008): Vouching for GenBank. , 2009, Molecular phylogenetics and evolution.

[10]  Gianluigi Cardinali,et al.  Centrality of Objects in a Multidimensional Space and its Effects on Distance-Based Biological Classifications , 2011 .

[11]  Koichi Makimura,et al.  Cryptococcuslacticolor sp. nov. and Rhodotorula oligophaga sp. nov., novel yeasts isolated from the nasal smear microbiota of Queensland koalas kept in Japanese zoological parks , 2013, Antonie van Leeuwenhoek.

[12]  H. Chandler Database , 1985 .

[13]  Michael Weiss,et al.  Towards a unified paradigm for sequence‐based identification of fungi , 2013, Molecular ecology.

[14]  Cletus P Kurtzman Use of gene sequence analyses and genome comparisons for yeast systematics. , 2014, International journal of systematic and evolutionary microbiology.

[15]  A. Schüßler,et al.  DNA barcoding of arbuscular mycorrhizal fungi. , 2010, The New phytologist.

[16]  Michael Weiss,et al.  A higher-level phylogenetic classification of the Fungi. , 2007, Mycological research.

[17]  M. Blackwell The fungi: 1, 2, 3 ... 5.1 million species? , 2011, American journal of botany.

[18]  L. Kiss,et al.  Limits of nuclear ribosomal DNA internal transcribed spacer (ITS) sequences as species barcodes for Fungi , 2012, Proceedings of the National Academy of Sciences.

[19]  Roderic D.M. Page,et al.  BioNames: linking taxonomy, texts, and trees , 2013, PeerJ.

[20]  Lorelei L. Norvell,et al.  Report of the Nomenclature Committee for Fungi 19: Official repositories for fungal names , 2013 .

[21]  John Bissett,et al.  An oligonucleotide barcode for species identification in Trichoderma and Hypocrea. , 2005, Fungal genetics and biology : FG & B.

[22]  N. Baeshen,et al.  Biological Identifications Through DNA Barcodes , 2012 .

[23]  W. John Kress,et al.  Semantic tagging of and semantic enhancements to systematics papers: ZooKeys working examples , 2010, ZooKeys.

[24]  Giovanni Nicolotti,et al.  Ecological Constraints Limit the Fitness of Fungal Hybrids in the Heterobasidion annosum Species Complex , 2007, Applied and Environmental Microbiology.

[25]  Jason E Stajich,et al.  Toward genome-enabled mycology , 2013, Mycologia.

[26]  Andrew W. Claridge,et al.  Comparative taxonomy of desert truffles of the Australian outback and the African Kalahari , 2010, Mycological Progress.

[27]  B. Thiers,et al.  Index Herbariorum: a global directory of public herbaria and associated staff. New York Botanical Garden's Virtual Herbarium. , 2009 .

[28]  A. Cornish-Bowden Nomenclature for incompletely specified bases in nucleic acid sequences: recommendations 1984. , 1985, Nucleic acids research.

[29]  David L. Hawksworth,et al.  Possible house-keeping and other draft proposals to clarify or enhance the naming of fungi within the International Code of Nomenclature for algae, fungi, and plants (ICN) , 2014, IMA fungus.

[30]  P. Kirk,et al.  International Code of Nomenclature for algae, fungi, and plants (Melbourne Code) , 2012 .

[31]  R. Henrik Nilsson,et al.  PlutoF—a Web Based Workbench for Ecological and Taxonomic Research, with an Online Implementation for Fungal ITS Sequences , 2010, Evolutionary Bioinformatics Online.

[32]  Guy Cochrane,et al.  The International Nucleotide Sequence Database Collaboration , 2011, Nucleic Acids Res..

[33]  M. Weiß,et al.  Intragenomic variation of fungal ribosomal genes is higher than previously thought. , 2008, Molecular biology and evolution.

[34]  Portland Press Ltd Nomenclature Committee for the International Union of Biochemistry (NC-IUB). Nomenclature for incompletely specified bases in nucleic acid sequences. Recommendations 1984. , 1985, Molecular biology and evolution.

[35]  Daniel H. Huson,et al.  CREST – Classification Resources for Environmental Sequence Tags , 2012, PloS one.

[36]  Nico Cellinese,et al.  Evolutionary informatics: unifying knowledge about the diversity of life. , 2012, Trends in ecology & evolution.

[37]  Pedro W. Crous,et al.  Cercospora Leaf Spot of Sugar Beet and Related Species , 2010 .

[38]  Vincent Robert,et al.  Identification of Fungal DNA Barcode Targets and PCR Primers Based on Pfam Protein Families and Taxonomic Hierarchy , 2011 .

[39]  Noel H. Holmgren,et al.  Index Herbariorum: A global directory of public herbaria and associated staff , 1998 .

[40]  Ellen Larsson,et al.  Controversy over Hygrophorus cossus settled using ITS sequence data from 200 year-old type material. , 2004, Mycological research.

[41]  Lakshmi M. Akella,et al.  NetiNeti: discovery of scientific names from text using machine learning methods , 2010, BMC Bioinformatics.

[42]  G.H.J. Kema,et al.  Zymoseptoria gen. nov.: a new genus to accommodate Septoria-like species occurring on graminicolous hosts , 2011, Persoonia.

[43]  David L. Hawksworth,et al.  Mycospeak and Biobabble , 2013 .

[44]  Pedro W. Crous,et al.  MycoBank gearing up for new horizons , 2013, IMA fungus.

[45]  S. Stenroos,et al.  Successful DNA sequencing of a 75 year-old herbarium specimen of Aspicilia aschabadensis (J. Steiner) Mereschk. , 2010, The Lichenologist.

[46]  Lawrence M. Page,et al.  GenSeq: An updated nomenclature and ranking for genetic sequences from type and non-type sources , 2013, ZooKeys.

[47]  David S. Hibbett,et al.  Toward Sequence-Based Classification of Fungal Species , 2013 .

[48]  G. Cochrane,et al.  The International Nucleotide Sequence Database Collaboration , 2011, Nucleic Acids Res..

[49]  S. Carroll,et al.  Genome-scale approaches to resolving incongruence in molecular phylogenies , 2003, Nature.

[50]  J. Gauthier,et al.  The genus , 2006, A Gateway to Number Theory.

[51]  D. Hibbett,et al.  Assembling the fungal tree of life: progress, classification, and evolution of subcellular traits. , 2004, American journal of botany.

[52]  Anthony Bretaudeau,et al.  PHYMYCO-DB: A Curated Database for Analyses of Fungal Diversity and Evolution , 2012, PloS one.

[53]  Conrad L. Schoch,et al.  Filling Gaps in Biodiversity Knowledge for Macrofungi: Contributions and Assessment of an Herbarium Collection DNA Barcode Sequencing Project , 2013, PloS one.

[54]  Stephen W. Peterson Aspergillus and Penicillium identification using DNA sequences: barcode or MLST? , 2012, Applied Microbiology and Biotechnology.

[55]  John L. Spouge,et al.  Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi , 2012, Proceedings of the National Academy of Sciences.

[56]  Kenji Matsuura,et al.  Reconstructing the early evolution of Fungi using a six-gene phylogeny , 2006, Nature.

[57]  U. Jondelius,et al.  Phylogenies without roots? A plea for the use of vouchers in molecular phylogenetic studies. , 2008, Molecular phylogenetics and evolution.

[58]  Nicholas P Money,et al.  Against the naming of fungi. , 2013, Fungal biology.

[59]  Kuan-Liang Liu,et al.  Accurate, Rapid Taxonomic Classification of Fungal Large-Subunit rRNA Genes , 2011, Applied and Environmental Microbiology.

[60]  P. Hebert,et al.  bold: The Barcode of Life Data System (http://www.barcodinglife.org) , 2007, Molecular ecology notes.

[61]  Bryn T. M. Dentinger,et al.  Comparing COI and ITS as DNA Barcode Markers for Mushrooms and Allies (Agaricomycotina) , 2011, PloS one.

[62]  Tom Hsiang,et al.  Intergeneric transfer of ribosomal genes between two fungi , 2008, BMC Evolutionary Biology.

[63]  M. Wrzosek,et al.  DNA barcoding in Mucorales: an inventory of biodiversity , 2013, Persoonia.

[64]  P. Crous,et al.  The genus Cladosporium , 2012, Studies in mycology.

[65]  R. Henrik Nilsson,et al.  Progress in molecular and morphological taxon discovery in Fungi and options for formal classification of environmental sequences , 2011 .

[66]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[67]  Pedro W. Crous,et al.  Cercospora Speciation and Host Range , 2010 .

[68]  Pedro W. Crous,et al.  A without-prejudice list of generic names of fungi for protection under the International Code of Nomenclature for algae, fungi, and plants , 2013, IMA fungus.

[69]  T. Bruns,et al.  ITS primers with enhanced specificity for basidiomycetes ‐ application to the identification of mycorrhizae and rusts , 1993, Molecular ecology.

[70]  R. Henrik Nilsson,et al.  Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective , 2006, PloS one.

[71]  Bernard De Baets,et al.  StrainInfo introduces electronic passports for microorganisms. , 2014, Systematic and applied microbiology.

[72]  P. Hebert,et al.  bold: The Barcode of Life Data System (http://www.barcodinglife.org) , 2007, Molecular ecology notes.

[73]  Priscila Chaverri,et al.  Species delimitation in fungal endophyte diversity studies and its implications in ecological and biogeographic inferences , 2011, Molecular ecology.

[74]  Pedro W. Crous,et al.  Phylogeny and taxonomy of Cladosporium-like hyphomycetes, including Davidiella gen. nov., the teleomorph of Cladosporium s. str. , 2003, Mycological Progress.

[75]  Patrick M. Gillevet,et al.  Oral Mycobiome Analysis of HIV-Infected Patients: Identification of Pichia as an Antagonist of Opportunistic Fungi , 2014, PLoS pathogens.

[76]  David S. Hibbett,et al.  Fungal systematics: is a new age of enlightenment at hand? , 2013, Nature Reviews Microbiology.

[77]  María P Martín,et al.  The diversity of Terfezia desert truffles: new species and a highly variable species complex with intrasporocarpic nrDNA ITS heterogeneity , 2011, Mycologia.

[78]  Petr Baldrian,et al.  Estimation of fungal biomass in forest litter and soil , 2013 .

[79]  Amy Y. Rossman,et al.  Taiwanascus samuelsii sp. nov., an addition to Niessliaceae from the Western Ghats, Kerala, India , 2013, IMA fungus.

[80]  Tor Carlsen,et al.  Employing 454 amplicon pyrosequencing to reveal intragenomic divergence in the internal transcribed spacer rDNA region in fungi , 2013, Ecology and evolution.

[81]  M. Wrzosek,et al.  Taxonomic position of Mucor hiemalis f. luteus. , 2010 .

[82]  C. Schardl,et al.  Interspecific hybridization in plant‐associated fungi and oomycetes: a review , 2003, Molecular ecology.

[83]  Kerstin Voigt,et al.  Data Partitions, Bayesian Analysis and Phylogeny of the Zygomycetous Fungal Family Mortierellaceae, Inferred from Nuclear Ribosomal DNA Sequences , 2011, PloS one.

[84]  C. Kurtzman,et al.  Phylogenetic relationships among yeasts of the 'Saccharomyces complex' determined from multigene sequence analyses. , 2003, FEMS yeast research.

[85]  Kendra Baumgartner,et al.  Evolutionary consequences of putative intra-and interspecific hybridization in agaric fungi , 2013, Mycologia.

[86]  Erik Kristiansson,et al.  Incorporating molecular data in fungal systematics: a guide for aspiring researchers , 2013, 1302.3244.

[87]  David L. Hawksworth,et al.  The fungal dimension of biodiversity: magnitude, significance, and conservation , 1991 .

[88]  Jack W. McFarland,et al.  A first comprehensive census of fungi in soil reveals both hyperdiversity and fine-scale niche partitioning , 2014 .

[89]  Thomas Hackl,et al.  ITS2 database IV: interactive taxon sampling for internal transcribed spacer 2 based phylogenies. , 2012, Molecular phylogenetics and evolution.

[90]  Austen R. D. Ganley,et al.  Highly efficient concerted evolution in the ribosomal DNA repeats: total rDNA repeat variation revealed by whole-genome shotgun sequence data. , 2007, Genome research.

[91]  R. Cruickshank,et al.  The seven deadly sins of DNA barcoding , 2012, Molecular ecology resources.

[92]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[93]  R. Henrik Nilsson,et al.  Tasting Soil Fungal Diversity with Earth Tongues: Phylogenetic Test of SATé Alignments for Environmental ITS Data , 2011, PloS one.

[94]  B. Dujon,et al.  Genome evolution in yeasts , 2004, Nature.

[95]  Linda D. Strausbaugh,et al.  Redefining the Human Oral Mycobiome with Improved Practices in Amplicon-based Taxonomy: Discovery of Malassezia as a Prominent Commensal , 2014, PloS one.

[96]  Yong-Hwan Lee,et al.  Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing , 2010, Nucleic Acids Res..

[97]  T. A. Hall,et al.  BIOEDIT: A USER-FRIENDLY BIOLOGICAL SEQUENCE ALIGNMENT EDITOR AND ANALYSIS PROGRAM FOR WINDOWS 95/98/ NT , 1999 .

[98]  Seung-Beom Hong,et al.  The Amsterdam Declaration on Fungal Nomenclature , 2011, IMA fungus.

[99]  M. C. Aime,et al.  A new lineage in Pucciniomycotina: class Tritirachiomycetes, order Tritirachiales, family Tritirachiaceae , 2011, Mycologia.

[100]  Nils Hallenberg,et al.  Preserving accuracy in GenBank , 2008 .

[101]  John McNeill,et al.  Names of fungal species with the same epithet applied to different morphs: how to treat them , 2013, IMA fungus.

[102]  Graziano Pesole,et al.  Reference databases for taxonomic assignment in metagenomics , 2012, Briefings Bioinform..

[103]  M. Bidartondo,et al.  How to know unknown fungi: the role of a herbarium. , 2009, The New phytologist.

[104]  R. Petersen,et al.  Using heterozygosity to estimate a percentage DNA sequence similarity for environmental species' delimitation across basidiomycete fungi. , 2009, The New phytologist.

[105]  R. Henrik Nilsson,et al.  Improved software detection and extraction of ITS1 and ITS2 from ribosomal ITS sequences of fungi and other eukaryotes for analysis of environmental sequencing data , 2013 .

[106]  P. Hebert,et al.  Barcoding animal life: cytochrome c oxidase subunit 1 divergences among closely related species , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[107]  D J Patterson,et al.  Names are key to the big new biology. , 2010, Trends in ecology & evolution.

[108]  John Wieczorek,et al.  Darwin Core: An Evolving Community-Developed Biodiversity Data Standard , 2012, PloS one.