Where is the unseen fungal diversity hidden? A study of Mortierella reveals a large contribution of reference collections to the identification of fungal environmental sequences.

• Estimation of the proportion of undescribed fungal taxa is an issue that has remained unresolved for many decades. Several very different estimates have been published, and the relative contributions of traditional taxonomic and next-generation sequencing (NGS) techniques to species discovery have also been called into question recently. • Here, we addressed the question of what proportion of hitherto unidentifiable molecular operational taxonomic units (MOTUs) have already been described but not sequenced, and how many of them represent truly undescribed lineages. We accomplished this by modeling the effects of increasing type strain sequencing effort on the number of identifiable MOTUs of the widespread soil fungus Mortierella. • We found a nearly linear relationship between the number of type strains sequenced and the number of identifiable MOTUs. Using this relationship, we made predictions about the total number of Mortierella species and found that it was very close to the number of described species in Mortierella. • These results suggest that the unusually high number of unidentifiable MOTUs in environmental sequencing projects can be, at least in some fungal groups, ascribed to a lag in type strain and specimen sequencing rather than to a high number of undescribed species.

[1]  Thomas J. White,et al.  PCR protocols: a guide to methods and applications. , 1990 .

[2]  Paul M Kirk,et al.  Fungal ecology catches fire. , 2009, The New phytologist.

[3]  Michael P. Cummings,et al.  PAUP* [Phylogenetic Analysis Using Parsimony (and Other Methods)] , 2004 .

[4]  R. Henrik Nilsson,et al.  Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification , 2008, Evolutionary bioinformatics online.

[5]  E. Boa,et al.  Ainsworth and Bisby's Dictionary of the Fungi , 1998 .

[6]  R. Henrik Nilsson,et al.  Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective , 2006, PloS one.

[7]  L. Tedersoo,et al.  454 Pyrosequencing and Sanger sequencing of tropical mycorrhizal fungi provide similar results but reveal substantial methodological biases. , 2010, The New phytologist.

[8]  G. Mueller,et al.  Fungal biodiversity: what do we know? What can we predict? , 2007, Biodiversity and Conservation.

[9]  R. Henrik Nilsson,et al.  Progress in molecular and morphological taxon discovery in Fungi and options for formal classification of environmental sequences , 2011 .

[10]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[11]  E. Larsson,et al.  Inocybe spuria, a new species in section Rimosae from boreal coniferous forests , 2009 .

[12]  T. White Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics , 1990 .

[13]  Andy F. S. Taylor,et al.  The UNITE database for molecular identification of fungi--recent updates and future perspectives. , 2010, The New phytologist.

[14]  Robin Sen,et al.  UNITE: a database providing web-based methods for the molecular identification of ectomycorrhizal fungi. , 2005, The New phytologist.

[15]  Rytas Vilgalys,et al.  Fungal Community Analysis by Large-Scale Sequencing of Environmental Samples , 2005, Applied and Environmental Microbiology.

[16]  Nils Hallenberg,et al.  Preserving accuracy in GenBank , 2008 .

[17]  Dennis R. Livesay,et al.  Probalign: multiple sequence alignment using partition function posterior probabilities , 2006, Bioinform..

[18]  F. Martin,et al.  454 Pyrosequencing analyses of forest soils reveal an unexpectedly high fungal diversity. , 2009, The New phytologist.

[19]  D. Hawksworth The magnitude of fungal diversity: the 1.5 million species estimate revisited * * Paper presented at , 2001 .

[20]  M. Zobel,et al.  Large-scale parallel 454 sequencing reveals host ecological group specificity of arbuscular mycorrhizal fungi in a boreonemoral forest. , 2009, The New phytologist.

[21]  B. Lindahl,et al.  Production of ectomycorrhizal mycelium peaks during canopy closure in Norway spruce forests. , 2010, The New phytologist.

[22]  Kathryn F. Beal,et al.  The Staden package, 1998. , 2000, Methods in molecular biology.

[23]  Patrick M. Gillevet,et al.  Characterization of the Oral Fungal Microbiome (Mycobiome) in Healthy Individuals , 2010, PLoS pathogens.

[24]  K. Jones,et al.  Massively parallel 454 sequencing indicates hyperdiverse fungal communities in temperate Quercus macrocarpa phyllosphere. , 2009, The New phytologist.

[25]  David L. Hawksworth,et al.  Ainsworth and Bisby's Dictionary of the Fungi, 8th edn. , 1996 .

[26]  R. W. Embree,et al.  Mucorales. Eine Beschreibung aller Gattungen und Arten dieser Pilzgruppe , 1971 .

[27]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[28]  J. Blair,et al.  Vertical distribution of fungal communities in tallgrass prairie soil , 2010, Mycologia.

[29]  M. Bidartondo,et al.  How to know unknown fungi: the role of a herbarium. , 2009, The New phytologist.

[30]  Erik Kristiansson,et al.  An outlook on the fungal internal transcribed spacer sequences in GenBank and the introduction of a web-based tool for the exploration of fungal diversity. , 2009, The New phytologist.

[31]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[32]  A. Orgiazzi,et al.  Disclosing arbuscular mycorrhizal fungal biodiversity in soil through a land-use gradient using a pyrosequencing approach. , 2009, Environmental microbiology.

[33]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[34]  G. Kovács,et al.  Glomus perpusillum, a new arbuscular mycorrhizal fungus , 2009, Mycologia.

[35]  R. Knight,et al.  Soil bacterial and fungal communities across a pH gradient in an arable soil , 2010, The ISME Journal.

[36]  Robert Samson,et al.  Indoor fungal composition is geographically patterned and more diverse in temperate zones than in the tropics , 2010, Proceedings of the National Academy of Sciences.

[37]  M. P. Cummings,et al.  PAUP* Phylogenetic analysis using parsimony (*and other methods) Version 4 , 2000 .

[38]  W. Gams A key to the species of Mortierella , 1977 .

[39]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..