Status of the Archaeal and Bacterial Census: an Update

ABSTRACT A census is typically carried out for people across a range of geographical levels; however, microbial ecologists have implemented a molecular census of bacteria and archaea by sequencing their 16S rRNA genes. We assessed how well the census of full-length 16S rRNA gene sequences is proceeding in the context of recent advances in high-throughput sequencing technologies because full-length sequences are typically used as references for classification of the short sequences generated by newer technologies. Among the 1,411,234 and 53,546 full-length bacterial and archaeal sequences, 94.5% and 95.1% of the bacterial and archaeal sequences, respectively, belonged to operational taxonomic units (OTUs) that have been observed more than once. Although these metrics suggest that the census is approaching completion, 29.2% of the bacterial and 38.5% of the archaeal OTUs have been observed more than once. Thus, there is still considerable diversity to be explored. Unfortunately, the rate of new full-length sequences has been declining, and new sequences are primarily being deposited by a small number of studies. Furthermore, sequences from soil and aquatic environments, which are known to be rich in bacterial diversity, represent only 7.8 and 16.5% of the census, while sequences associated with host-associated environments represent 55.0% of the census. Continued use of traditional approaches and new technologies such as single-cell genomics and short-read assembly are likely to improve our ability to sample rare OTUs if it is possible to overcome this sampling bias. The success of ongoing efforts to use short-read sequencing to characterize archaeal and bacterial communities requires that researchers strive to expand the depth and breadth of this census. IMPORTANCE The biodiversity contained within the bacterial and archaeal domains dwarfs that of the eukaryotes, and the services these organisms provide to the biosphere are critical. Surprisingly, we have done a relatively poor job of formally tracking the quality of the biodiversity as represented in full-length 16S rRNA genes. By understanding how this census is proceeding, it is possible to suggest the best allocation of resources for advancing the census. We found that the ongoing effort has done an excellent job of sampling the most abundant organisms but struggles to sample the rarer organisms. Through the use of new sequencing technologies, we should be able to obtain full-length sequences from these rare organisms. Furthermore, we suggest that by allocating more resources to sampling environments known to have the greatest biodiversity, we will be able to make significant advances in our characterization of archaeal and bacterial diversity. The biodiversity contained within the bacterial and archaeal domains dwarfs that of the eukaryotes, and the services these organisms provide to the biosphere are critical. Surprisingly, we have done a relatively poor job of formally tracking the quality of the biodiversity as represented in full-length 16S rRNA genes. By understanding how this census is proceeding, it is possible to suggest the best allocation of resources for advancing the census. We found that the ongoing effort has done an excellent job of sampling the most abundant organisms but struggles to sample the rarer organisms. Through the use of new sequencing technologies, we should be able to obtain full-length sequences from these rare organisms. Furthermore, we suggest that by allocating more resources to sampling environments known to have the greatest biodiversity, we will be able to make significant advances in our characterization of archaeal and bacterial diversity.

[1]  Ajay S. Gulati,et al.  Inflammatory Bowel Diseases Phenotype, C. difficile and NOD2 Genotype Are Associated with Shifts in Human Ileum Associated Microbial Composition , 2012, PloS one.

[2]  R. Knight,et al.  Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat , 2012, The ISME Journal.

[3]  Katherine H. Huang,et al.  Structure, Function and Diversity of the Healthy Human Microbiome , 2012, Nature.

[4]  J. L. Green,et al.  A unified initiative to harness Earth's microbiomes , 2015, Science.

[5]  Brian C. Thomas,et al.  Biostimulation induces syntrophic interactions that impact C, S and N cycling in a sediment microbial community , 2012, The ISME Journal.

[6]  L. Artigas,et al.  A global census of marine microbes , 2010 .

[7]  Patrick D. Schloss,et al.  Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system , 2016, PeerJ.

[8]  B. Roe,et al.  Comparison of Species Richness Estimates Obtained Using Nearly Complete Fragments and Simulated Pyrosequencing-Generated Fragments in 16S rRNA Gene-Based Environmental Surveys , 2009, Applied and Environmental Microbiology.

[9]  Daniel S. Jones,et al.  Metagenomic insights into S(0) precipitation in a terrestrial subsurface lithoautotrophic ecosystem , 2014, Front. Microbiol..

[10]  Patrick D. Schloss,et al.  The Effects of Alignment Quality, Distance Calculation Method, Sequence Filtering, and Region on the Analysis of 16S rRNA Gene-Based Studies , 2010, PLoS Comput. Biol..

[11]  Brian C. Thomas,et al.  Fermentation, Hydrogen, and Sulfur Metabolism in Multiple Uncultivated Bacterial Phyla , 2012, Science.

[12]  Eoin L. Brodie,et al.  Isolation of a significant fraction of non-phototroph diversity from a desert Biological Soil Crust , 2015, Front. Microbiol..

[13]  Evan S Snitkin,et al.  Longitudinal shift in diabetic wound microbiota correlates with prolonged skin defense response , 2010, Proceedings of the National Academy of Sciences.

[14]  K. Williams,et al.  Speciation and Reactivity of Uranium Products Formed during in Situ Bioremediation in a Shallow Alluvial Aquifer , 2014, Environmental science & technology.

[15]  H. Noller,et al.  Complete nucleotide sequence of a 16S ribosomal RNA gene from Escherichia coli. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Tanja Woyke,et al.  Metagenomics uncovers gaps in amplicon-based detection of microbial diversity , 2016, Nature Microbiology.

[17]  W. Ludwig,et al.  SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB , 2007, Nucleic acids research.

[18]  Christopher S. Miller,et al.  Fluctuations in Species-Level Protein Expression Occur during Element and Nutrient Cycling in the Subsurface , 2013, PloS one.

[19]  George M. Garrity,et al.  Then and now: a systematic review of the systematics of prokaryotes in the last 80 years , 2013, Antonie van Leeuwenhoek.

[20]  Sukant Khurana,et al.  Progress in the development of gelling agents for improved culturability of microorganisms , 2015, Front. Microbiol..

[21]  J. VanderGheynst,et al.  Glycoside Hydrolase Activities of Thermophilic Bacterial Consortia Adapted to Switchgrass , 2011, Applied and Environmental Microbiology.

[22]  Brian C. Thomas,et al.  EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data , 2011, Genome Biology.

[23]  J. Handelsman,et al.  Status of the Microbial Census , 2004, Microbiology and Molecular Biology Reviews.

[24]  Brian C. Thomas,et al.  Microbes in the neonatal intensive care unit resemble those found in the gut of premature infants , 2014, Microbiome.

[25]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[26]  J. C. Thrash,et al.  Artificial Seawater Media Facilitate Cultivating Members of the Microbial Majority from the Gulf of Mexico , 2016, mSphere.

[27]  Brian C. Thomas,et al.  Unusual biology across a group comprising more than 15% of domain Bacteria , 2015, Nature.

[28]  Julia Oh,et al.  Temporal shifts in the skin microbiome associated with disease flares and treatment in children with atopic dermatitis , 2012, Genome research.

[29]  Natalia N. Ivanova,et al.  Insights into the phylogeny and coding potential of microbial dark matter , 2013, Nature.

[30]  Sarah L. Westcott,et al.  De novo clustering methods outperform reference-based methods for assigning 16S rRNA gene sequences to operational taxonomic units , 2015, PeerJ.

[31]  K. Konstantinidis,et al.  The bacterial species definition in the genomic era , 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[32]  Kelly C. Wrighton,et al.  Short-Read Assembly of Full-Length 16S Amplicons Reveals Bacterial Diversity in Subsurface Sediments , 2013, PloS one.

[33]  D. Dykhuizen Santa Rosalia revisited: Why are there so many species of bacteria? , 2004, Antonie van Leeuwenhoek.

[34]  W. Whitman,et al.  Prokaryotes: the unseen majority. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[35]  A. Bélanger,et al.  Use of Ichip for High-Throughput In Situ Cultivation of “Uncultivable” Microbial Species , 2010, Applied and Environmental Microbiology.

[36]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[37]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[38]  Susan M. Huse,et al.  Microbial diversity in the deep sea and the underexplored “rare biosphere” , 2006, Proceedings of the National Academy of Sciences.

[39]  K. Williams,et al.  Disturbed subsurface microbial communities follow equivalent trajectories despite different structural starting points. , 2015, Environmental microbiology.

[40]  J. Lennon,et al.  Scaling laws predict global microbial diversity , 2016, Proceedings of the National Academy of Sciences.

[41]  Thomas P. Curtis,et al.  Estimating prokaryotic diversity and its limits , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Rob Knight,et al.  The Earth Microbiome project: successes and aspirations , 2014, BMC Biology.

[43]  I-Min A. Chen,et al.  IMG 4 version of the integrated microbial genomes comparative analysis system , 2013, Nucleic Acids Res..

[44]  J. Fuhrman,et al.  Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples. , 2016, Environmental microbiology.

[45]  Marti J. Anderson,et al.  Species abundance distributions: moving beyond single prediction theories to integration within an ecological framework. , 2007, Ecology letters.

[46]  C. Leslin,et al.  Microbial Scout Hypothesis, Stochastic Exit from Dormancy, and the Nature of Slow Growers , 2012, Applied and Environmental Microbiology.

[47]  C. Deming,et al.  Topographical and Temporal Diversity of the Human Skin Microbiome , 2009, Science.