Comparative metagenomic and rRNA microbial diversity characterization using archaeal and bacterial synthetic communities.

Next-generation sequencing has dramatically changed the landscape of microbial ecology, large-scale and in-depth diversity studies being now widely accessible. However, determining the accuracy of taxonomic and quantitative inferences and comparing results obtained with different approaches are complicated by incongruence of experimental and computational data types and also by lack of knowledge of the true ecological diversity. Here we used highly diverse bacterial and archaeal synthetic communities assembled from pure genomic DNAs to compare inferences from metagenomic and SSU rRNA amplicon sequencing. Both Illumina and 454 metagenomic data outperformed amplicon sequencing in quantifying the community composition, but the outcome was dependent on analysis parameters and platform. New approaches in processing and classifying amplicons can reconstruct the taxonomic composition of the community with high reproducibility within primer sets, but all tested primers sets lead to significant taxon-specific biases. Controlled synthetic communities assembled to broadly mimic the phylogenetic richness in target environments can provide important validation for fine-tuning experimental and computational parameters used to characterize natural communities.

[1]  D. Lane 16S/23S rRNA sequencing , 1991 .

[2]  S. Goodison,et al.  16S ribosomal DNA amplification for phylogenetic study , 1991, Journal of bacteriology.

[3]  R. Amann,et al.  Sequence heterogeneities of genes encoding 16S rRNAs in Paenibacillus polymyxa detected by temperature gradient gel electrophoresis , 1996, Journal of bacteriology.

[4]  S. Giovannoni,et al.  Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR , 1996, Applied and environmental microbiology.

[5]  L. Forney,et al.  Distribution of bacterioplankton in meromictic Lake Saelenvannet, as determined by denaturing gradient gel electrophoresis of PCR-amplified gene fragments coding for 16S rRNA , 1997, Applied and environmental microbiology.

[6]  Martin F. Polz,et al.  Bias in Template-to-Product Ratios in Multitemplate PCR , 1998, Applied and Environmental Microbiology.

[7]  K. Horikoshi,et al.  Rapid Detection and Quantification of Members of the Archaeal Community by Quantitative PCR Using Fluorogenic Probes , 2000, Applied and Environmental Microbiology.

[8]  E. Delong,et al.  Environmental diversity of bacteria and archaea. , 2001, Systematic biology.

[9]  Kazuya Watanabe,et al.  Design and evaluation of PCR primers to amplify bacterial 16S ribosomal DNA fragments used for community fingerprinting. , 2001, Journal of microbiological methods.

[10]  K. R. Clarke,et al.  Change in marine communities : an approach to statistical analysis and interpretation , 2001 .

[11]  D. Cowan,et al.  Review and re-analysis of domain-specific 16S primers. , 2003, Journal of microbiological methods.

[12]  S. Tringe,et al.  Comparative Metagenomics of Microbial Communities , 2004, Science.

[13]  R. B. Jackson,et al.  Assessment of Soil Microbial Community Structure by Use of Taxon-Specific Quantitative PCR Assays , 2005, Applied and Environmental Microbiology.

[14]  R. Lynn,et al.  Intelligence: Is there a sex difference in IQ scores? , 2006, Nature.

[15]  E. Stackebrandt Taxonomic parameters revisited : tarnished gold standards , 2006 .

[16]  M. Tivey,et al.  A ubiquitous thermoacidophilic archaeon from deep-sea hydrothermal vents , 2006, Nature.

[17]  N. Moran,et al.  Parallel genomic evolution and metabolic interdependence in an ancient symbiosis , 2007, Proceedings of the National Academy of Sciences.

[18]  A. Salamov,et al.  Use of simulated data sets to evaluate the fidelity of metagenomic processing methods , 2007, Nature Methods.

[19]  Alexander F. Auch,et al.  MEGAN analysis of metagenomic data. , 2007, Genome research.

[20]  R. Knight,et al.  Evolution of Mammals and Their Gut Microbes , 2008, Science.

[21]  Philip Hugenholtz,et al.  A renaissance for the pioneering 16S rRNA gene. , 2008, Current opinion in microbiology.

[22]  G. Olsen,et al.  Critical Evaluation of Two Primers Commonly Used for Amplification of Bacterial 16S rRNA Genes , 2008, Applied and Environmental Microbiology.

[23]  Adam Godzik,et al.  Shotgun metaproteomics of the human distal gut microbiota , 2008, The ISME Journal.

[24]  N. Pace Mapping the Tree of Life: Progress and Prospects , 2009, Microbiology and Molecular Biology Reviews.

[25]  W. Holben,et al.  Empirical Testing of 16S rRNA Gene PCR Primer Pairs Reveals Variance in Target Specificity and Efficacy Not Suggested by In Silico Analysis , 2009, Applied and Environmental Microbiology.

[26]  Natalia N. Ivanova,et al.  A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea , 2009, Nature.

[27]  Tracy K. Teal,et al.  Systematic artifacts in metagenomes from complex microbial communities , 2009, The ISME Journal.

[28]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[29]  James R. Cole,et al.  The Ribosomal Database Project: improved alignments and new tools for rRNA analysis , 2008, Nucleic Acids Res..

[30]  J. Bunge,et al.  Polymerase chain reaction primers miss half of rRNA microbial diversity , 2009, The ISME Journal.

[31]  C. Quince,et al.  Accurate determination of microbial diversity from 454 pyrosequencing data , 2009, Nature Methods.

[32]  Russell J. Davenport,et al.  Removing Noise From Pyrosequenced Amplicons , 2011, BMC Bioinformatics.

[33]  Héctor Corrada Bravo,et al.  Intensity normalization improves color calling in SOLiD sequencing , 2010, Nature Methods.

[34]  F. Chen,et al.  Experimental factors affecting PCR-based estimates of microbial species richness and evenness , 2010, The ISME Journal.

[35]  P. Hugenholtz,et al.  Multiple displacement amplification compromises quantitative analysis of metagenomes , 2010, Nature Methods.

[36]  William A. Walters,et al.  Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample , 2010, Proceedings of the National Academy of Sciences.

[37]  R. Knight,et al.  Rapid denoising of pyrosequencing amplicon data: exploiting the rank-abundance distribution , 2010, Nature Methods.

[38]  F. Bushman,et al.  Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags , 2010, BMC Microbiology.

[39]  E. Delong,et al.  Microbial community transcriptomes reveal microbes and metabolic pathways associated with dissolved organic matter turnover in the sea , 2010, Proceedings of the National Academy of Sciences.

[40]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[41]  Susan M. Huse,et al.  Ironing out the wrinkles in the rare biosphere through improved OTU clustering , 2010, Environmental microbiology.

[42]  Ö. Springer Characterization of Archaeal Community in Contaminated and Uncontaminated Surface Stream Sediments , 2010 .

[43]  D. Antonopoulos,et al.  Using the metagenomics RAST server (MG-RAST) for analyzing shotgun metagenomes. , 2010, Cold Spring Harbor protocols.

[44]  Andrew C. Adey,et al.  Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition , 2010, Genome Biology.

[45]  J. Eisen,et al.  Metagenomic Sequencing of an In Vitro-Simulated Microbial Community , 2010, PloS one.

[46]  J. Eisen,et al.  Metagenomic Sequencing of an In Vitro-Simulated Microbial Community , 2010, PloS one.

[47]  J. Prosser Replicate or lie. , 2010, Environmental microbiology.

[48]  V. Kunin,et al.  Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates. , 2009, Environmental microbiology.

[49]  N. Caruccio Preparation of next-generation sequencing libraries using Nextera™ technology: simultaneous DNA fragmentation and adaptor tagging by in vitro transposition. , 2011, Methods in molecular biology.

[50]  Patrick D. Schloss,et al.  Assessing and Improving Methods Used in Operational Taxonomic Unit-Based Approaches for 16S rRNA Gene Sequence Analysis , 2011, Applied and Environmental Microbiology.

[51]  M. Pop,et al.  Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences , 2011, BMC Genomics.

[52]  Sergey Koren,et al.  Bambus 2: scaffolding metagenomes , 2011, Bioinform..

[53]  S. Tringe,et al.  Metagenomic Discovery of Biomass-Degrading Genes and Genomes from Cow Rumen , 2011, Science.

[54]  R. Knight,et al.  Moving pictures of the human microbiome , 2011, Genome Biology.

[55]  Jizhong Zhou,et al.  Reproducibility and quantitation of amplicon sequencing-based detection , 2011, The ISME Journal.

[56]  W. Inskeep,et al.  Archaea in Yellowstone Lake , 2011, The ISME Journal.

[57]  B. Haas,et al.  Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons. , 2011, Genome research.

[58]  A. Moya,et al.  Evaluating the Fidelity of De Novo Short Read Metagenomic Assembly Using Simulated Data , 2011, PloS one.

[59]  Mihai Pop,et al.  Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences , 2011, Genome Biology.

[60]  Rob Knight,et al.  Examining the global distribution of dominant archaeal populations in soil , 2011, The ISME Journal.

[61]  Patrick D. Schloss,et al.  Reducing the Effects of PCR Amplification and Sequencing Artifacts on 16S rRNA-Based Studies , 2011, PloS one.

[62]  T. Scheffer,et al.  Taxonomic metagenome sequence assignment with structured output models , 2011, Nature Methods.

[63]  Xiaoyu Wang,et al.  A large-scale benchmark study of existing algorithms for taxonomy-independent microbial community analysis , 2012, Briefings Bioinform..

[64]  Peter Williams,et al.  IMG: the integrated microbial genomes database and comparative analysis system , 2011, Nucleic Acids Res..

[65]  M. Podar,et al.  Distinct and complex bacterial profiles in human periodontitis and health revealed by 16S pyrosequencing , 2011, The ISME Journal.

[66]  R. Morris,et al.  Untangling Genomes from Metagenomes: Revealing an Uncultured Class of Marine Euryarchaeota , 2012, Science.

[67]  C. Schadt,et al.  Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes. , 2012, Environmental microbiology.

[68]  M. W. Taylor,et al.  Marine sponges and their microbial symbionts: love and other relationships. , 2012, Environmental microbiology.

[69]  Brian C. Thomas,et al.  Fermentation, Hydrogen, and Sulfur Metabolism in Multiple Uncultivated Bacterial Phyla , 2012, Science.

[70]  E. Chesler,et al.  Host genetic and environmental effects on mouse intestinal microbiota , 2012, The ISME Journal.

[71]  William A. Walters,et al.  Impact of training sets on classification of high-throughput bacterial 16s rRNA gene surveys , 2011, The ISME Journal.