Selection of primers for optimal taxonomic classification of environmental 16S rRNA gene sequences

Microbial community profiling using 16S rRNA gene sequences requires accurate taxonomy assignments. ‘Universal’ primers target conserved sequences and amplify sequences from many taxa, but they provide variable coverage of different environments, and regions of the rRNA gene differ in taxonomic informativeness, especially when high-throughput short-read sequencing technologies (for example, 454 and Illumina) are used. We introduce a new evaluation procedure that provides an improved measure of expected taxonomic precision when classifying environmental sequence reads from a given primer. Applying this measure to thousands of combinations of primers and read lengths, simulating single-ended and paired-end sequencing, reveals that these choices greatly affect taxonomic informativeness. The most informative sequence region may differ by environment, partly due to variable coverage of different environments in reference databases. Using our Rtax method of classifying paired-end reads, we found that paired-end sequencing provides substantial benefit in some environments including human gut, but not in others. Optimal primer choice for short reads totaling 96 nt provides 82–100% of the confident genus classifications available from longer reads.
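To make the idea of simulating single-ended and paired-end reads for a primer pair concrete, here is a minimal Python sketch. It is not the authors' procedure: the function names (`reverse_complement`, `simulate_reads`) are hypothetical, primers are matched exactly (real 'universal' primers contain degenerate bases and tolerate mismatches), and the primers are assumed to be trimmed from the reads.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq))


def simulate_reads(reference, fwd_primer, rev_primer, read_len, paired=False):
    """Simulate reads from the amplicon bounded by two primers.

    Hypothetical helper: exact primer matching only, primers trimmed.
    Returns a 1-tuple (single-ended) or 2-tuple (paired-end) of reads,
    or None if either primer site is not found in the reference.
    """
    start = reference.find(fwd_primer)
    if start == -1:
        return None
    # The reverse primer anneals to the opposite strand, so on the
    # reference strand its site appears as the reverse complement.
    rev_site = reverse_complement(rev_primer)
    end = reference.find(rev_site, start + len(fwd_primer))
    if end == -1:
        return None
    amplicon = reference[start + len(fwd_primer):end]
    forward = amplicon[:read_len]
    if not paired:
        return (forward,)
    # The second read starts at the far end and is read on the other strand.
    reverse = reverse_complement(amplicon)[:read_len]
    return (forward, reverse)


# Toy reference: flank + forward primer + insert + reverse primer site + flank
toy_reference = "GG" + "ACGT" + "AAACCCGGGTTTAC" + "TGCAA" + "CC"
single = simulate_reads(toy_reference, "ACGT", "TTGCA", read_len=4)
pair = simulate_reads(toy_reference, "ACGT", "TTGCA", read_len=4, paired=True)
```

Under these assumptions, a paired-end design totaling 96 nt would correspond to calling `simulate_reads(..., read_len=48, paired=True)` on each reference sequence; the resulting reads could then be passed to whatever classifier is being evaluated.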
