Species-level classification of the vaginal microbiome

Background: The application of next-generation sequencing to the study of the vaginal microbiome is revealing the spectrum of microbial communities that inhabit the human vagina. High-resolution identification of bacterial taxa, at least to the species level, is necessary to fully understand the association of the vaginal microbiome with bacterial vaginosis, sexually transmitted infections, pregnancy complications, menopause, and other physiological and infectious conditions. However, most current taxonomic assignment strategies based on metagenomic 16S rDNA sequence analysis provide, at best, genus-level resolution. Although surveys of 16S rRNA gene sequences are common in microbiome studies, few well-curated, body-site-specific reference databases of 16S rRNA gene sequences are available, and no such resource exists for vaginal microbiome studies.

Results: We constructed the Vaginal 16S rDNA Reference Database, a comprehensive and non-redundant database of 16S rDNA reference sequences for bacterial taxa likely to be associated with vaginal health, and we developed STIRRUPS, a new method that employs the USEARCH algorithm with a curated reference database for rapid species-level classification of 16S rDNA partial sequences. The method was applied to two datasets of V1-V3 16S rDNA reads: one generated from a mock community containing DNA from six bacterial strains associated with vaginal health, and a second generated from over 1,000 mid-vaginal samples collected as part of the Vaginal Human Microbiome Project at Virginia Commonwealth University. In both datasets, STIRRUPS, used in conjunction with the Vaginal 16S rDNA Reference Database, classified more than 95% of processed reads to a species-level taxon using a 97% global identity threshold for assignment.

Conclusions: This database and method provide accurate species-level classifications of metagenomic 16S rDNA sequence reads that will be useful for analysis and comparison of microbiome profiles from vaginal samples. STIRRUPS can also be used to classify 16S rDNA sequence reads from other ecological niches if an appropriate reference database of 16S rDNA sequences is available.
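The assignment rule described in the Results (take a read's best reference hit and accept it only at or above a 97% global identity) can be illustrated with a short sketch. This is not the STIRRUPS implementation. It assumes the alignment step has already produced a tab-separated hit table whose first three columns are read ID, reference label, and percent identity, such as the one USEARCH can write with its `-blast6out` option. The reference label format `taxon|accession`, the function names, and the example reads are hypothetical.

```python
from collections import Counter


def classify_reads(hit_lines, read_ids, min_identity=97.0):
    """Assign each read to the taxon of its best hit at or above min_identity.

    hit_lines: tab-separated rows (read ID, reference label, percent identity, ...).
    read_ids:  all processed reads, so reads without an accepted hit are reported.
    """
    best = {}  # read ID -> (percent identity, taxon)
    for line in hit_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 3:
            continue  # skip malformed rows
        read_id, target, pct_id = fields[0], fields[1], float(fields[2])
        taxon = target.split("|")[0]  # hypothetical label format "taxon|accession"
        if pct_id < min_identity:
            continue  # below the identity threshold: not accepted
        if read_id not in best or pct_id > best[read_id][0]:
            best[read_id] = (pct_id, taxon)
    return {rid: best[rid][1] if rid in best else "unclassified" for rid in read_ids}


def summarize(assignments):
    """Return per-taxon read counts and the fraction of reads classified."""
    counts = Counter(assignments.values())
    classified = sum(n for taxon, n in counts.items() if taxon != "unclassified")
    fraction = classified / len(assignments) if assignments else 0.0
    return counts, fraction


# Made-up hits for three reads, for illustration only.
hits = [
    "read1\tLactobacillus_iners|refA\t99.1",
    "read1\tLactobacillus_crispatus|refB\t97.2",
    "read2\tGardnerella_vaginalis|refC\t98.0",
    "read3\tAtopobium_vaginae|refD\t92.5",
]
read_ids = ["read1", "read2", "read3"]

assignments = classify_reads(hits, read_ids)
counts, fraction = summarize(assignments)
print(assignments)
print(f"classified fraction: {fraction:.2f}")
```

In this toy input, `read3` stays unclassified because its only hit falls below the threshold. The sketch does not address ties between species at the same identity, which a real pipeline would need to handle.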
