NBC: the Naïve Bayes Classification tool webserver for taxonomic classification of metagenomic reads

Motivation: High-throughput sequencing technologies have yielded vast amounts of data about the organisms in environmental samples. Yet assessing the exact organism content of these samples remains a challenge, because taxonomic classification is too computationally demanding to annotate all reads in a dataset. An easy-to-use webserver is needed to process these reads. While many methods exist, only a few are publicly available on webservers, and of those, most do not annotate all reads.

Results: We introduce a webserver that implements the naïve Bayes classifier (NBC) to classify all metagenomic reads to their best taxonomic match. Our results indicate that NBC can assign next-generation sequencing reads to taxa and can find significant populations of genera that other classifiers may miss.

Availability: Publicly available at http://nbc.ece.drexel.edu.

Contact: gailr@ece.drexel.edu
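To illustrate the idea of assigning every read to its best-scoring taxon, the following is a minimal sketch of a naïve Bayes classifier over k-mer counts. It is not the webserver's implementation: the function names (`kmers`, `train`, `classify`), the toy reference sequences, the choice of k = 3, the add-one smoothing, and the uniform prior over taxa are all assumptions made for illustration.

```python
import math
from collections import Counter


def kmers(seq, k):
    """Yield every overlapping k-mer in seq (upper-cased)."""
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def train(genomes, k):
    """Build an add-one-smoothed log-probability table of k-mers per taxon."""
    vocab = 4 ** k  # number of possible k-mers over A, C, G, T
    models = {}
    for name, seq in genomes.items():
        counts = Counter(kmers(seq, k))
        total = sum(counts.values())
        models[name] = {
            "log_probs": {
                m: math.log((c + 1) / (total + vocab)) for m, c in counts.items()
            },
            # Log-probability used for k-mers never seen in this taxon.
            "log_unseen": math.log(1 / (total + vocab)),
        }
    return models


def classify(read, models, k):
    """Return (best taxon, per-taxon log-likelihoods), assuming a uniform prior."""
    scores = {}
    for name, model in models.items():
        # Naive independence assumption: sum the log-probability of each k-mer.
        scores[name] = sum(
            model["log_probs"].get(m, model["log_unseen"]) for m in kmers(read, k)
        )
    return max(scores, key=scores.get), scores


# Hypothetical toy "genomes"; real reference genomes are far longer.
genomes = {
    "genus_A": "ATATATATATATATATATAT",
    "genus_B": "GCGCGCGCGCGCGCGCGCGC",
}
models = train(genomes, k=3)
best, scores = classify("ATATAT", models, k=3)
```

Because the classifier always returns the highest-scoring taxon, every read receives an assignment, even one that matches no reference well. In this sketch, a caller who wants a confidence measure would have to inspect the returned `scores` separately.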
