leBIBIQBPP: a set of databases and a webtool for automatic phylogenetic analysis of prokaryotic sequences

Background: Estimating the phylogenetic position of bacterial and archaeal organisms by genetic sequence comparison is considered the gold standard in taxonomy. It is also a way to identify the species from which a sequence originates. The quality of the reference database used in such analyses is crucial: the database must reflect up-to-date bacterial nomenclature and accurately indicate the species of origin of its sequences.

Description: leBIBIQBPP is a web tool that takes as input a series of nucleotide sequences belonging to one of a set of reference markers (e.g., SSU rRNA, rpoB, groEL2), automatically retrieves closely related sequences, aligns them, and performs phylogenetic reconstruction using an approximate maximum likelihood approach. The system returns a set of quality parameters and, if possible, a suggested taxonomic assignment for the input sequences. The reference databases are extracted from GenBank and are offered at four degrees of stringency, from the “superstringent” degree (one type strain per species) to the loosely parsed degree (the “lax” database). A set of one hundred to more than a thousand sequences may be analyzed at a time. The speed of the process has been optimized through careful hardware selection and database design.

Conclusion: leBIBIQBPP is a powerful tool that helps biologists position bacterial or archaeal sequences of commonly used markers in a phylogeny. It is a diagnostic tool for clinical, industrial and environmental microbiology laboratories, as well as an exploratory tool for more specialized laboratories. Its main advantages relative to comparable systems are: i) the use of a broad set of databases covering diverse markers with various degrees of stringency; ii) the use of an approximate maximum likelihood approach for phylogenetic reconstruction; iii) a speed compatible with on-line usage; and iv) fully documented results that help the user in decision making.
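To make the first step of the workflow described in the abstract more concrete (retrieving the reference sequences closest to an input sequence before alignment and tree building), here is a minimal Python sketch. It is an illustration only: the abstract does not say how leBIBIQBPP performs retrieval, so the k-mer Jaccard similarity used here is a stand-in chosen for brevity, and the function names, the toy sequences and the parameters `k` and `n` are all hypothetical.

```python
from typing import Dict, List, Set, Tuple


def kmers(seq: str, k: int = 4) -> Set[str]:
    """Return the set of overlapping k-mers of a nucleotide sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two k-mer sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def closest_references(
    query: str,
    references: Dict[str, str],
    n: int = 2,
    k: int = 4,
) -> List[Tuple[str, float]]:
    """Rank reference sequences by k-mer similarity to the query.

    Assumption: this similarity measure is a simple stand-in, not the
    retrieval method used by the tool described in the abstract.
    """
    q = kmers(query, k)
    scored = [(name, jaccard(q, kmers(seq, k))) for name, seq in references.items()]
    # Highest similarity first; ties are broken by name for reproducibility.
    scored.sort(key=lambda item: (-item[1], item[0]))
    return scored[:n]


if __name__ == "__main__":
    # Hypothetical toy reference set; real marker sequences are far longer.
    toy_refs = {
        "ref_A": "ACGTACGTGGCCTTAA",
        "ref_B": "ACGTACGTGGCCTTAT",
        "ref_C": "TTTTGGGGCCCCAAAA",
    }
    for name, score in closest_references("ACGTACGTGGCCTTAA", toy_refs):
        print(name, round(score, 3))
```

In a real pipeline the selected neighbours would then be passed to a multiple sequence aligner and a tree-building program; those steps are left out here because they rely on external tools.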
