StaphyloBase: a specialized genomic resource for the staphylococcal research community

Abstract With the advent of high-throughput sequencing technologies, many staphylococcal genomes have been sequenced. Comparative analysis of these strains will provide better understanding of their biology, phylogeny, virulence and taxonomy, which may contribute to better management of diseases caused by staphylococcal pathogens. We developed StaphyloBase with the goal of having a one-stop genomic resource platform for the scientific community to access, retrieve, download, browse, search, visualize and analyse the staphylococcal genomic data and annotations. We anticipate this resource platform will facilitate the analysis of staphylococcal genomic data, particularly in comparative analyses. StaphyloBase currently has a collection of 754 032 protein-coding sequences (CDSs), 19 258 rRNAs and 15 965 tRNAs from 292 genomes of different staphylococcal species. Information about these features is also included, such as putative functions, subcellular localizations and gene/protein sequences. Our web implementation supports diverse query types and the exploration of CDS- and RNA-type information in detail using an AJAX-based real-time search system. JBrowse has also been incorporated to allow rapid and seamless browsing of staphylococcal genomes. The Pairwise Genome Comparison tool is designed for comparative genomic analysis, for example, to reveal the relationships between two user-defined staphylococcal genomes. A newly designed Pathogenomics Profiling Tool (PathoProT) is also included in this platform to facilitate comparative pathogenomics analysis of staphylococcal strains. In conclusion, StaphyloBase offers access to a range of staphylococcal genomic resources as well as analysis tools for comparative analyses. Database URL: http://staphylococcus.um.edu.my/

[1]  L. Stein,et al.  JBrowse: a next-generation genome browser. , 2009, Genome research.

[2]  Martin Ester,et al.  PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes , 2010, Bioinform..

[3]  J. Costerton,et al.  Biofilms: Survival Mechanisms of Clinically Relevant Microorganisms , 2002, Clinical Microbiology Reviews.

[4]  M F Lawrence,et al.  Impedance-based detection of DNA sequences using a silicon transducer with PNA as the probe layer. , 2004, Nucleic acids research.

[5]  O. Schneewind,et al.  Genome Sequence of Staphylococcus aureus Strain Newman and Comparative Analysis of Staphylococcal Genomes: Polymorphism and Evolution of Two Major Pathogenicity Islands , 2007, Journal of bacteriology.

[6]  Samuel V. Angiuoli,et al.  Insights on Evolution of Virulence and Resistance from the Complete Genome Analysis of an Early Methicillin-Resistant Staphylococcus aureus Strain and a Biofilm-Producing Methicillin-Resistant Staphylococcus epidermidis Strain , 2005, Journal of bacteriology.

[7]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[8]  B. Barrell,et al.  Complete genomes of two clinical Staphylococcus aureus strains: evidence for the rapid evolution of virulence and drug resistance. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[9]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[10]  Michael G. Kemp,et al.  The histone deacetylase inhibitor trichostatin A alters the pattern of DNA replication origin activity in human cells , 2005, Nucleic acids research.

[11]  M. Stanhope,et al.  Comparative genomic analysis of the genus Staphylococcus including Staphylococcus aureus and its newly described sister species Staphylococcus simiae , 2012, BMC Genomics.

[12]  Haibao Tang,et al.  Finding and Comparing Syntenic Regions among Arabidopsis and the Outgroups Papaya, Poplar, and Grape: CoGe with Rosids1[W] , 2008, Plant Physiology.

[13]  Jian Yang,et al.  VFDB 2012 update: toward the genetic diversity and molecular evolution of bacterial virulence factors , 2011, Nucleic Acids Res..

[14]  J. Vederas,et al.  Response of Methicillin-Resistant Staphylococcus aureus to Amicoumacin A , 2012, PloS one.

[15]  Thomas L. Madden,et al.  BLAST: at the core of a powerful and diverse set of sequence analysis tools , 2004, Nucleic Acids Res..

[16]  A. van Belkum,et al.  Nasal carriage of Staphylococcus aureus: epidemiology, underlying mechanisms, and associated risks , 1997, Clinical microbiology reviews.

[17]  Jun Yu,et al.  VFDB: a reference database for bacterial virulence factors , 2004, Nucleic Acids Res..

[18]  Fangfang Xia,et al.  SEED Servers: High-Performance Access to the SEED Genomes, Annotations, and Metabolic Models , 2012, PloS one.

[19]  S. Salzberg,et al.  Versatile and open software for comparing large genomes , 2004, Genome Biology.

[20]  Jaideep P. Sundaram,et al.  Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial "pan-genome". , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[21]  A. Ankai,et al.  Whole-Genome Sequencing of Staphylococcus haemolyticus Uncovers the Extreme Plasticity of Its Genome and the Evolution of Human-Colonizing Staphylococcal Species , 2005, Journal of bacteriology.

[22]  S. Lewis,et al.  The generic genome browser: a building block for a model organism system database. , 2002, Genome research.

[23]  I. M. Gould VRSA-doomsday superbug or damp squib? , 2010, The Lancet. Infectious diseases.

[24]  Peter Williams,et al.  IMG: the integrated microbial genomes database and comparative analysis system , 2011, Nucleic Acids Res..

[25]  Jun Yu,et al.  VFDB 2008 release: an enhanced web-based resource for comparative pathogenomics , 2007, Nucleic Acids Res..

[26]  Rick L. Stevens,et al.  The RAST Server: Rapid Annotations using Subsystems Technology , 2008, BMC Genomics.

[27]  Nikos Darzentas,et al.  Circoletto: visualizing sequence similarity with Circos , 2010, Bioinform..