ProteoStorm: An Ultrafast Metaproteomics Database Search Framework.

Shotgun metaproteomics has the potential to reveal the functional landscape of microbial communities but lacks appropriate methods for complex samples with unknown compositions. In the absence of prior taxonomic information, tandem mass spectra would be searched against large pan-microbial databases, which requires heavy computational workload and reduces sensitivity. We present ProteoStorm, an efficient database search framework for large-scale metaproteomics studies, which identifies high-confidence peptide-spectrum matches (PSMs) while achieving a two-to-three orders-of-magnitude speedup over popular tools. A reanalysis of a urinary tract infection (UTI) dataset of 110 individuals revealed a complex pattern of polymicrobial expression, including sub-types of UTIs, cases of bacterial vaginosis, and evidence of no underlying disease. Importantly, compared to the initial UTI study that restricted the search database to a manually curated list of 20 genera, ProteoStorm identified additional genera that were previously unreported, including a case of infection with the rare pathogen Propionimicrobium.

[1]  K. Nelson,et al.  Diagnosing inflammation and infection in the urinary system via proteomics , 2015, Journal of Translational Medicine.

[2]  Natalie I. Tasman,et al.  A Cross-platform Toolkit for Mass Spectrometry and Proteomics , 2012, Nature Biotechnology.

[3]  Y. Homma,et al.  Propionimicrobium lymphophilum and Actinotignum schaalii bacteraemia: a case report , 2017, New microbes and new infections.

[4]  James Butcher,et al.  MetaPro-IQ: a universal metaproteomic approach to studying human and mouse gut microbiota , 2016, Microbiome.

[5]  K. R. Clarke,et al.  Statistical Design And Analysis For A Biological Effects Study , 1988 .

[6]  E. Domann,et al.  Hidden pathogens uncovered: metagenomic analysis of urinary tract infections , 2008, Andrologia.

[7]  R. Antonelli Incalzi,et al.  The burden of comorbidity is associated with symptomatic polymicrobial urinary tract infection among institutionalized elderly , 2015, Aging Clinical and Experimental Research.

[8]  Nitin S. Baliga,et al.  Community-integrated omics links dominance of a microbial generalist to fine-tuned resource usage , 2014, Nature Communications.

[9]  Cathy H. Wu,et al.  A fast Peptide Match service for UniProt Knowledgebase , 2013, Bioinform..

[10]  Ruedi Aebersold,et al.  Identification of cross-linked peptides from large sequence databases , 2008, Nature Methods.

[11]  Pavel A. Pevzner,et al.  Universal database search tool for proteomics , 2014, Nature Communications.

[12]  J. Eng,et al.  Comet: An open‐source MS/MS sequence database search tool , 2013, Proteomics.

[13]  R. Fichorova,et al.  The Human Microbiome during Bacterial Vaginosis , 2016, Clinical Microbiology Reviews.

[14]  Yong J. Kil,et al.  Comment on "Unbiased statistical analysis for multi-stage proteomic search strategies". , 2011, Journal of proteome research.

[15]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[16]  Haixu Tang,et al.  A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics , 2016, PLoS Comput. Biol..

[17]  J. Banfield,et al.  Genome-resolved metaproteomic characterization of preterm infant gut microbiota development reveals species-specific metabolic shifts and variabilities during early life , 2017, Microbiome.

[18]  Haixu Tang,et al.  A Meta-proteogenomic Approach to Peptide Identification Incorporating Assembly Uncertainty and Genomic Variation* , 2019, Molecular & Cellular Proteomics.

[19]  Meghan C. Burke,et al.  The Hybrid Search: A Mass Spectral Library Search Method for Discovery of Modifications in Proteomics. , 2017, Journal of proteome research.

[20]  Alexey I Nesvizhskii,et al.  MSFragger: ultrafast and comprehensive peptide identification in shotgun proteomics , 2017, Nature Methods.

[21]  A. Ronald The etiology of urinary tract infection: traditional and emerging pathogens. , 2002, The American journal of medicine.

[22]  Joel A. Kooren,et al.  A two‐step database search method improves sensitivity in peptide sequence matches for metaproteomics and proteogenomics studies , 2013, Proteomics.

[23]  Stephen E. Stein,et al.  Chemical substructure identification by mass spectral library searching , 1995, Journal of the American Society for Mass Spectrometry.

[24]  K. Nelson,et al.  Comprehensive Metaproteomic Analyses of Urine in the Presence and Absence of Neutrophil-Associated Inflammation in the Urinary Tract , 2017, Theranostics.

[25]  Andrew I. Su,et al.  A comprehensive and scalable database search system for metaproteomics , 2016, BMC Genomics.

[26]  Nuno Bandeira,et al.  False discovery rates in spectral identification , 2012, BMC Bioinformatics.

[27]  Jacques Ravel,et al.  Vaginal microbiome: rethinking health and disease. , 2012, Annual review of microbiology.