Metaproteomic analysis using the Galaxy framework

Metaproteomics characterizes proteins expressed by microorganism communities (microbiome) present in environmental samples or a host organism (e.g. human), revealing insights into the molecular functions conferred by these communities. Compared to conventional proteomics, metaproteomics presents unique data analysis challenges, including the use of large protein databases derived from hundreds or thousands of organisms, as well as numerous processing steps to ensure high data quality. These challenges limit the use of metaproteomics for many researchers. In response, we have developed an accessible and flexible metaproteomics workflow within the Galaxy bioinformatics framework. Via analysis of human oral tissue exudate samples, we have established a modular Galaxy‐based workflow that automates a reduction method for searching large sequence databases, enabling comprehensive identification of host proteins (human) as well as “meta‐proteins” from the nonhost organisms. Downstream, automated processing steps enable basic local alignment search tool analysis and evaluation/visualization of peptide sequence match quality, maximizing confidence in results. Outputted results are compatible with tools for taxonomic and functional characterization (e.g. Unipept, MEGAN5). Galaxy also allows for the sharing of complete workflows with others, promoting reproducibility and also providing a template for further modification and enhancement. Our results provide a blueprint for establishing Galaxy as a solution for metaproteomic data analysis. All MS data have been deposited in the ProteomeXchange with identifier PXD001655 (http://proteomecentral.proteomexchange.org/dataset/PXD001655).

[1]  M. Woodward,et al.  Metaproteomics Analysis Reveals the Adaptation Process for the Chicken Gut Microbiota , 2013, Applied and Environmental Microbiology.

[2]  Ryan S. Mueller,et al.  Sample handling and mass spectrometry for microbial metaproteomic analyses. , 2013, Methods in enzymology.

[3]  F. Bastida,et al.  Metaproteomics of soils from semiarid environment: functional and phylogenetic information obtained with different protein extraction methods. , 2014, Journal of proteomics.

[4]  Joel A. Kooren,et al.  A two‐step database search method improves sensitivity in peptide sequence matches for metaproteomics and proteogenomics studies , 2013, Proteomics.

[5]  I. Hewson,et al.  Metaproteomic Survey of Six Aquatic Habitats: Discovering the Identities of Microbial Populations Active in Biogeochemical Cycling , 2014, Microbial Ecology.

[6]  Peter Dawyndt,et al.  The Unipept metaproteomics analysis pipeline , 2015, Proteomics.

[7]  T. Mattes,et al.  Proteomic analysis of ethene-enriched groundwater microcosms from a vinyl chloride-contaminated site. , 2010, Environmental science & technology.

[8]  Bhuvanesh Singh,et al.  Comparison of oral microbiota in tumor and non-tumor tissues of patients with oral squamous cell carcinoma , 2012, BMC Microbiology.

[9]  Paul Wilmes,et al.  Metaproteomics Provides Functional Insight into Activated Sludge Wastewater Treatment , 2008, PloS one.

[10]  P. Wilmes,et al.  The application of two-dimensional polyacrylamide gel electrophoresis and downstream analyses to a mixed community of prokaryotic microorganisms. , 2004, Environmental microbiology.

[11]  David R Goodlett,et al.  Comparative metaproteomics reveals ocean-scale shifts in microbial nutrient utilization and energy transduction , 2010, The ISME Journal.

[12]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[13]  P. He,et al.  Metaproteomics of cellulose methanisation under thermophilic conditions reveals a surprisingly high proteolytic activity , 2013, The ISME Journal.

[14]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[15]  W. D. de Vos,et al.  Metaproteomics Approach To Study the Functionality of the Microbiota in the Human Infant Gastrointestinal Tract , 2006, Applied and Environmental Microbiology.

[16]  Jer-Horng Wu,et al.  Total Protein Extraction for Metaproteomics Analysis of Methane Producing Biofilm: The Effects of Detergents , 2014, International journal of molecular sciences.

[17]  Ruddy Wattiez,et al.  The long-term adaptation of bacterial communities in metal-contaminated sediments: a metaproteogenomic study. , 2015, Environmental microbiology.

[18]  Christian G. Klatt,et al.  Identification and Distribution of High-Abundance Proteins in the Octopus Spring Microbial Mat Community , 2012, Applied and Environmental Microbiology.

[19]  James E. Johnson,et al.  Flexible and Accessible Workflows for Improved Proteogenomic Analysis Using the Galaxy Framework , 2014, Journal of proteome research.

[20]  C. Botting,et al.  A metaproteomic approach gives functional insights into anaerobic digestion , 2011, Journal of applied microbiology.

[21]  R. Heyer,et al.  The MetaProteomeAnalyzer: a powerful open-source software suite for metaproteomics data analysis and interpretation. , 2015, Journal of proteome research.

[22]  A. Stensballe,et al.  Digging into the extracellular matrix of a complex microbial community using a combined metagenomic and metaproteomic approach. , 2013, Water science and technology : a journal of the International Association on Water Pollution Research.

[23]  S. Hubbard,et al.  Addressing Statistical Biases in Nucleotide-Derived Protein Databases for Proteogenomic Search Strategies , 2012, Journal of proteome research.

[24]  Lennart Martens,et al.  SearchGUI: An open‐source graphical user interface for simultaneous OMSSA and X!Tandem searches , 2011, Proteomics.

[25]  John C. Wooley,et al.  A Primer on Metagenomics , 2010, PLoS Comput. Biol..

[26]  W. D. de Vos,et al.  Comparative Metaproteomics and Diversity Analysis of Human Intestinal Microbiota Testifies for Its Temporal Stability and Expression of Core Functions , 2012, PloS one.

[27]  D. Walsh,et al.  Metaproteomic analysis of a winter to spring succession in coastal northwest Atlantic Ocean microbial plankton , 2014, The ISME Journal.

[28]  E. Madsen,et al.  In situ expression of nitrite-dependent anaerobic methane oxidation proteins by Candidatus Methylomirabilis oxyfera co-occurring with expressed anammox proteins in a contaminated aquifer. , 2015, Environmental microbiology reports.

[29]  Katherine H. Huang,et al.  A framework for human microbiome research , 2012, Nature.

[30]  Manesh Shah,et al.  Metaproteomics Reveals Abundant Transposase Expression in Mutualistic Endosymbionts , 2013, mBio.

[31]  Pratik D Jagtap,et al.  Evaluating the potential of a novel oral lesion exudate collection method coupled with mass spectrometry-based proteomics for oral cancer biomarker discovery , 2011, Clinical Proteomics.

[32]  Dagmar H. Leary,et al.  Which metaproteome? The impact of protein extraction bias on metaproteomic analyses. , 2013, Molecular and cellular probes.

[33]  Eystein Oveland,et al.  PeptideShaker enables reanalysis of MS-derived proteomics data sets , 2015, Nature Biotechnology.

[34]  Naryttza N. Diaz,et al.  The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes , 2005, Nucleic acids research.

[35]  Robert W. Li,et al.  A Previously Uncharacterized, Nonphotosynthetic Member of the Chromatiaceae Is the Primary CO2-Fixing Constituent in a Self-Regenerating Biocathode , 2014, Applied and Environmental Microbiology.

[36]  U Reichl,et al.  Sample prefractionation with liquid isoelectric focusing enables in depth microbial metaproteome analysis of mesophilic and thermophilic biogas plants. , 2014, Anaerobe.

[37]  Massimo Deligios,et al.  Evaluating the Impact of Different Sequence Databases on Metaproteome Analysis: Insights from a Lab-Assembled Microbial Mixture , 2013, PloS one.

[38]  Stephen J. Callister,et al.  Amino acid treatment enhances protein recovery from sediment and soils for metaproteomic studies , 2013, Proteomics.

[39]  A. Butte,et al.  The Integrative Human Microbiome Project: Dynamic Analysis of Microbiome-Host Omics Profiles during Periods of Human Health and Disease , 2014, Cell host & microbe.

[40]  F. Hugenholtz,et al.  Metaproteome analysis and molecular genetics of rat intestinal microbiota reveals section and localization resolved species distribution and enzymatic functionalities. , 2012, Journal of proteome research.

[41]  Inger Ljungberg,et al.  Integrated next-generation sequencing of 16S rDNA and metaproteomics differentiate the healthy urine microbiome from asymptomatic bacteriuria in neuropathic bladder associated with spinal cord injury , 2012, Journal of Translational Medicine.

[42]  T. Griffin,et al.  A dynamic range compression and three-dimensional peptide fractionation analysis platform expands proteome coverage and the diagnostic potential of whole saliva. , 2009, Journal of proteome research.

[43]  Emanuel Schmid,et al.  Soil metaproteomics – Comparative evaluation of protein extraction protocols , 2012, Soil biology & biochemistry.

[44]  Richard J. Giannone,et al.  Development of an Enhanced Metaproteomic Approach for Deepening the Microbiome Characterization of the Human Infant Gut , 2014, Journal of proteome research.

[45]  Piotr Wojtek Dabrowski,et al.  Pipasic: similarity and expression correction for strain-level identification and quantification in metaproteomics , 2014, Bioinform..

[46]  Brandi L. Cantarel,et al.  Integrated Metagenomics/Metaproteomics Reveals Human Host-Microbiota Signatures of Crohn's Disease , 2012, PloS one.

[47]  Ljiljana Paša-Tolić,et al.  Metaproteomics reveals differential modes of metabolic coupling among ubiquitous oxygen minimum zone microbes , 2014, Proceedings of the National Academy of Sciences.

[48]  A. Stensballe,et al.  Metaproteomics: Evaluation of protein extraction from activated sludge , 2014, Proteomics.

[49]  Y. Kasahara,et al.  Metaproteomic Identification of Diazotrophic Methanotrophs and Their Localization in Root Tissues of Field-Grown Rice Plants , 2014, Applied and Environmental Microbiology.

[50]  Bernard Henrissat,et al.  Effects of Diet on Resource Utilization by a Model Human Gut Microbiota Containing Bacteroides cellulosilyticus WH2, a Symbiont with an Extensive Glycobiome , 2013, PLoS biology.

[51]  S. Kleinsteuber,et al.  Metaproteogenomic analysis of a sulfate-reducing enrichment culture reveals genomic organization of key enzymes in the m-xylene degradation pathway and metabolic activity of proteobacteria. , 2014, Systematic and applied microbiology.

[52]  F. Chen,et al.  Metaproteomic analysis of Chesapeake Bay microbial communities , 2005, Saline systems.

[53]  Timothy J Griffin,et al.  Deep metaproteomic analysis of human salivary supernatant , 2012, Proteomics.

[54]  V. O’Flaherty,et al.  Optimisation of protein extraction and 2‐DE for metaproteomics of microbial communities from anaerobic wastewater treatment biofilms , 2009, Electrophoresis.

[55]  T. Griffin,et al.  A metaproteomic analysis of the human salivary microbiota by three-dimensional peptide fractionation and tandem mass spectrometry. , 2010, Molecular oral microbiology.

[56]  Haibin Wang,et al.  Comparative Metaproteomic Analysis on Consecutively Rehmannia glutinosa-Monocultured Rhizosphere Soil , 2011, PloS one.

[57]  Andrew R. Jones,et al.  ProteomeXchange provides globally co-ordinated proteomics data submission and dissemination , 2014, Nature Biotechnology.

[58]  M. Bellgard,et al.  High‐throughput parallel proteogenomics: A bacterial case study , 2014, Proteomics.

[59]  Adam Godzik,et al.  Shotgun metaproteomics of the human distal gut microbiota , 2008, The ISME Journal.

[60]  Daniel H Huson,et al.  Microbial community analysis using MEGAN. , 2013, Methods in enzymology.

[61]  Wen-Han Yu,et al.  The Human Oral Microbiome Database: a web accessible resource for investigating oral microbe taxonomic and genomic information , 2010, Database J. Biol. Databases Curation.

[62]  Pratik D Jagtap,et al.  Multi-omic data analysis using Galaxy , 2015, Nature Biotechnology.

[63]  Martin Taubert,et al.  MetaProSIP: automated inference of stable isotope incorporation rates in proteins for functional metaproteomics. , 2015, Journal of proteome research.

[64]  J. Schubert,et al.  Bacterial colonization of microbial biofilms in oral squamous cell carcinoma , 2013, Clinical Oral Investigations.

[65]  P. Schloss,et al.  Dynamics and associations of microbial community types across the human body , 2014, Nature.