Soil and leaf litter metaproteomics—a brief guideline from sampling to understanding

The increasing application of soil metaproteomics is providing unprecedented, in-depth characterization of the composition and functionality of in situ microbial communities. Despite recent advances in high-resolution mass spectrometry, soil metaproteomics still suffers from a lack of effective and reproducible protein extraction protocols and standardized data analyses. This review discusses the opportunities and limitations of selected techniques in soil and leaf litter metaproteomics and presents a step-by-step guideline on their application, covering sampling, sample preparation, extraction, and data evaluation strategies. In addition, we present recent applications of soil metaproteomics and discuss how such approaches, which link phylogenetics and functionality, can help gain deeper insights into terrestrial microbial ecology. Finally, to maximize the insights that environmental metaproteomics can provide, we strongly recommend employing such methods within a holistic experimental approach that considers relevant aboveground and belowground ecosystem parameters.
