Comparative evaluation of label-free quantification methods for shotgun proteomics.

RATIONALE Label-free quantification (LFQ) is a popular strategy for shotgun proteomics. A variety of LFQ algorithms have been developed recently. However, a comprehensive comparison of the most commonly used LFQ methods is still rare, in part due to a lack of clear metrics for their evaluation and an annotated and quantitatively well-characterized data set. METHODS Five LFQ methods were compared: spectral counting based algorithms SIN , emPAI, and NSAF, and approaches relying on the extracted ion chromatogram (XIC) intensities, MaxLFQ and Quanti. We used three criteria for performance evaluation: coefficient of variation (CV) of protein abundances between replicates; analysis of variance (ANOVA); and the root-mean-square error of logarithmized calculated concentration ratios, referred to as standard quantification error (SQE). Comparison was performed using a quantitatively annotated publicly available data set. RESULTS The best results in terms of inter-replicate reproducibility were observed for MaxLFQ and NSAF, although they exhibited larger standard quantification errors. Using NSAF, all quantitatively annotated proteins were correctly identified in the Bonferronni-corrected results of the ANOVA test. SIN was found to be the most accurate in terms of SQE. Finally, the current implementations of XIC-based LFQ methods did not outperform the methods based on spectral counting for the data set used in this study. CONCLUSIONS Surprisingly, the performances of XIC-based approaches measured using three independent metrics were found to be comparable with more straightforward and simple MS/MS-based spectral counting approaches. The study revealed no clear leader among the latter. Copyright © 2017 John Wiley & Sons, Ltd.

[1]  Mathias Wilhelm,et al.  Global proteome analysis of the NCI-60 cell line panel. , 2013, Cell reports.

[2]  Roman A Zubarev,et al.  In Silico Instrumental Response Correction Improves Precision of Label-free Proteomics and Accuracy of Proteomics-based Predictive Models* , 2013, Molecular & Cellular Proteomics.

[3]  Michael K. Coleman,et al.  Statistical analysis of membrane proteome expression changes in Saccharomyces cerevisiae. , 2006, Journal of proteome research.

[4]  Christopher H. Becker,et al.  Recent developments in quantitative proteomics. , 2011, Mutation research.

[5]  Robert Burke,et al.  ProteoWizard: open source software for rapid proteomics tools development , 2008, Bioinform..

[6]  D. Lauffenburger,et al.  Multiple reaction monitoring for robust quantitative proteomic analysis of cellular signaling networks , 2007, Proceedings of the National Academy of Sciences.

[7]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[8]  M. Mann,et al.  Andromeda: a peptide search engine integrated into the MaxQuant environment. , 2011, Journal of proteome research.

[9]  Guilong Cheng,et al.  Mass spectrometry of peptides and proteins. , 2005, Methods.

[10]  Mehdi Mirzaei,et al.  Less label, more free: Approaches in label‐free quantitative mass spectrometry , 2011, Proteomics.

[11]  A. Otto,et al.  Quantitative proteomics in the field of microbiology , 2014, Proteomics.

[12]  Yasset Perez-Riverol,et al.  A multi-center study benchmarks software tools for label-free proteome quantification , 2016, Nature Biotechnology.

[13]  Jens M. Rick,et al.  Quantitative mass spectrometry in proteomics: a critical review , 2007, Analytical and bioanalytical chemistry.

[14]  David Fenyö,et al.  Protein quantitation using mass spectrometry. , 2010, Methods in molecular biology.

[15]  M. Mann,et al.  Exponentially Modified Protein Abundance Index (emPAI) for Estimation of Absolute Protein Amount in Proteomics by the Number of Sequenced Peptides per Protein*S , 2005, Molecular & Cellular Proteomics.

[16]  R. Branca,et al.  Quantitative accuracy in mass spectrometry based proteomics of complex samples: the impact of labeling and precursor interference. , 2014, Journal of proteomics.

[17]  Michael L. Gatza,et al.  Proteogenomics connects somatic mutations to signaling in breast cancer , 2016, Nature.

[18]  Marco Y. Hein,et al.  Accurate Proteome-wide Label-free Quantification by Delayed Normalization and Maximal Peptide Ratio Extraction, Termed MaxLFQ * , 2014, Molecular & Cellular Proteomics.

[19]  Lev I Levitsky,et al.  Empirical multidimensional space for scoring peptide spectrum matches in shotgun proteomics. , 2014, Journal of proteome research.

[20]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[21]  Pei Wang,et al.  Demonstrating the feasibility of large-scale development of standardized assays to quantify human proteins , 2013, Nature Methods.

[22]  Gerard Cagney,et al.  An Overview of Label-Free Quantitation Methods in Proteomics by Mass Spectrometry , 2010, Proteome Bioinformatics.

[23]  Linfeng Wu,et al.  Role of spectral counting in quantitative proteomics , 2010, Expert review of proteomics.

[24]  K. Anderson,et al.  Mixed-effects statistical model for comparative LC-MS proteomics studies. , 2008, Journal of proteome research.

[25]  Terence P Speed,et al.  Statistical modeling of sequencing errors in SAGE libraries. , 2004, Bioinformatics.

[26]  Bas van Breukelen,et al.  Current challenges in software solutions for mass spectrometry-based quantitative proteomics , 2012, Amino Acids.

[27]  Benjamin Thomas,et al.  Comparative evaluation of label‐free SINQ normalized spectral index quantitation in the central proteomics facilities pipeline , 2011, Proteomics.

[28]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[29]  J. Koziol,et al.  Label-free, normalized quantification of complex mass spectrometry data for proteomics analysis , 2009, Nature Biotechnology.