Critical assessment of proteome‐wide label‐free absolute abundance estimation strategies

There is a great interest in reliable ways to obtain absolute protein abundances at a proteome‐wide scale. To this end, label‐free LC‐MS/MS quantification methods have been proposed where all identified proteins are assigned an estimated abundance. Several variants of this quantification approach have been presented, based on either the number of spectral counts per protein or MS1 peak intensities. Equipped with several datasets representing real biological environments, containing a high number of accurately quantified reference proteins, we evaluate five popular low‐cost and easily implemented quantification methods (Absolute Protein Expression, Exponentially Modified Protein Abundance Index, Intensity‐Based Absolute Quantification Index, Top3, and MeanInt). Our results demonstrate considerably improved abundance estimates upon implementing accurately quantified reference proteins; that is, using spiked in stable isotope labeled standard peptides or a standard protein mix, to generate a properly calibrated quantification model. We show that only the Top3 method is directly proportional to protein abundance over the full quantification range and is the preferred method in the absence of reference protein measurements. Additionally, we demonstrate that spectral count based quantification methods are associated with higher errors than MS1 peak intensity based methods. Furthermore, we investigate the impact of miscleaved, modified, and shared peptides as well as protein size and the number of employed reference proteins on quantification accuracy.

[1]  M. Mann,et al.  Extensive quantitative remodeling of the proteome between normal colon tissue and adenocarcinoma , 2012, Molecular systems biology.

[2]  Jonas Grossmann,et al.  Implementation and evaluation of relative and absolute quantification in shotgun proteomics with label-free methods. , 2010, Journal of proteomics.

[3]  R. Aebersold,et al.  Proteome-wide cellular protein concentrations of the human pathogen Leptospira interrogans , 2009, Nature.

[4]  Ruedi Aebersold,et al.  Reproducible isolation of distinct, overlapping segments of the phosphoproteome , 2007, Nature Methods.

[5]  Hua Lin,et al.  Quantifying reproducibility for differential proteomics: noise analysis for protein liquid chromatography-mass spectrometry of human serum , 2004, Bioinform..

[6]  E. Marcotte,et al.  Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation , 2007, Nature Biotechnology.

[7]  Natalie I. Tasman,et al.  A guided tour of the Trans‐Proteomic Pipeline , 2010, Proteomics.

[8]  M. Record,et al.  Characterization of the cytoplasm of Escherichia coli K-12 as a function of external osmolarity. Implications for protein-DNA interactions in vivo. , 1991, Journal of molecular biology.

[9]  D. Frishman,et al.  Protein abundance profiling of the Escherichia coli cytosol , 2008, BMC Genomics.

[10]  M. Gorenstein,et al.  Absolute Quantification of Proteins by LCMSE , 2006, Molecular & Cellular Proteomics.

[11]  M. Mann,et al.  Comparative Proteomic Analysis of Eleven Common Cell Lines Reveals Ubiquitous but Varying Expression of Most Proteins* , 2012, Molecular & Cellular Proteomics.

[12]  R. Aebersold,et al.  Selected reaction monitoring for quantitative proteomics: a tutorial , 2008, Molecular systems biology.

[13]  K. Valgepea,et al.  Comparison and applications of label-free absolute proteome quantification methods on Escherichia coli. , 2012, Journal of proteomics.

[14]  Jens M. Rick,et al.  Quantitative mass spectrometry in proteomics: a critical review , 2007, Analytical and bioanalytical chemistry.

[15]  Soyoung Ryu,et al.  Comparison of a Label-Free Quantitative Proteomic Method Based on Peptide Ion Current Area to the Isotope Coded Affinity Tag Method , 2008, Cancer informatics.

[16]  Ruedi Aebersold,et al.  Options and considerations when selecting a quantitative proteomics strategy , 2010, Nature Biotechnology.

[17]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[18]  Henry H. N. Lam,et al.  Absolute quantification of microbial proteomes at different states by directed mass spectrometry , 2011, Molecular systems biology.

[19]  Benjamin Thomas,et al.  Comparative evaluation of label‐free SINQ normalized spectral index quantitation in the central proteomics facilities pipeline , 2011, Proteomics.

[20]  M. Tomita,et al.  Gene expression emPAI Calc — for the estimation of protein abundance from large-scale identification data by liquid chromatography-tandem mass spectrometry , 2010 .

[21]  R. Aebersold,et al.  Quantification of mRNA and protein and integration with protein turnover in a bacterium , 2011, Molecular systems biology.

[22]  Mehdi Mirzaei,et al.  Less label, more free: Approaches in label‐free quantitative mass spectrometry , 2011, Proteomics.

[23]  Martin Eisenacher,et al.  Peek a peak: a glance at statistics for quantitative label-free proteomics , 2010, Expert review of proteomics.

[24]  Rong Wang,et al.  The APEX Quantitative Proteomics Tool: Generating protein quantitation estimates from LC-MS/MS proteomics results , 2008, BMC Bioinformatics.

[25]  M. Mann,et al.  Exponentially Modified Protein Abundance Index (emPAI) for Estimation of Absolute Protein Amount in Proteomics by the Number of Sequenced Peptides per Protein*S , 2005, Molecular & Cellular Proteomics.

[26]  Hyungwon Choi,et al.  Modularity and hormone sensitivity of the Drosophila melanogaster insulin receptor/target of rapamycin interaction proteome , 2011, Molecular systems biology.

[27]  S. Zimmerman,et al.  Estimation of macromolecule concentrations and excluded volume effects for the cytoplasm of Escherichia coli. , 1991, Journal of molecular biology.

[28]  J. Ellenberg,et al.  The quantitative proteome of a human cell line , 2011, Molecular systems biology.

[29]  J. Yates,et al.  A model for random sampling and estimation of relative protein abundance in shotgun proteomics. , 2004, Analytical chemistry.

[30]  R. Aebersold,et al.  Large-scale quantitative assessment of different in-solution protein digestion protocols reveals superior cleavage efficiency of tandem Lys-C/trypsin proteolysis over trypsin digestion. , 2012, Journal of proteome research.

[31]  Sophie Brachat,et al.  Reinvestigation of the Saccharomyces cerevisiae genome annotation by comparison to the genome of a related fungus: Ashbya gossypii , 2003, Genome Biology.

[32]  Ruedi Aebersold,et al.  Estimation of Absolute Protein Quantities of Unlabeled Samples by Selected Reaction Monitoring Mass Spectrometry , 2011, Molecular & Cellular Proteomics.

[33]  A. Nesvizhskii,et al.  Comparative analysis of different label-free mass spectrometry based protein abundance estimates and their correlation with RNA-Seq gene expression data. , 2012, Journal of proteome research.

[34]  Da Qi,et al.  A software toolkit and interface for performing stable isotope labeling and top3 quantification using Progenesis LC-MS. , 2012, Omics : a journal of integrative biology.

[35]  M. Selbach,et al.  Global quantification of mammalian gene expression control , 2011, Nature.

[36]  K. Kito,et al.  Mass Spectrometry-Based Approaches Toward Absolute Quantitative Proteomics , 2008, Current genomics.

[37]  R. Aebersold,et al.  Quantitative Analysis of Fission Yeast Transcriptomes and Proteomes in Proliferating and Quiescent Cells , 2012, Cell.

[38]  H. Mischak,et al.  Urine Proteome Analysis Reflects Atherosclerotic Disease in an ApoE−/− Mouse Model and Allows the Discovery of New Candidate Biomarkers in Mouse and Human Atherosclerosis* , 2012, Molecular & Cellular Proteomics.

[39]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[40]  K. Resing,et al.  Comparison of Label-free Methods for Quantifying Human Proteins by Shotgun Proteomics*S , 2005, Molecular & Cellular Proteomics.

[41]  Emma Lundberg,et al.  A Protein Epitope Signature Tag (PrEST) Library Allows SILAC-based Absolute Quantification and Multiplexed Determination of Protein Copy Numbers in Cell Lines* , 2011, Molecular & Cellular Proteomics.

[42]  Sylvie Huet,et al.  Including shared peptides for estimating protein abundances: A significant improvement for quantitative proteomics , 2012, Proteomics.