Including shared peptides for estimating protein abundances: A significant improvement for quantitative proteomics

Inferring protein abundances from peptide intensities is the key step in quantitative proteomics. The inference is necessarily more accurate when many peptides are taken into account for a given protein. Yet the information carried by peptides shared by different proteins is commonly discarded. We propose a statistical framework based on hierarchical modeling to include that information. Our methodology, based on a simultaneous analysis of all the quantified peptides, handles the biological and technical errors as well as the peptide effect. In addition, we propose a practical implementation suitable for analyzing large data sets. Compared to a method based on the analysis of one protein at a time (which does not include shared peptides), our methodology proved to be far more reliable for estimating protein abundances and testing abundance changes. The source code is available at http://pappso.inra.fr/bioinfo/all_p/.
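
To illustrate why discarding shared peptides loses information, the following is a minimal sketch and not the hierarchical model proposed here. It assumes, purely for illustration, that a noise-free peptide intensity on the linear scale is the sum of the abundances of the proteins containing that peptide, and it ignores the peptide effect and the biological and technical errors that the actual methodology handles. The membership matrix, the intensities, and the function names are all hypothetical.

```python
# Hypothetical peptide-to-protein membership (rows: peptides, columns: proteins A, B).
# Peptides 0 and 1 are unique to A; peptides 2 and 3 are shared by A and B,
# so protein B has no unique peptide at all.
MEMBERSHIP = [
    [1.0, 0.0],
    [1.0, 0.0],
    [1.0, 1.0],
    [1.0, 1.0],
]

# Illustrative intensities generated with abundances A = 10 and B = 4 (assumed values).
INTENSITY = [10.0, 10.0, 14.0, 14.0]


def solve_least_squares(design, response):
    """Least-squares fit via the normal equations and Gauss-Jordan elimination."""
    n_cols = len(design[0])
    # Augmented system [X^T X | X^T y].
    system = []
    for i in range(n_cols):
        row = [sum(r[i] * r[j] for r in design) for j in range(n_cols)]
        row.append(sum(r[i] * y for r, y in zip(design, response)))
        system.append(row)
    for col in range(n_cols):
        pivot = max(range(col, n_cols), key=lambda k: abs(system[k][col]))
        if abs(system[pivot][col]) < 1e-12:
            # A zero pivot means some abundance cannot be determined from the data.
            raise ValueError("abundances are not identifiable from these peptides")
        system[col], system[pivot] = system[pivot], system[col]
        pivot_value = system[col][col]
        system[col] = [v / pivot_value for v in system[col]]
        for k in range(n_cols):
            if k != col:
                factor = system[k][col]
                system[k] = [a - factor * b for a, b in zip(system[k], system[col])]
    return [system[i][n_cols] for i in range(n_cols)]


def estimate_abundances(membership, intensity, use_shared=True):
    """Estimate protein abundances, optionally dropping peptides shared by several proteins."""
    kept = [
        (row, y)
        for row, y in zip(membership, intensity)
        if use_shared or sum(row) == 1
    ]
    design = [row for row, _ in kept]
    response = [y for _, y in kept]
    return solve_least_squares(design, response)


if __name__ == "__main__":
    print(estimate_abundances(MEMBERSHIP, INTENSITY, use_shared=True))
    try:
        estimate_abundances(MEMBERSHIP, INTENSITY, use_shared=False)
    except ValueError as err:
        print("unique peptides only:", err)
```

In this toy setting, analyzing all peptides simultaneously recovers both abundances, whereas restricting the analysis to unique peptides leaves protein B without any usable measurement. A realistic analysis would work with noisy intensities and would model the peptide effect explicitly, as the abstract describes.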
