A maximum-likelihood approach to absolute protein quantification in mass spectrometry

Label-free absolute protein quantification refers to a process of estimating protein abundances in complex biological samples based on the data acquired from a liquid chromatography mass spectrometry (LC-MS) analysis. Most approaches to label-free quantification rely on measuring peak areas from an extracted-ion chromatogram. However, because of the differences in physicochemical properties associated with different peptide ions, observed peak areas in a single experiment are determined not only by peptide abundances, but also the intrinsic biases of analytical platforms. Therefore, accurate modeling of these biases provides an opportunity to developing new computational methods for precise absolute protein quantification. In this work, we developed a new algorithm for absolute quantification of proteins. The approach is based on the concept of peptide response rate, which characterizes the peptide-specific signal detection bias in an LC-MS experiment. We argue that peptide response rate is an intrinsic and reproducible property of peptide ions that can be reliably predicted using non-linear regression and features extracted from the sequence of the parent protein. Protein abundances are estimated using a maximum likelihood model in which the observed peak areas of peptide ions are adjusted using predicted peptide response rates. We evaluate our approach on a large LC-MS dataset as well as simulated data and provide evidence that the accuracy of absolute protein quantification is improved when peptide-specific response rates are taken into account.

[1]  E. Diamandis Mass Spectrometry as a Diagnostic and a Cancer Biomarker Discovery Tool , 2004, Molecular & Cellular Proteomics.

[2]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[3]  Alexander Schmidt,et al.  Critical assessment of proteome‐wide label‐free absolute abundance estimation strategies , 2013, Proteomics.

[4]  Da Qi,et al.  A software toolkit and interface for performing stable isotope labeling and top3 quantification using Progenesis LC-MS. , 2012, Omics : a journal of integrative biology.

[5]  E. Marcotte,et al.  Calculating absolute and relative protein abundance from mass spectrometry-based protein expression data , 2008, Nature Protocols.

[6]  Joshua N. Adkins,et al.  MASIC: A software program for fast quantitation and flexible visualization of chromatographic profiles from detected LC-MS(/MS) features , 2008, Comput. Biol. Chem..

[7]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[8]  M. Selbach,et al.  Global quantification of mammalian gene expression control , 2011, Nature.

[9]  R. Cole,et al.  Some tenets pertaining to electrospray ionization mass spectrometry. , 2000, Journal of mass spectrometry : JMS.

[10]  James P. Reilly,et al.  A computational approach toward label-free protein quantification using predicted peptide detectability , 2006, ISMB.

[11]  Nell Sedransk,et al.  Improved Normalization of Systematic Biases Affecting Ion Current Measurements in Label-free Proteomics Data* , 2014, Molecular & Cellular Proteomics.

[12]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[13]  Steven A Carr,et al.  Protein biomarker discovery and validation: the long and uncertain path to clinical utility , 2006, Nature Biotechnology.

[14]  R. Aebersold,et al.  Quantitative Analysis of Fission Yeast Transcriptomes and Proteomes in Proliferating and Quiescent Cells , 2012, Cell.

[15]  S. Bark,et al.  Linear and accurate quantitation of proenkephalin-derived peptides by isotopic labeling with internal standards and mass spectrometry. , 2009, Analytical biochemistry.

[16]  M. Stephens,et al.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. , 2008, Genome research.

[17]  Paul Taylor,et al.  Measurement of protein phosphorylation stoichiometry by selected reaction monitoring mass spectrometry. , 2010, Journal of proteome research.

[18]  Predrag Radivojac,et al.  The importance of peptide detectability for protein identification, quantification, and experiment design in MS/MS proteomics. , 2010, Journal of proteome research.

[19]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[20]  Christoph H Borchers,et al.  A Human Proteome Detection and Quantitation Project* , 2009, Molecular & Cellular Proteomics.

[21]  Ruedi Aebersold,et al.  Statistical Approach to Protein Quantification* , 2013, Molecular & Cellular Proteomics.

[22]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[23]  Predrag Radivojac,et al.  Computational approaches to protein inference in shotgun proteomics , 2012, BMC Bioinformatics.

[24]  R. Aebersold,et al.  A statistical model for identifying proteins by tandem mass spectrometry. , 2003, Analytical chemistry.

[25]  N. Rajewsky,et al.  Widespread changes in protein synthesis induced by microRNAs , 2008, Nature.

[26]  Paul P. Gardner,et al.  Identification of miRNA targets with stable isotope labeling by amino acids in cell culture , 2006, Nucleic acids research.

[27]  Jennifer A. Siepen,et al.  Prediction of missed cleavage sites in tryptic peptides aids protein identification in proteomics. , 2007, Journal of proteome research.

[28]  Hyungwon Choi,et al.  Significance Analysis of Spectral Count Data in Label-free Shotgun Proteomics*S , 2008, Molecular & Cellular Proteomics.

[29]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1989, Math. Control. Signals Syst..

[30]  Marina A Gritsenko,et al.  Proteomic profiling of human liver biopsies: Hepatitis C virus–induced fibrosis and mitochondrial dysfunction , 2007, Hepatology.

[31]  Jonas Grossmann,et al.  Implementation and evaluation of relative and absolute quantification in shotgun proteomics with label-free methods. , 2010, Journal of proteomics.

[32]  T. Colgan,et al.  Search for cancer markers from endometrial tissues using differentially labeled tags iTRAQ and cICAT with multidimensional liquid chromatography and tandem mass spectrometry. , 2005, Journal of proteome research.

[33]  E. Marcotte,et al.  Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation , 2007, Nature Biotechnology.

[34]  N. Samatova,et al.  Detecting differential and correlated protein expression in label-free shotgun proteomics. , 2006, Journal of proteome research.

[35]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[36]  Richard D. Smith,et al.  Does trypsin cut before proline? , 2008, Journal of proteome research.