A scoring model for phosphopeptide site localization and its impact on the question of whether to use MSA.

The production of structurally significant product ions during the dissociation of phosphopeptides is a key to the successful determination of phosphorylation sites. These diagnostic ions can be generated using the widely adopted MS/MS approach, MS3 (Data Dependent Neutral Loss - DDNL), or by multistage activation (MSA). The main purpose of this work is to introduce a false-localization rate (FLR) probabilistic model to enable unbiased phosphoproteomics studies. Briefly, our algorithm infers a probabilistic function from the distribution of the identified phosphopeptides' XCorr Delta scores (XD-Scores) in the current experiment. Our module infers p-values by relying on Gaussian mixture models and a logistic function. We demonstrate the usefulness of our probabilistic model by revisiting the "to MSA, or not to MSA" dilemma. For this, we use human leukemia-derived cells (K562) as a study model and enriched for phosphopeptides using the hydroxyapatite (HAP) chromatography. The aliquots were analyzed with and without MSA on an Orbitrap-XL. Our XD-Scoring analysis revealed that the MS/MS approach provides more identifications because of its faster scan rate, but that for the same given scan rate higher-confidence spectra can be achieved with MSA. Our software is integrated into the PatternLab for proteomics freely available for academic community at http://www.patternlabforproteomics.org. Biological significance Assigning statistical confidence to phosphorylation sites is necessary for proper phosphoproteomic assessment. Here we present a rigorous statistical model, based on Gaussian mixture models and a logistic function, which overcomes shortcomings of previous tools. The algorithm described herein is made readily available to the scientific community by integrating it into the widely adopted PatternLab for proteomics. This article is part of a Special Issue entitled: Computational Proteomics.

[1]  V. Barbosa,et al.  PatternLab: From Mass Spectra to Label‐Free Differential Shotgun Proteomics , 2012, Current protocols in bioinformatics.

[2]  M. Mann,et al.  Global, In Vivo, and Site-Specific Phosphorylation Dynamics in Signaling Networks , 2006, Cell.

[3]  Gabriel Padrón,et al.  Peptide fractionation by acid pH SDS‐free electrophoresis , 2011, Electrophoresis.

[4]  Ishtiaq Rehman,et al.  iTRAQ underestimation in simple and complex mixtures: "the good, the bad and the ugly". , 2009, Journal of proteome research.

[5]  Steven P Gygi,et al.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization , 2006, Nature Biotechnology.

[6]  N. Ahn,et al.  Quantifying the impact of chimera MS/MS spectra on peptide identification in large-scale proteomics studies. , 2010, Journal of proteome research.

[7]  Bernhard Kuster,et al.  DMSO enhances electrospray response, boosting sensitivity of proteomic experiments , 2013, Nature Methods.

[8]  Valmir Carneiro Barbosa,et al.  PatternLab for proteomics: a tool for differential shotgun proteomics , 2008, BMC Bioinformatics.

[9]  W. Weckwerth,et al.  Enrichment of phosphorylated proteins and peptides from complex mixtures using metal oxide/hydroxide affinity chromatography (MOAC) , 2005, Proteomics.

[10]  T. Hunter,et al.  Signaling—2000 and Beyond , 2000, Cell.

[11]  P. Cohen The role of protein phosphorylation in human health and disease. The Sir Hans Krebs Medal Lecture. , 2001, European journal of biochemistry.

[12]  A. Heck,et al.  Selective isolation at the femtomole level of phosphopeptides from proteolytic digests using 2D-NanoLC-ESI-MS/MS and titanium oxide precolumns. , 2004, Analytical chemistry.

[13]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[14]  S. Schmidt,et al.  Current methods for phosphoprotein isolation and enrichment. , 2007, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[15]  Arend Sidow,et al.  A Cell Cycle Phosphoproteome of the Yeast Centrosome , 2011, Science.

[16]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[17]  C. Nilsson Advances in quantitative phosphoproteomics. , 2012, Analytical chemistry.

[18]  John R Yates,et al.  Search engine processor: Filtering and organizing peptide spectrum matches , 2012, Proteomics.

[19]  F. Švec,et al.  Control of selectivity via nanochemistry: monolithic capillary column containing hydroxyapatite nanoparticles for separation of proteins and enrichment of phosphopeptides. , 2010, Analytical chemistry.

[20]  Alexey I Nesvizhskii,et al.  Comparison of MS(2)-only, MSA, and MS(2)/MS(3) methodologies for phosphopeptide identification. , 2009, Journal of proteome research.

[21]  M. Larsen,et al.  Analytical strategies for phosphoproteomics , 2009, Expert review of neurotherapeutics.

[22]  Nikolaos Scarmeas,et al.  The good, bad, and ugly? , 2012, Neurology.

[23]  Albert Sickmann,et al.  State‐of‐the‐art in phosphoproteomics , 2005, Proteomics.

[24]  P. Ferranti,et al.  Hydroxyapatite affinity chromatography for the highly selective enrichment of mono‐ and multi‐phosphorylated peptides in phosphoproteome analysis , 2010, Proteomics.

[25]  B. Kuster,et al.  Confident Phosphorylation Site Localization Using the Mascot Delta Score , 2010, Molecular & Cellular Proteomics.

[26]  L. J. Cummings,et al.  Protein chromatography on hydroxyapatite columns. , 2009, Methods in enzymology.

[27]  J. Eng,et al.  Comet: An open‐source MS/MS sequence database search tool , 2013, Proteomics.

[28]  M. Mann,et al.  Higher-energy C-trap dissociation for peptide modification analysis , 2007, Nature Methods.

[29]  M. Mann,et al.  Profiling the Trypanosoma cruzi Phosphoproteome , 2011, PloS one.

[30]  A. S. Posner,et al.  Crystal Structure of Hydroxyapatite , 1964, Nature.

[31]  M. Mann,et al.  A large synthetic peptide and phosphopeptide reference library for mass spectrometry–based proteomics , 2013, Nature Biotechnology.

[32]  J R Yates,et al.  Protein sequencing by tandem mass spectrometry. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[33]  A. Tichý,et al.  Phosphoproteomics: searching for a needle in a haystack. , 2011, Journal of proteomics.

[34]  T. Isobe,et al.  A New Spherical Hydroxyapatite for High Performance Liquid Chromatography of Proteins , 1986 .

[35]  Lennart Martens,et al.  D‐score: A search engine independent MD‐score , 2013, Proteomics.

[36]  C. Gahmberg,et al.  K562—A human erythroleukemic cell line , 1979, International journal of cancer.

[37]  M. J. Gorbunoff,et al.  Protein chromatography on hydroxyapatite columns. , 1990, Methods in enzymology.

[38]  John R Yates,et al.  Single-step inline hydroxyapatite enrichment facilitates identification and quantitation of phosphopeptides from mass-limited proteomes with MudPIT. , 2012, Journal of proteome research.

[39]  J. Shabanowitz,et al.  A neutral loss activation method for improved phosphopeptide sequence analysis by quadrupole ion trap mass spectrometry. , 2004, Analytical chemistry.