Updated MS²PIP web server delivers fast and accurate MS² peak intensity prediction for multiple fragmentation methods, instruments and labeling techniques

Abstract MS²PIP is a data-driven tool that accurately predicts peak intensities for a given peptide's fragmentation mass spectrum. Since the release of the MS²PIP web server in 2015, we have brought significant updates to both the tool and the web server. In addition to the original models for CID and HCD fragmentation, we have added specialized models for the TripleTOF 5600+ mass spectrometer, for TMT-labeled peptides, for iTRAQ-labeled peptides, and for iTRAQ-labeled phosphopeptides. Because the fragmentation pattern is heavily altered in each of these cases, these additional models greatly improve the prediction accuracy for their corresponding data types. We have also substantially reduced the computational resources required to run MS²PIP, and have completely rebuilt the web server, which now allows predictions of up to 100 000 peptide sequences in a single request. The MS²PIP web server is freely available at https://iomics.ugent.be/ms2pip/.

[1]  P. Willems,et al.  N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana , 2017, Molecular & Cellular Proteomics.

[2]  Albert Sickmann,et al.  Temporal quantitative phosphoproteomics of ADP stimulation reveals novel central nodes in platelet activation and inhibition. , 2017, Blood.

[3]  Lennart Martens,et al.  Accurate peptide fragmentation predictions allow data driven approaches to replace and improve upon proteomics search engine scoring functions , 2019, Bioinform..

[4]  G. D. de Souza,et al.  Proteome analysis of an Escherichia coli ptsN-null strain under different nitrogen regimes. , 2018, Journal of proteomics.

[5]  Lennart Martens,et al.  PRIDE: The proteomics identifications database , 2005, Proteomics.

[6]  Pavel A. Pevzner,et al.  Universal database search tool for proteomics , 2014, Nature Communications.

[7]  Lennart Martens,et al.  MS2PIP prediction server: compute and visualize MS2 peak intensity predictions for CID and HCD fragmentation , 2015, Nucleic Acids Res..

[8]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[9]  John C Whittaker,et al.  Review of factors that influence the abundance of ions produced in a tandem mass spectrometer and statistical methods for discovering these factors. , 2009, Mass spectrometry reviews.

[10]  William Stafford Noble,et al.  Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0 , 2016, Journal of The American Society for Mass Spectrometry.

[11]  K. Parker,et al.  Multiplexed Protein Quantitation in Saccharomyces cerevisiae Using Amine-reactive Isobaric Tagging Reagents*S , 2004, Molecular & Cellular Proteomics.

[12]  Mathias Wilhelm,et al.  Building ProteomeTools based on a complete synthetic human proteome , 2017, Nature Methods.

[13]  Jian Wang,et al.  Assembling the Community-Scale Discoverable Human Proteome , 2018, Cell systems.

[14]  Niels Jørgensen,et al.  Development and validation of a mass spectrometry-based assay for quantification of insulin-like factor 3 in human serum , 2018, Clinical chemistry and laboratory medicine.

[15]  Jianqiao Shen,et al.  Spectral Library Search Improves Assignment of TMT Labeled MS/MS Spectra. , 2018, Journal of proteome research.

[16]  M. Savitski,et al.  Thermal proteome profiling in bacteria: probing protein state in vivo , 2018, Molecular systems biology.

[17]  Martin Eisenacher,et al.  The PRIDE database and related tools and resources in 2019: improving support for quantification data , 2018, Nucleic Acids Res..

[18]  Predrag Radivojac,et al.  A Machine Learning Approach to Predicting Peptide Fragmentation Spectra , 2005, Pacific Symposium on Biocomputing.

[19]  Chih-Chiang Tsou,et al.  DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics , 2015, Nature Methods.

[20]  Michael J MacCoss,et al.  Using BiblioSpec for Creating and Searching Tandem MS Peptide Libraries , 2007, Current protocols in bioinformatics.

[21]  Brendan MacLean,et al.  Bioinformatics Applications Note Gene Expression Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments , 2022 .

[22]  W. Luyten,et al.  Comprehensive Peptide Analysis of Mouse Brain Striatum Identifies Novel sORF‐Encoded Polypeptides , 2018, Proteomics.

[23]  Eric W. Deutsch,et al.  A repository of assays to quantify 10,000 human proteins by SWATH-MS , 2014, Scientific Data.

[24]  Peter Dawyndt,et al.  The unique peptidome: Taxon‐specific tryptic peptides as biomarkers for targeted metaproteomics , 2016, Proteomics.

[25]  Lennart Martens,et al.  MS2PIP: a tool for MS/MS peak intensity prediction , 2013, Bioinform..

[26]  Andrew H. Thompson,et al.  Tandem mass tags: a novel quantification strategy for comparative analysis of complex protein mixtures by MS/MS. , 2003, Analytical chemistry.