Updated MS²PIP web server supports cutting-edge proteomics applications

Abstract Interest in the use of machine learning for peptide fragmentation spectrum prediction has been strongly on the rise over the past years, especially for applications in challenging proteomics identification workflows such as immunopeptidomics and the full-proteome identification of data independent acquisition spectra. Since its inception, the MS²PIP peptide spectrum predictor has been widely used for various downstream applications, mostly thanks to its accuracy, ease-of-use, and broad applicability. We here present a thoroughly updated version of the MS²PIP web server, which includes new and more performant prediction models for both tryptic- and non-tryptic peptides, for immunopeptides, and for CID-fragmented TMT-labeled peptides. Additionally, we have also added new functionality to greatly facilitate the generation of proteome-wide predicted spectral libraries, requiring only a FASTA protein file as input. These libraries also include retention time predictions from DeepLC. Moreover, we now provide pre-built and ready-to-download spectral libraries for various model organisms in multiple DIA-compatible spectral library formats. Besides upgrading the back-end models, the user experience on the MS²PIP web server is thus also greatly enhanced, extending its applicability to new domains, including immunopeptidomics and MS3-based TMT quantification experiments. MS²PIP is freely available at https://iomics.ugent.be/ms2pip/.

[1]  Benjamin A. Neely,et al.  Toward an Integrated Machine Learning Model of a Proteomics Experiment , 2023, Journal of proteome research.

[2]  S. Degroeve,et al.  MS2Rescore: Data-Driven Rescoring Dramatically Boosts Immunopeptide Identification Rates , 2021, bioRxiv.

[3]  Donald J L Jones,et al.  Cov-MS: A Community-Based Template Assay for Mass-Spectrometry-Based Protein Detection in SARS-CoV-2 Patients , 2021, JACS Au.

[4]  Lindsay K. Pino,et al.  The Skyline ecosystem: Informatics for quantitative mass spectrometry proteomics. , 2020, Mass spectrometry reviews.

[5]  Lennart Martens,et al.  The Age of Data‐Driven Proteomics: How Machine Learning Enables Novel Workflows , 2020, Proteomics.

[6]  S. Degroeve,et al.  DeepLC can predict retention times for peptides that carry as-yet unseen modifications , 2020, Nature Methods.

[7]  Christoph B. Messner,et al.  DIA-NN: Neural networks and interference correction enable deep proteome coverage in high throughput , 2019, Nature Methods.

[8]  Mathias Wilhelm,et al.  Generating high quality libraries for DIA MS with empirically corrected peptide predictions , 2019, Nature Communications.

[9]  B Van Puyvelde,et al.  Removing the hidden data dependency of DIA with predicted spectral libraries , 2019, bioRxiv.

[10]  Lennart Martens,et al.  Updated MS²PIP web server delivers fast and accurate MS² peak intensity prediction for multiple fragmentation methods, instruments and labeling techniques , 2019, Nucleic Acids Res..

[11]  Martin Eisenacher,et al.  The PRIDE database and related tools and resources in 2019: improving support for quantification data , 2018, Nucleic Acids Res..

[12]  Lennart Martens,et al.  Accurate peptide fragmentation predictions allow data driven approaches to replace and improve upon proteomics search engine scoring functions , 2018, bioRxiv.

[13]  Anthony W Purcell,et al.  In Immunopeptidomics We Need a Sniper Instead of a Shotgun , 2018, Proteomics.

[14]  Peter Dawyndt,et al.  The unique peptidome: Taxon‐specific tryptic peptides as biomarkers for targeted metaproteomics , 2016, Proteomics.

[15]  Lennart Martens,et al.  MS2PIP prediction server: compute and visualize MS2 peak intensity predictions for CID and HCD fragmentation , 2015, Nucleic Acids Res..

[16]  Lennart Martens,et al.  MS2PIP: a tool for MS/MS peak intensity prediction , 2013, Bioinform..

[17]  S. Gygi,et al.  MS3 eliminates ratio distortion in isobaric labeling-based multiplexed quantitative proteomics , 2011, Nature Methods.

[18]  Frank Kjeldsen,et al.  Undesirable charge-enhancement of isobaric tagged phosphopeptides leads to reduced identification efficiency. , 2010, Journal of proteome research.

[19]  Lennart Martens,et al.  PRIDE: The proteomics identifications database , 2005, Proteomics.

[20]  V. Wysocki,et al.  Mobile and localized protons: a framework for understanding peptide dissociation. , 2000, Journal of mass spectrometry : JMS.

[21]  S. Degroeve,et al.  MS 2 Rescore: Data-Driven Rescoring Dramatically Boosts Immunopeptide Identi fi cation Rates , 2022 .