DIAmeter: matching peptides to data-independent acquisition mass spectrometry data

Tandem mass spectrometry data acquired using data independent acquisition (DIA) is challenging to interpret because the data exhibits complex structure along both the mass-to-charge (m/z) and time axes. The most common approach to analyzing this type of data makes use of a library of previously observed DIA data patterns (a “spectral library”), but this approach is expensive because the libraries do not typically generalize well across laboratories. Here we propose DIAmeter, a search engine that detects peptides in DIA data using only a peptide sequence database. Unlike other library-free DIA analysis methods, DIAmeter supports data generated using both wide and narrow isolation windows, can readily detect peptides containing post-translational modifications, can analyze data from a variety of instrument platforms, and is capable of detecting peptides even in the absence of detectable signal in the survey (MS1) scan.

[1]  O. Ohara,et al.  Data-Independent Acquisition Mass Spectrometry-Based Deep Proteome Analysis for Hydrophobic Proteins from Dried Blood Spots Enriched by Sodium Carbonate Precipitation. , 2021, Methods in molecular biology.

[2]  Xue Cai,et al.  Data‐Independent Acquisition Mass Spectrometry‐Based Proteomics and Software Tools: A Glimpse in 2020 , 2020, Proteomics.

[3]  S. Degroeve,et al.  DeepLC can predict retention times for peptides that carry as-yet unseen modifications , 2020, Nature Methods.

[4]  Pavel Sulimov,et al.  Tailor: non-parametric and rapid score calibration method for database search-based peptide identification in shotgun proteomics. , 2020, Journal of proteome research.

[5]  Haixu Tang,et al.  Full-Spectrum Prediction of Peptides Tandem Mass Spectra using Deep Neural Network. , 2020, Analytical chemistry.

[6]  Christoph B. Messner,et al.  DIA-NN: Neural networks and interference correction enable deep proteome coverage in high throughput , 2019, Nature Methods.

[7]  Mathias Wilhelm,et al.  Generating high quality libraries for DIA MS with empirically corrected peptide predictions , 2019, Nature Communications.

[8]  Mathias Wilhelm,et al.  Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning , 2019, Nature Methods.

[9]  J. Cox,et al.  High-quality MS/MS spectrum prediction for data-dependent and data-independent acquisition data analysis , 2019, Nature Methods.

[10]  Michael J MacCoss,et al.  Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry , 2018, Nature Communications.

[11]  Huanming Yang,et al.  Improved Peptide Retention Time Prediction in Liquid Chromatography through Deep Learning. , 2018, Analytical chemistry.

[12]  William Stafford Noble,et al.  Joint precursor elution profile inference via regression for peptide detection in data-independent acquisition mass spectra , 2018, bioRxiv.

[13]  Chunjie Luo,et al.  pDeep: Predicting MS/MS Spectra of Peptides with Deep Learning. , 2017, Analytical chemistry.

[14]  Oliver M. Bernhardt,et al.  Optimization of Experimental Parameters in Data-Independent Mass Spectrometry Significantly Increases Depth and Reproducibility of Results* , 2017, Molecular & Cellular Proteomics.

[15]  Samuel H Payne,et al.  PECAN: Library Free Peptide Detection for Data-Independent Acquisition Tandem Mass Spectrometry Data , 2017, Nature Methods.

[16]  Alexey I Nesvizhskii,et al.  MSFragger: ultrafast and comprehensive peptide identification in shotgun proteomics , 2017, Nature Methods.

[17]  Judit Villén,et al.  Evolution of protein phosphorylation across 18 fungal species , 2016, Science.

[18]  Yasset Perez-Riverol,et al.  A multi-center study benchmarks software tools for label-free proteome quantification , 2016, Nature Biotechnology.

[19]  William Stafford Noble,et al.  Bipartite matching generalizations for peptide identification in tandem mass spectrometry , 2016, BCB.

[20]  Jian Wang,et al.  MSPLIT-DIA: sensitive peptide identification for data-independent acquisition , 2015, Nature Methods.

[21]  José A. Dianes,et al.  2016 update of the PRIDE database and its related tools , 2015, Nucleic Acids Res..

[22]  Yuanyue Li,et al.  Group-DIA: analyzing multiple data-independent acquisition mass spectrometry data files , 2015, Nature Methods.

[23]  Oliver M. Bernhardt,et al.  Extending the Limits of Quantitative Proteome Profiling with Data-Independent Acquisition and Application to Acetaminophen-Treated Three-Dimensional Liver Microtissues* , 2015, Molecular & Cellular Proteomics.

[24]  Ruedi Aebersold,et al.  Quantitative variability of 342 plasma proteins in a human twin population , 2015 .

[25]  Chih-Chiang Tsou,et al.  DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics , 2015, Nature Methods.

[26]  Allison Doerr DIA mass spectrometry , 2014, Nature Methods.

[27]  Stephan M. Winkler,et al.  MS Amanda, a Universal Identification Algorithm Optimized for High Accuracy Tandem Mass Spectra , 2014, Journal of proteome research.

[28]  William Stafford Noble,et al.  A cross-validation scheme for machine learning algorithms in shotgun proteomics , 2012, BMC Bioinformatics.

[29]  William Stafford Noble,et al.  Faster SEQUEST searching for peptide identification from tandem mass spectra. , 2011, Journal of proteome research.

[30]  William Stafford Noble,et al.  On using samples of known protein content to assess the statistical calibration of scores assigned to peptide-spectrum matches in shotgun proteomics. , 2011, Journal of proteome research.

[31]  P. Pevzner,et al.  The Generating Function of CID, ETD, and CID/ETD Pairs of Tandem Mass Spectra: Applications to Database Search* , 2010, Molecular & Cellular Proteomics.

[32]  Robert Burke,et al.  ProteoWizard: open source software for rapid proteomics tools development , 2008, Bioinform..

[33]  William Stafford Noble,et al.  Rapid and accurate peptide identification from tandem mass spectra. , 2008, Journal of proteome research.

[34]  William Stafford Noble,et al.  Semi-supervised learning for peptide identification from shotgun proteomics datasets , 2007, Nature Methods.

[35]  John D. Venable,et al.  Automated approach for quantitative analysis of complex peptide mixtures from tandem mass spectra , 2004, Nature Methods.

[36]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[37]  S. Bryant,et al.  Open mass spectrometry search algorithm. , 2004, Journal of proteome research.

[38]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[39]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[40]  Susan S. Taylor,et al.  The evolution of protein phosphorylation , 2016 .

[41]  M. Mann,et al.  Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips , 2007, Nature Protocols.

[42]  Brendan MacLean,et al.  Bioinformatics Applications Note Gene Expression Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments , 2022 .

[43]  B. Fredholm,et al.  P O B O X 1 1 7 2 2 1 0 0 L U N D + 4 6 4 6 -2 2 2 0 0 0 0 , 2022 .