Wavelet-Based Peak Detection and a New Charge Inference Procedure for MS/MS Implemented in ProteoWizard’s msConvert

We report the implementation of high-quality signal processing algorithms into ProteoWizard, an efficient, open-source software package designed for analyzing proteomics tandem mass spectrometry data. Specifically, a new wavelet-based peak-picker (CantWaiT) and a precursor charge determination algorithm (Turbocharger) have been implemented. These additions into ProteoWizard provide universal tools that are independent of vendor platform for tandem mass spectrometry analyses and have particular utility for intralaboratory studies requiring the advantages of different platforms convergent on a particular workflow or for interlaboratory investigations spanning multiple platforms. We compared results from these tools to those obtained using vendor and commercial software, finding that in all cases our algorithms resulted in a comparable number of identified peptides for simple and complex samples measured on Waters, Agilent, and AB SCIEX quadrupole time-of-flight and Thermo Q-Exactive mass spectrometers. The mass accuracy of matched precursor ions also compared favorably with vendor and commercial tools. Additionally, typical analysis runtimes (∼1–100 ms per MS/MS spectrum) were short enough to enable the practical use of these high-quality signal processing tools for large clinical and research data sets.

[1]  Natalie I. Tasman,et al.  A Cross-platform Toolkit for Mass Spectrometry and Proteomics , 2012, Nature Biotechnology.

[2]  Jean Yee Hwa Yang,et al.  BIOINFORMATICS ORIGINAL PAPER , 2022 .

[3]  Pei Wang,et al.  Bioinformatics Original Paper a Suite of Algorithms for the Comprehensive Analysis of Complex Protein Mixtures Using High-resolution Lc-ms , 2022 .

[4]  Richard D. Smith,et al.  Mass Spectrometry‐Based Proteomics: Existing Capabilities and Future Directions , 2012 .

[5]  R D Appel,et al.  Improving protein identification from peptide mass fingerprinting through a parameterized multi‐level scoring algorithm and an optimized peak detection , 1999, Electrophoresis.

[6]  Navdeep Jaitly,et al.  VIPER: an advanced software package to support high-throughput LC-MS peptide identification , 2007, Bioinform..

[7]  Chao Yang,et al.  Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis , 2009, BMC Bioinformatics.

[8]  Bernhard Kuster,et al.  Quantitative mass spectrometry in proteomics: critical review update from 2007 to the present , 2012, Analytical and Bioanalytical Chemistry.

[9]  N. Ahn,et al.  Quantifying the impact of chimera MS/MS spectra on peptide identification in large-scale proteomics studies. , 2010, Journal of proteome research.

[10]  Zhongqi Zhang,et al.  A universal algorithm for fast and automated charge state deconvolution of electrospray mass-to-charge ratio spectra , 1998, Journal of the American Society for Mass Spectrometry.

[11]  Michael D. Litton,et al.  IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering. , 2009, Journal of proteome research.

[12]  Andreas Hildebrandt,et al.  Efficient Analysis of Mass Spectrometry Data Using the Isotope Wavelet , 2008 .

[13]  M. Senko,et al.  Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions , 1995, Journal of the American Society for Mass Spectrometry.

[14]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[15]  Edmond J. Breen,et al.  Automatic Poisson peak harvesting for high throughput protein identification , 2000, Electrophoresis.

[16]  Pan Du,et al.  Bioinformatics Original Paper Improved Peak Detection in Mass Spectrum by Incorporating Continuous Wavelet Transform-based Pattern Matching , 2022 .

[17]  D. Tabb,et al.  MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. , 2007, Journal of proteome research.

[18]  上原 勉 TOF(Time of Flight)方式による三次元画像距離カメラ (特集:非接触三次元計測の最新動向) , 2008 .

[19]  David L Tabb,et al.  DirecTag: accurate sequence tags from peptide MS/MS through statistical scoring. , 2008, Journal of proteome research.

[20]  F. McLafferty,et al.  Automated reduction and interpretation of , 2000, Journal of the American Society for Mass Spectrometry.

[21]  F. McLafferty,et al.  High-resolution electrospray mass spectra of large molecules , 1991 .

[22]  Dante Mantini,et al.  LIMPIC: a computational method for the separation of protein MALDI-TOF-MS signals from noise , 2007, BMC Bioinformatics.

[23]  M. MacCoss,et al.  High-speed data reduction, feature detection, and MS/MS spectrum quality assessment of shotgun proteomics data sets using high-resolution mass spectrometry. , 2007, Analytical chemistry.

[24]  David L Tabb,et al.  Determination of peptide and protein ion charge states by fourier transformation of isotope-resolved mass spectra , 2006, Journal of the American Society for Mass Spectrometry.

[25]  P. Pevzner,et al.  Spectral probabilities and generating functions of tandem mass spectra: a strike against decoy databases. , 2008, Journal of proteome research.

[26]  Vincent A Emanuele,et al.  Benchmarking currently available SELDI‐TOF MS preprocessing techniques , 2009, Proteomics.

[27]  Alexey I Nesvizhskii,et al.  Analysis and validation of proteomic data generated by tandem mass spectrometry , 2007, Nature Methods.

[28]  P Berndt,et al.  Reliable automatic protein identification from matrix‐assisted laser desorption/ionization mass spectrometric peptide fingerprints , 1999, Electrophoresis.

[29]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.