Automated reduction and interpretation of multidimensional mass spectra for analysis of complex peptide mixtures

Abstract Here we develop a fully automated procedure for the analysis of liquid chromatography–mass spectrometry (LC–MS) datasets collected during the analysis of complex peptide mixtures. We present the underlying algorithm and outcomes of several experiments justifying its applicability. The novelty of our approach is to exploit the multidimensional character of the datasets. It is common knowledge that highly complex peptide mixtures can be analyzed by liquid chromatography coupled with mass spectrometry, but we are not aware of any existing automated MS spectra interpretation procedure designed to take into account the multidimensional character of the data. Our work fills this gap by providing an effective algorithm for this task, allowing for automated conversion of raw data to the list of masses of peptides.

[1]  Emanuel F Petricoin,et al.  Proteomic analysis at the bedside: early detection of cancer. , 2002, Trends in biotechnology.

[2]  Robert Tibshirani,et al.  Sample classification from protein mass spectrometry, by 'peak probability contrasts' , 2004, Bioinform..

[3]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[4]  J. Glimm,et al.  Detection of cancer-specific markers amid massive mass spectral data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[5]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[6]  Thomas P Conrads,et al.  Multidimensional separation of peptides for effective proteomic analysis. , 2005, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[7]  Ruedi Aebersold,et al.  A Software Suite for the Generation and Comparison of Peptide Arrays from Sets of Data Collected by Liquid Chromatography-Mass Spectrometry*S , 2005, Molecular & Cellular Proteomics.

[8]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[9]  E. Petricoin,et al.  Clinical proteomics: translating benchside promise into bedside reality , 2002, Nature Reviews Drug Discovery.

[10]  Mukesh Verma,et al.  Proteomics for Cancer Biomarker Discovery , 2002 .

[11]  E. Diamandis Mass Spectrometry as a Diagnostic and a Cancer Biomarker Discovery Tool , 2004, Molecular & Cellular Proteomics.

[12]  E. Petricoin,et al.  Clinical proteomics: personalized molecular medicine. , 2001, JAMA.

[13]  D. Figeys,et al.  18O labeling: a tool for proteomics. , 2001, Rapid communications in mass spectrometry : RCM.

[14]  J. Kowalak,et al.  Determination of oligonucleotide composition from mass spectrometrically measured molecular weight , 1993, Journal of the American Society for Mass Spectrometry.

[15]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[16]  Zhongqi Zhang,et al.  A universal algorithm for fast and automated charge state deconvolution of electrospray mass-to-charge ratio spectra , 1998, Journal of the American Society for Mass Spectrometry.

[17]  P. Schellhammer,et al.  Boosted decision tree analysis of surface-enhanced laser desorption/ionization mass spectral serum profiles discriminates prostate cancer from noncancer patients. , 2002, Clinical chemistry.

[18]  M. Senko,et al.  Determination of monoisotopic masses and ion populations for large biomolecules from resolved isotopic distributions , 1995, Journal of the American Society for Mass Spectrometry.

[19]  F. McLafferty,et al.  Automated assignment of charge states from resolved isotopic peaks for multiply charged ions , 1995, Journal of the American Society for Mass Spectrometry.

[20]  Emanuel F Petricoin,et al.  Mass spectrometry-based diagnostics: the upcoming revolution in disease detection. , 2003, Clinical chemistry.

[21]  D. Chan,et al.  Proteomics and bioinformatics approaches for identification of serum biomarkers to detect breast cancer. , 2002, Clinical chemistry.

[22]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[23]  F. McLafferty,et al.  Automated reduction and interpretation of , 2000, Journal of the American Society for Mass Spectrometry.

[24]  Richard D. Smith,et al.  Two-dimensional gas-phase separations coupled to mass spectrometry for analysis of complex mixtures. , 2005, Analytical chemistry.

[25]  Yongyi Mao,et al.  Informatics Platform for Global Proteomic Profiling and Biomarker Discovery Using Liquid Chromatography-Tandem Mass Spectrometry*S , 2004, Molecular & Cellular Proteomics.

[26]  E. Fung,et al.  Proteomic approaches to tumor marker discovery. , 2002, Archives of pathology & laboratory medicine.

[27]  E. Diamandis Analysis of serum proteomic patterns for early cancer diagnosis: drawing attention to potential problems. , 2004, Journal of the National Cancer Institute.

[28]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian cancer , 2002, The Lancet.

[29]  Joseph O'Rourke,et al.  Handbook of Discrete and Computational Geometry, Second Edition , 1997 .

[30]  E. Diamandis Point: Proteomic patterns in biological fluids: do they represent the future of cancer diagnostics? , 2003, Clinical chemistry.

[31]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[32]  Liang Li,et al.  Microcolumn capture and digestion of proteins combined with mass spectrometry for protein identification. , 2002, Journal of proteome research.

[33]  Michael Olivier,et al.  ZoomQuant: An application for the quantitation of stable isotope labeled peptides , 2005, Journal of the American Society for Mass Spectrometry.

[34]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.