Understanding the characteristics of mass spectrometry data through the use of simulation

Background Mass spectrometry is actively being used to discover disease-related proteomic patterns in complex mixtures of proteins derived from tissue samples or from easily obtained biological fluids. The potential importance of these clinical applications has made the development of better methods for processing and analyzing the data an active area of research. It is, however, difficult to determine which methods are better without knowing the true biochemical composition of the samples used in the experiments. Methods We developed a mathematical model based on the physics of a simple MALDI-TOF mass spectrometer with time-lag focusing. Using this model, we implemented a statistical simulation of mass spectra. We used the simulation to explore some of the basicoperating characteristics of MALDI or SELDI instruments. Results The simulation reproduced several characteristics of actual instruments. We found that the relative mass error is affected by the time discretization of the detector (about 0.01%) and the spread of initial velocities (about 0.1%). The accuracy of calibration based on external standards decays rapidly outside the range spanned by the calibrants. Natural isotope distributions play a major role inbroadening peaks associated with individual proteins. The area of a peak is a more accurate measure of its size than the height. Conclusions The model described here is capable of simulating realistic mass spectra. The simulation should become a useful tool forgenerating spectra where the true inputs are known, allowing researchers to evaluate the performance of new methods for processing and analyzing mass spectra. Availability: http://bioinformatics.mdanderson.org/cromwell.html

[1]  Jeffrey S. Morris,et al.  Improved peak detection and quantification of mass spectrometry data acquired from surface‐enhanced laser desorption and ionization by denoising spectra with the undecimated discrete wavelet transform , 2005, Proteomics.

[2]  Jeffrey S. Morris,et al.  Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments , 2004, Bioinform..

[3]  J. Glimm,et al.  Detection of cancer-specific markers amid massive mass spectral data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Y. Yasui,et al.  An Automated Peak Identification/Calibration Procedure for High-Dimensional Protein Measures From Mass Spectrometers , 2003, Journal of biomedicine & biotechnology.

[5]  Jeffrey S. Morris,et al.  Quality control and peak finding for proteomics data collected from nipple aspirate fluid by surface-enhanced laser desorption and ionization. , 2003, Clinical chemistry.

[6]  Jeffrey S. Morris,et al.  A comprehensive approach to the analysis of matrix‐assisted laser desorption/ionization‐time of flight proteomics spectra from serum samples , 2003, Proteomics.

[7]  Xiwu Lin,et al.  Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method , 2003, Proteomics.

[8]  Douglas M. Hawkins,et al.  Exploring Blood Spectra for Signs of Ovarian Cancer , 2003 .

[9]  L. Cazares,et al.  A Look at Mass Spectral Measurement , 2003 .

[10]  A. Pothen,et al.  Protocols for disease classification from mass spectrometry data , 2003, Proteomics.

[11]  J. Potter,et al.  A data-analytic strategy for protein biomarker discovery: profiling of high-dimensional proteomic data for cancer detection. , 2003, Biostatistics.

[12]  M. Zhan,et al.  A data review and re-assessment of ovarian cancer serum proteomic profiling , 2003, BMC Bioinform..

[13]  T. Zhukov,et al.  Discovery of distinct protein profiles specific for lung tumors and pre-malignant lung lesions by SELDI mass spectrometry. , 2003, Lung cancer.

[14]  Pierre Legrain,et al.  Comprehensive Proteomic Analysis of Breast Cancer Cell Membranes Reveals Unique Proteins with Potential Roles in Clinical Cancer* 210 , 2003, The Journal of Biological Chemistry.

[15]  E. Fung,et al.  Proteomic approaches to tumor marker discovery. , 2002, Archives of pathology & laboratory medicine.

[16]  W Henry Benner,et al.  A simple algorithm improves mass accuracy to 50-100 ppm for delayed extraction linear matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. , 2002, Rapid communications in mass spectrometry : RCM.

[17]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[18]  L. Liotta,et al.  Proteomic Patterns of Nipple Aspirate Fluids Obtained by SELDI-TOF: Potential for New Biomarkers to Aid in the Diagnosis of Breast Cancer , 2002, Disease markers.

[19]  Peter Albers,et al.  Analysis of microdissected prostate tissue with ProteinChip arrays--a way to new insights into carcinogenesis and to diagnostic tools. , 2002, International journal of molecular medicine.

[20]  J. Reilly,et al.  Improved calibration of time-of-flight mass spectra by simplex optimization of electrostatic ion calculations. , 2000, Analytical chemistry.

[21]  S. Weinberger,et al.  Recent advancements in surface‐enhanced laser desorption/ionization‐time of flight‐mass spectrometry , 2000, Electrophoresis.

[22]  E. Petricoin,et al.  Rapid protein display profiling of cancer progression directly from human tissue using a protein biochip , 2000 .

[23]  Michael Karas,et al.  The initial ion velocity and its dependence on matrix, analyte and preparation method in ultraviolet matrix‐assisted laser desorption/ionization , 1999 .

[24]  Marvin L. Vestal,et al.  Resolution and mass accuracy in matrix-assisted laser desorption ionization-time-of-flight , 1998 .

[25]  R. Edmondson,et al.  Evaluation of the variables that affect resolution in delayed extraction MALDI-TOF , 1997 .

[26]  S. Guan,et al.  Enhancement of the effective resolution of mass spectra of high-mass biomolecules by maximum entropy-based deconvolution to eliminate the isotopic natural abundance distribution , 1997 .

[27]  Stephen A. Martin,et al.  On the initial velocity of ions generated by matrix-assisted laser desorption ionization and its effect on the calibration of delayed extraction time-of-flight mass spectra , 1997 .

[28]  M. Karas,et al.  Factors affecting the resolution in matrix-assisted laser desorption--ionization mass spectrometry , 1994 .

[29]  Ronald Beavis,et al.  Velocity distributions of intact high mass polypeptide molecule ions produced by matrix assisted laser desorption , 1991 .

[30]  I. Mclaren,et al.  TIME-OF-FLIGHT MASS SPECTROMETER WITH IMPROVED RESOLUTION , 1955 .

[31]  S. Weinberger,et al.  Current developments in SELDI affinity technology. , 2004, Mass spectrometry reviews.

[32]  P. Nickerson,et al.  Urine protein profiling with surface-enhanced laser-desorption/ionization time-of-flight mass spectrometry. , 2004, Kidney international.

[33]  Stephen A. Martin,et al.  Delayed extraction matrix‐assisted laser desorption time‐of‐flight mass spectrometry , 1995 .

[34]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian cancer , 2002, The Lancet.