Improving feature detection and analysis of surface‐enhanced laser desorption/ionization‐time of flight mass spectra

Discovering valid biological information from surface‐enhanced laser desorption/ionization‐time of flight mass spectrometry (SELDI‐TOF MS) depends on clear experimental design, meticulous sample handling, and sophisticated data processing. Most published literature deals with the biological aspects of these experiments, or with computer‐learning algorithms to locate sets of classifying biomarkers. The process of locating and measuring proteins across spectra has received less attention. This process should be tunable between sensitivity and false‐discovery, and should guarantee that features are biologically meaningful in that they represent chemical species that can be identified and investigated. Existing feature detection in SELDI‐TOF MS is not optimal for acquiring biologically relevant data. Most methods have so many user‐defined settings that reproducibility and comparability among studies suffer considerably. To address these issues, we have developed an approach, called simultaneous spectrum analysis (SSA), which (i) locates proteins across spectra, (ii) measures their abundance, (iii) subtracts baseline, (iv) excludes irreproducible measurements, and (v) computes normalization factors for comparing spectra. SSA uses only two key parameters for feature detection and one parameter each for quality thresholds on spectra and peaks. The effectiveness of SSA is demonstrated by identifying proteins differentially expressed in SELDI‐TOF spectra from plasma of wild‐type and knockout mice for plasma glutathione peroxidase. Comparing analyses by SSA and CiphergenExpress Data Manager 2.1 finds similar results for large signal peaks, but SSA improves the number and quality of differences betweens groups among lower signal peaks. SSA is also less likely to introduce systematic bias when normalizing spectra.

[1]  J. Whitin,et al.  Plasma Glutathione Peroxidase and Its Relationship to Renal Proximal Tubule Function , 1998 .

[2]  E. Fung,et al.  Protein biochips for differential profiling. , 2001, Current opinion in biotechnology.

[3]  G. Li,et al.  An integrated approach utilizing artificial neural networks and SELDI mass spectrometry for the classification of human tumours and rapid identification of potential biomarkers , 2002, Bioinform..

[4]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[5]  E. Fung,et al.  Proteomic approaches to tumor marker discovery. , 2002, Archives of pathology & laboratory medicine.

[6]  Bruce Randall Donald,et al.  Probabilistic Disease Classification of Expression-Dependent Proteomic Data from Mass Spectrometry of Human Serum , 2003, J. Comput. Biol..

[7]  J. Glimm,et al.  Detection of cancer-specific markers amid massive mass spectral data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  J. Potter,et al.  A data-analytic strategy for protein biomarker discovery: profiling of high-dimensional proteomic data for cancer detection. , 2003, Biostatistics.

[9]  Jeffrey S. Morris,et al.  Quality control and peak finding for proteomics data collected from nipple aspirate fluid by surface-enhanced laser desorption and ionization. , 2003, Clinical chemistry.

[10]  Xiwu Lin,et al.  Megavariate data analysis of mass spectrometric proteomics data using latent variable projection method , 2003, Proteomics.

[11]  Jeffrey S. Morris,et al.  A comprehensive approach to the analysis of matrix‐assisted laser desorption/ionization‐time of flight proteomics spectra from serum samples , 2003, Proteomics.

[12]  C. Floyd,et al.  Decision tree classification of proteins identified by mass spectrometry of blood serum samples from people with and without lung cancer , 2003, Proteomics.

[13]  Charles M Guttman,et al.  An operator-independent approach to mass spectral peak identification and integration. , 2004, Analytical chemistry.

[14]  Jeffrey S. Morris,et al.  Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments , 2004, Bioinform..

[15]  R. Bast,et al.  Three Biomarkers Identified from Serum Proteomic Analysis for the Detection of Early Stage Ovarian Cancer , 2004, Cancer Research.

[16]  K. Rodland Proteomics and cancer diagnosis: the potential of mass spectrometry. , 2004, Clinical biochemistry.

[17]  D. Chan,et al.  Bioinformatics strategies for proteomic profiling. , 2004, Clinical biochemistry.