Analysis of MALDI-TOF Serum Profiles for Biomarker Selection and Sample Classification

Mass spectrometric profiles of peptides and proteins obtained by current technologies are characterized by complex spectra, high dimensionality, and substantial noise. These characteristics generate challenges in discovery of proteins and protein-profiles that distinguish disease states, e.g. cancer patients from healthy individuals. A challenging aspect of biomarker discovery in serum is the interference of abundant proteins with identification of disease-related proteins and peptides. We present data processing methods and computational intelligence that combines support vector machines (SVM) with particle swarm optimization (PSO) for biomarker selection from MALDI-TOF spectra of enriched serum. SVM classifiers were built for various combinations of m/z windows guided by the PSO algorithm. The method identified mass points that achieved high classification accuracy in distinguishing cancer patients from non-cancer controls. Based on their frequency of occurrence in multiple runs, six m/z windows were selected as candidate biomarkers. These biomarkers yielded 100% sensitivity and 91% specificity in distinguishing liver cancer patients from healthy individuals in an independent dataset.

[1]  E. Diamandis Mass Spectrometry as a Diagnostic and a Cancer Biomarker Discovery Tool , 2004, Molecular & Cellular Proteomics.

[2]  E. Petricoin,et al.  High-resolution serum proteomic features for ovarian cancer detection. , 2004, Endocrine-related cancer.

[3]  Jeffrey S. Morris,et al.  A comprehensive approach to the analysis of matrix‐assisted laser desorption/ionization‐time of flight proteomics spectra from serum samples , 2003, Proteomics.

[4]  Jeffrey S. Morris,et al.  Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments , 2004, Bioinform..

[5]  Habtom W. Ressom,et al.  Analysis of mass spectral serum profiles for biomarker selection , 2005, Bioinform..

[6]  E. Petricoin,et al.  Serum proteomic patterns for detection of prostate cancer. , 2002, Journal of the National Cancer Institute.

[7]  Rency S Varghese,et al.  Enrichment of low molecular weight fraction of serum for MS analysis of peptides associated with hepatocellular carcinoma , 2006, Proteomics.

[8]  M. Shigenaga,et al.  Evaluation of a novel, integrated approach using functionalized magnetic beads, bench-top MALDI-TOF-MS with prestructured sample supports, and pattern recognition software for profiling potential biomarkers in human plasma. , 2004, Journal of biomolecular techniques : JBT.

[9]  J. Glimm,et al.  Detection of cancer-specific markers amid massive mass spectral data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Trosset,et al.  Enhancement of sensitivity and resolution of surface-enhanced laser desorption/ionization time-of-flight mass spectrometric records for serum peptides using time-series analysis techniques. , 2005, Clinical chemistry.

[11]  R. Bast,et al.  Three Biomarkers Identified from Serum Proteomic Analysis for the Detection of Early Stage Ovarian Cancer , 2004, Cancer Research.

[12]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian Cancer , 2002 .

[13]  J. Potter,et al.  A data-analytic strategy for protein biomarker discovery: profiling of high-dimensional proteomic data for cancer detection. , 2003, Biostatistics.

[14]  Tamer Hifnawy,et al.  Associations of pesticides, HCV, HBV, and hepatocellular carcinoma in Egypt. , 2005, International journal of hygiene and environmental health.

[15]  Habtom W. Ressom,et al.  Particle swarm optimization for analysis of mass spectral serum profiles , 2005, GECCO '05.

[16]  Pierre Bedossa,et al.  Identification of a new marker of hepatocellular carcinoma by serum protein profiling of patients with chronic liver diseases , 2005, Hepatology.

[17]  D. Ransohoff Bias as a threat to the validity of cancer molecular-marker research , 2005, Nature reviews. Cancer.

[18]  Terence P. Speed,et al.  NORMALIZATION , BASELINE CORRECTION AND ALIGNMENT OF HIGH-THROUGHPUT MASS SPECTROMETRY DATA , 2004 .

[19]  D. Chan,et al.  Evaluation of serum protein profiling by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry for the detection of prostate cancer: I. Assessment of platform reproducibility. , 2005, Clinical chemistry.

[20]  K. Baggerly,et al.  Pharmacoproteomic analysis of prechemotherapy and postchemotherapy plasma samples from patients receiving neoadjuvant or adjuvant chemotherapy for breast carcinoma , 2004, Cancer.

[21]  D. Chan,et al.  Serum Diagnosis of Pancreatic Adenocarcinoma Using Surface-Enhanced Laser Desorption and Ionization Mass Spectrometry , 2004, Clinical Cancer Research.

[22]  E. Fung,et al.  ProteinChip clinical proteomics: computational challenges and solutions. , 2002, BioTechniques.

[23]  Jeffrey S. Morris,et al.  Improved peak detection and quantification of mass spectrometry data acquired from surface‐enhanced laser desorption and ionization by denoising spectra with the undecimated discrete wavelet transform , 2005, Proteomics.

[24]  T. Veenstra,et al.  Characterization of the Low Molecular Weight Human Serum Proteome*S , 2003, Molecular & Cellular Proteomics.

[25]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .