A Mass Spectrometric Analysis Method Based on PPCA and SVM for Early Detection of Ovarian Cancer

Background. Surfaced-enhanced laser desorption-ionization-time of flight mass spectrometry (SELDI-TOF-MS) technology plays an important role in the early diagnosis of ovarian cancer. However, the raw MS data is highly dimensional and redundant. Therefore, it is necessary to study rapid and accurate detection methods from the massive MS data. Methods. The clinical data set used in the experiments for early cancer detection consisted of 216 SELDI-TOF-MS samples. An MS analysis method based on probabilistic principal components analysis (PPCA) and support vector machine (SVM) was proposed and applied to the ovarian cancer early classification in the data set. Additionally, by the same data set, we also established a traditional PCA-SVM model. Finally we compared the two models in detection accuracy, specificity, and sensitivity. Results. Using independent training and testing experiments 10 times to evaluate the ovarian cancer detection models, the average prediction accuracy, sensitivity, and specificity of the PCA-SVM model were 83.34%, 82.70%, and 83.88%, respectively. In contrast, those of the PPCA-SVM model were 90.80%, 92.98%, and 88.97%, respectively. Conclusions. The PPCA-SVM model had better detection performance. And the model combined with the SELDI-TOF-MS technology had a prospect in early clinical detection and diagnosis of ovarian cancer.

[1]  Chung-Chuan Cheng,et al.  An automatic segmentation and classification framework for anti-nuclear antibody images , 2013, BioMedical Engineering OnLine.

[2]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian cancer , 2002, The Lancet.

[3]  Patrice Mathevet,et al.  Ovarian cancer screening in the general population , 2013, Revue medicale suisse.

[4]  M. Lamberto,et al.  Principal component analysis in fast atom bombardment-mass spectrometry of triacylglycerols in edible oils , 1995 .

[5]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[6]  Bowei Xi,et al.  Principal component directed partial least squares analysis for combining nuclear magnetic resonance and mass spectrometry data in metabolomics: application to the detection of breast cancer. , 2011, Analytica chimica acta.

[7]  Petr G. Lokhov,et al.  Diagnosis of lung cancer based on direct-infusion electrospray mass spectrometry of blood plasma metabolites , 2012 .

[8]  Paul Terry,et al.  Application of the GA/KNN method to SELDI proteomics data , 2004, Bioinform..

[9]  E. Petricoin,et al.  High-resolution serum proteomic features for ovarian cancer detection. , 2004, Endocrine-related cancer.

[10]  Elena Marchiori,et al.  Robust SVM-Based Biomarker Selection with Noisy Mass Spectrometric Proteomic Data , 2006, EvoWorkshops.

[11]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[12]  Yair Lotan,et al.  Prostate cancer biomarker discovery using high performance mass spectral serum profiling , 2009, Comput. Methods Programs Biomed..

[13]  J. Miller,et al.  Artificial neural network for charge prediction in metabolite identification by mass spectrometry. , 2015, Methods in molecular biology.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[15]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[16]  E. Petricoin,et al.  Serum proteomic patterns for detection of prostate cancer. , 2002, Journal of the National Cancer Institute.

[17]  O John Semmes,et al.  Normal, benign, preneoplastic, and malignant prostate cells have distinct protein expression profiles resolved by surface enhanced laser desorption/ionization mass spectrometry. , 2002, Clinical cancer research : an official journal of the American Association for Cancer Research.

[18]  Kyu Jong Lee,et al.  Matrix-assisted laser desorption/ionization-mass spectrometry of cuticular lipid profiles can differentiate sex, age, and mating status of Anopheles gambiae mosquitoes. , 2011, Analytica chimica acta.