Application of Support Vector Machine to Predict 5-year Survival Status of Patients with Nasopharyngeal Carcinoma after Treatment

AbstractObjective: Support Vector Machine (SVM) is a machine-learning method, based on the principle of structural risk minimization, which performs well when applied to data outside the training set. In this paper, SVM was applied to predict 5-year survival status of patients with nasopharyngeal carcinoma (NPC) after treatment, we expect to find a new way for prognosis studies in cancer so as to assist right clinical decision for individual patient. Methods: Two modelling methods were used in the study; SVM network and a standard parametric logistic regression were used to model 5-year survival status. And the two methods were compared on a prospective set of patients not used in model construction via receiver operating characteristic (ROC) curve analysis. Results: The SVM1, trained with the 25 original input variables without screening, yielded a ROC area of 0.868, at sensitivity to mortality of 79.2% and the specificity of 94.5%. Similarly, the SVM2, trained with 9 input variables which were obtained by optimal input variable selection from the 25 original variables by logistic regression screening, yielded a ROC area of 0.874, at a sensitivity to mortality of 79.2% and the specificity of 95.6%, while the logistic regression yielded a ROC area of 0.751 at a sensitivity to mortality of 66.7% and gave a specificity of 83.5%. Conclusion: SVM found a strong pattern in the database predictive of 5-year survival status. The logistic regression produces somewhat similar, but better, results. These results show that the SVM models have the potential to predict individual patient's 5-year survival status after treatment, and to assist the clinicians for making a good clinical decision.

[1]  C. Hsieh,et al.  Prognostic factors affecting the outcome of nasopharyngeal carcinoma. , 2003, Japanese journal of clinical oncology.

[2]  S. H. Cheng,et al.  Long-term survival of nasopharyngeal carcinoma following concomitant radiotherapy and chemotherapy. , 2000, International journal of radiation oncology, biology, physics.

[3]  E. B. Butler,et al.  Important prognostic factors in patients with skull base erosion from nasopharyngeal carcinoma after radiotherapy. , 2001, International journal of radiation oncology, biology, physics.

[4]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[5]  Sabine Van Huffel,et al.  Preoperative prediction of malignancy of ovarian tumors using least squares support vector machines , 2003, Artif. Intell. Medicine.

[6]  David Haussler,et al.  Classifying G-protein coupled receptors with support vector machines , 2002, Bioinform..

[7]  L. Sobin,et al.  Prognostic factors in clinical decision making , 2001, Cancer.

[8]  B. Yegnanarayana,et al.  Artificial Neural Networks , 2004 .

[9]  Feng Luan,et al.  Diagnosing Breast Cancer Based on Support Vector Machines , 2003, J. Chem. Inf. Comput. Sci..

[10]  J. Sham,et al.  Prognostic impact of hemoglobin levels on treatment outcome in patients with nasopharyngeal carcinoma treated with sequential chemoradiotherapy or radiotherapy alone , 2004 .

[11]  T. Farias,et al.  Prognostic factors and outcome for nasopharyngeal carcinoma. , 2003, Archives of otolaryngology--head & neck surgery.

[12]  Gisbert Schneider,et al.  Support vector machine applications in bioinformatics. , 2003, Applied bioinformatics.

[13]  J. Concato Challenges in prognostic analysis , 2001, Cancer.

[14]  Yingdong Zhao,et al.  Application of support vector machines for T-cell epitopes prediction , 2003, Bioinform..

[15]  Stanislaw Osowski,et al.  Support vector machine-based expert system for reliable heartbeat recognition , 2004, IEEE Transactions on Biomedical Engineering.

[16]  J. Roh,et al.  Nasopharyngeal carcinoma with skull base invasion: a necessity of staging subdivision. , 2004, American journal of otolaryngology.

[17]  K. Chou,et al.  Application of SVM to predict membrane protein types. , 2004, Journal of theoretical biology.

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .