Analyzing Potential of SVM Based Classifiers for Intelligent and Less Invasive Breast Cancer Prognosis

Accurate and less invasive personalized predictive medicine relieves many breast cancer patients from agonizingly complex surgical treatments, their colossal costs and primarily letting the patient to forgo the morbidity of a treatment that proffers no benefit. Cancer prognosis estimates recurrence of disease and predict survival of patient; hence resulting in improved patient management. Support Vector Machines (SVMs) are shown to be powerful tools for analyzing data sets where there are complicated nonlinear interactions between the input data and the information to be predicted. In this paper, we have targeted this strength of SVMs to analyze the potential of classification through feature vectors for predicting the survival chances of a breast cancer patient. Experiments were performed using different types of SVM algorithms analyzing their classification efficiency using different kernel parameters. SEER breast cancer data set (1973-2003), the most comprehensible source of information on cancer incidence in United States, is considered. Sensitivity, specificity and accuracy parameters along with RoC curves have been used to explain the performance of each SVM algorithm with different kernel types.

[1]  Yuh-Jye Lee,et al.  Breast cancer survival and chemotherapy: A support vector machine analysis , 1999, Discrete Mathematical Problems with Medical Applications.

[2]  R. Kay The Analysis of Survival Data , 2012 .

[3]  Sotiris B. Kotsiantis,et al.  Machine learning: a review of classification and combining techniques , 2006, Artificial Intelligence Review.

[4]  Yanchun Zhang,et al.  Toward breast cancer survivability prediction models through improving training space , 2009, Expert Syst. Appl..

[5]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[6]  Michael G. Madden,et al.  The Genetic Kernel Support Vector Machine: Description and Evaluation , 2005, Artificial Intelligence Review.

[7]  A Howell,et al.  Evaluation of breast cancer risk assessment packages in the family history evaluation and screening programme , 2003, Journal of medical genetics.

[8]  Hyunjung Shin,et al.  wFDT - Weighted Fuzzy Decision Trees for Prognosis of Breast Cancer Survivability , 2008, AusDM.

[9]  H Brenner,et al.  A computer program for period analysis of cancer patient survival. , 2002, European journal of cancer.

[10]  Li Liu,et al.  Improved breast cancer prognosis through the combination of clinical and genetic markers , 2007, Bioinform..

[11]  Charles P. Staelin Parameter selection for support vector machines , 2002 .

[12]  Chih-Jen Lin,et al.  Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..

[13]  Kyle T. Bradley Prognostic and Predictive Factors in Breast Cancer , 2007 .

[14]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[15]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.