Sequential minimal optimization for support vector machine with feature selection in breast cancer diagnosis

Accurate and proper diagnosis in shorter time avoids the breast cancer death. The goal is to find breast cancer as early as possible because earlier staging of breast cancer is curable. Support Vector Machine is a useful classifier among other method but the main disadvantage of Support Vector Machine (SVM) is that it's time-consuming to train large dataset because of the traditional Quadratic Programming (QP) optimization problem. In this paper; Sequential Minimal Optimization (SMO) for SVM with feature selection in breast cancer diagnosis has been proposed. This method is more efficient on diagnosis that increases the classification accuracy with faster training time to train the datasets. We have done experiments on different training-test sets of the Wisconsin breast cancer dataset (WBCD) which is the most popular dataset among the researchers for breast cancer diagnosis. After that, performance evaluation is measured which shows the diagnostic performance of the SVM. At last, proposed approach obtained 100% accuracy with faster training time and there is no misclassification sample because false positive (FP) and false negative (FN) is zero for the model 4 in 80-20% training-test dataset.

[1]  William Nick Street,et al.  Breast Cancer Diagnosis and Prognosis Via Linear Programming , 1995, Oper. Res..

[2]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[3]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[4]  Ferenc Szeifert,et al.  Supervised fuzzy clustering for the identification of fuzzy classifiers , 2003, Pattern Recognit. Lett..

[5]  K. Bennett,et al.  A support vector machine approach to decision trees , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[6]  Rudy Setiono,et al.  Generating concise and accurate classification rules for breast cancer diagnosis , 2000, Artif. Intell. Medicine.

[7]  Moshe Sipper,et al.  A fuzzy-genetic approach to breast cancer diagnosis , 1999, Artif. Intell. Medicine.

[8]  Kemal Polat,et al.  Breast cancer diagnosis using least square support vector machine , 2007, Digit. Signal Process..

[9]  M. Cevdet Ince,et al.  An expert system for detection of breast cancer based on association rules and neural network , 2009, Expert Syst. Appl..

[10]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[11]  Dr. M. G. Worster Methods of Mathematical Physics , 1947, Nature.

[12]  Rudolf Kruse,et al.  Obtaining interpretable fuzzy classification rules from medical data , 1999, Artif. Intell. Medicine.

[13]  Domenico Conforti,et al.  Kernel based support vector machine via semidefinite programming: Application to medical diagnosis , 2010, Comput. Oper. Res..

[14]  C. Goose,et al.  Glossary of Terms , 2004, Machine Learning.

[15]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Robert G. Harrison,et al.  Detecting false benign in breast cancer diagnosis , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[18]  John C. Platt Using Analytic QP and Sparseness to Speed Training of Support Vector Machines , 1998, NIPS.

[19]  A.A. Albrecht,et al.  Two applications of the LSA machine , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..