Breast cancer classification by using support vector machines with reduced dimension

Correct and timely diagnosis of diseases is an essential matter in medical field. Limited human capability and limitations decrease the rate of correct diagnosis. Machine learning algorithms such as support vector machine (SVM) can help physicians to diagnose more correctly. In this study, Wisconsin diagnostic breast cancer (WDBC) data set is used to classify tumors as benign and malignant. Independent component analysis (ICA) is used to reduce the dimensionality of WDBC data into two feature vectors. The effect of using two reduced features to classify breast cancer with SVM and polynomial or radial basis function (RBF) kernels are investigated. Performances of these classifiers are evaluated to find out accuracy, sensitivity and specificity. In addition, the receiver operating characteristics (ROC) curves of SVM with these kernels are presented. Results show that SVM with quadratic kernel provides the most accurate diagnosis results (94.40%) and decreases the accuracy and sensitivity values slightly when the dimensionality is reduced into two feature vector computing two independent components.

[1]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[2]  M. Zweig,et al.  Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine. , 1993, Clinical chemistry.

[3]  Yiu-ming Cheung,et al.  An experimental study: on reducing RBF input dimension by ICA and PCA , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[4]  Vivian West,et al.  Computing, Artificial Intelligence and Information Technology Ensemble strategies for a medical diagnostic decision support system: A breast cancer diagnosis application , 2005 .

[5]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[6]  Vennila Ramalingam,et al.  Breast mass classification based on cytological patterns using RBFNN and SVM , 2009, Expert Syst. Appl..

[7]  Ahmet Sertbas,et al.  Evaluation of face recognition techniques using PCA, wavelets and SVM , 2010, Expert Syst. Appl..

[8]  Rüdiger W. Brause,et al.  Medical Analysis and Diagnosis by Neural Networks , 2001, ISMDA.

[9]  Nasser Hassan Sweilam,et al.  Support vector machine for diagnosis cancer disease: A comparative study , 2010 .

[10]  Chandan Chakraborty,et al.  Statistical analysis of mammographic features and its classification using support vector machine , 2010, Expert Syst. Appl..

[11]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.