A hybrid classification algorithm approach for breast cancer diagnosis

Early diagnosis of Breast Cancer is significantly important to treat the disease easily therefore it is necessary to develop techniques that can help physicians to get accurate diagnosis. This study suggests a hybrid classification algorithm which is based upon Genetic Algorithm (GA) and k Nearest neighbor algorithm (kNN). GA algorithm has been used for its primary purpose as an optimization technique for kNN by selecting best features as well as optimization of the k value, while the kNN is used for classification purpose. The planned algorithm is tested by applying it on Wisconsin Breast Cancer Dataset from UCI Repository of Machine Learning Databases using different datasets in which the first is Wisconsin Breast Cancer Database (WBCD) and the second one is Wisconsin Diagnosis Breast Cancer (WDBC) which has changes in the number of attributes and number of instances. The proposed algorithm was measured against different classifier algorithms on the same database. The evaluation results of the algorithm proposed have achieved 99% accuracy.

[1]  Soo-Hong Kim,et al.  Analysis of breast cancer using data mining & statistical techniques , 2005, Sixth International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing and First ACIS International Workshop on Self-Assembling Wireless Network.

[2]  Salwani Abdullah,et al.  Controlling Multi Algorithms Using Round Robin for University Course Timetabling Problem , 2010, FGIT-DTA/BSBT.

[3]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[4]  Ethem Alpaydin,et al.  Voting over Multiple Condensed Nearest Neighbors , 1997, Artificial Intelligence Review.

[5]  A. Jemal,et al.  Global Cancer Statistics , 2011 .

[6]  Abdelkader Benyettou,et al.  Breast Cancer Diagnosis by using k-Nearest Neighbor with Different Distances and Classification Rules , 2013 .

[7]  ci UniversityTR Voting over Multiple Condensed Nearest Neighbors , 1995 .

[8]  Mei-Ling Huang,et al.  Usage of Case-Based Reasoning, Neural Network and Adaptive Neuro-Fuzzy Inference System Classification Techniques in Breast Cancer Dataset Classification Diagnosis , 2012, Journal of Medical Systems.

[9]  M. B. Abdelhalim,et al.  Breast Cancer Diagnosis on Three Different Datasets Using Multi-Classifiers , 2012 .

[10]  Mehmet Fatih Akay,et al.  Support vector machines combined with feature selection for breast cancer diagnosis , 2009, Expert Syst. Appl..

[11]  Sang Won Yoon,et al.  Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms , 2014, Expert Syst. Appl..