Medical data mining using BGA and RGA for weighting of features in fuzzy k-NN classification

The k-nearest neighbor (k-NN) algorithm is commonly used in applications of classifiers and data mining and the related area due to its simplicity and effectiveness. In this study, all of features and optimal feature subsets with three features are investigated. For classification, crisp k-NN, fuzzy k-NN, and weighting fuzzy k-NN classifiers are compared. For weighting of features, two types of coding including binary-coded genetic algorithms (BGA) and real-coded genetic algorithms (BGA) are evaluated. Experiments are conducted on the Wisconsin diagnosis breast cancer (WDBC) dataset and the Pima (PIMA) Indians diabetes dataset, and the classification accuracy, false negative, and computation time are reported in this paper.

[1]  Ming-Hseng Tseng,et al.  A genetic algorithm rule-based approach for land-cover classification , 2008 .

[2]  David E. Goldberg,et al.  Genetic Algorithms with Sharing for Multimodalfunction Optimization , 1987, ICGA.

[3]  Paul Scheunders,et al.  Genetic feature selection combined with composite fuzzy nearest neighbor classifiers for hyperspectral satellite imagery , 2002, Pattern Recognit. Lett..

[4]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  Hung-Chang Liao,et al.  The genetic algorithm for breast tumor diagnosis - The case of DNA viruses , 2009, Appl. Soft Comput..

[6]  Michael J. Shaw,et al.  Genetic algorithms with dynamic niche sharing for multimodal function optimization , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[7]  Erkki Tomppo,et al.  Using coarse scale forest variables as ancillary information and weighting of variables in k-NN estimation: a genetic algorithm approach , 2004 .

[8]  Andrew O. Finley,et al.  Delineation of forest/nonforest land use classes using nearest neighbor methods , 2004 .

[9]  David Coley,et al.  Introduction to Genetic Algorithms for Scientists and Engineers , 1999 .

[10]  T. Warren Liao,et al.  Medical data mining by fuzzy modeling with selected features , 2008, Artif. Intell. Medicine.

[11]  Ingoo Han,et al.  Case-based reasoning supported by genetic algorithms for corporate bond rating , 1999 .