A Hybrid Approach from Ant Colony Optimization and K-nearest Neighbor for Classifying Datasets Using Selected Features

This paper presents an Ant Colony Optimization (ACO) approach for feature selection. The challenge in the feature selection problem is the large search space that exists due to either redundant or irrelevant features which affects the classifier performance negatively. The proposed approach aims to minimize the subset of features used in classification and maximize the classification accuracy. The proposed approach uses several groups of ants, each group selects the candidate features using different criteria. The used ACO approach introduces the datasets to a fitness function that is composed of heuristic value component and pheromone value component. The heuristic information is represented with the Class-Separability (CS) value of the feature. The pheromone value calculation is based on the classification accuracy resulted by adding the feature. A K-Nearest Neighbor based classifier was used. The sequential forward feature selection is used, so it selects from the highest recommended features sequentially until the accuracy is enhanced. The proposed approach is applied on different medical datasets yielding promising results and findings.

[1]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[2]  Carlos A. Coello Coello,et al.  An Introduction to Swarm Intelligence for Multi-objective Problems , 2009 .

[3]  Bo Liu,et al.  Swarm Intelligence and its Application in Abnormal Data Detection , 2015, Informatica.

[4]  S. S. Iyengar,et al.  An Evaluation of Filter and Wrapper Methods for Feature Selection in Categorical Clustering , 2005, IDA.

[5]  J. Jona,et al.  Ant-cuckoo colony optimization for feature selection in digital mammogram. , 2014, Pakistan journal of biological sciences : PJBS.

[6]  Parham Moradi,et al.  An unsupervised feature selection algorithm based on ant colony optimization , 2014, Eng. Appl. Artif. Intell..

[7]  Carlos A. Coello Coello,et al.  Swarm Intelligence for Multi-objective Problems in Data Mining , 2009 .

[8]  Thanh Tung Khuat,et al.  Optimizing Parameters of Software Effort Estimation Models using Directed Artificial Bee Colony Algorithm , 2016 .

[9]  Alan J. Miller Subset Selection in Regression , 1992 .

[10]  Antonio J. Tallón-Ballesteros,et al.  Tackling Ant Colony Optimization Meta-Heuristic as Search Method in Feature Subset Selection Based on Correlation or Consistency Measures , 2014, IDEAL.

[11]  Leslie S. Smith,et al.  Feature subset selection in large dimensionality domains , 2010, Pattern Recognit..

[12]  Ali Khazaee,et al.  Heart Beat Classification Using Particle Swarm Optimization , 2013 .

[13]  Karim Faez,et al.  Feature Selection Using Ant Colony Optimization (ACO): A New Method and Comparative Study in the Application of Face Recognition System , 2007, ICDM.

[14]  Marco Dorigo,et al.  Optimization, Learning and Natural Algorithms , 1992 .

[15]  Wei-Chang Yeh,et al.  Novel swarm optimization for mining classification rules on thyroid gland data , 2012, Inf. Sci..

[16]  C. A. Murthy,et al.  Unsupervised Feature Selection Using Feature Similarity , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Kun-Huang Chen,et al.  A new particle swarm feature selection method for classification , 2013, Journal of Intelligent Information Systems.

[18]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[19]  Feng Chu,et al.  Applications of support vector machines to cancer classification with microarray data , 2005, Int. J. Neural Syst..

[20]  Sreeram Ramakrishnan,et al.  A hybrid approach for feature subset selection using neural networks and ant colony optimization , 2007, Expert Syst. Appl..

[21]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[22]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[23]  Xin-She Yang,et al.  A Binary Cuckoo Search and Its Application for Feature Selection , 2014 .

[24]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[25]  Aboul Ella Hassanien,et al.  Ant Colony based Feature Selection Heuristics for Retinal Vessel Segmentation , 2014, ArXiv.

[26]  Alper Ekrem Murat,et al.  A discrete particle swarm optimization method for feature selection in binary classification problems , 2010, Eur. J. Oper. Res..

[27]  Gang Wang,et al.  Multiple parameter control for ant colony optimization applied to feature selection problem , 2015, Neural Computing and Applications.

[28]  Ali Zakerolhosseini,et al.  Unsupervised probabilistic feature selection using ant colony optimization , 2016, Expert Syst. Appl..

[29]  Huan Liu,et al.  Feature selection for clustering - a filter solution , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[30]  Yixin Chen,et al.  Efficient ant colony optimization for image feature selection , 2013, Signal Process..

[31]  Khuat Thanh Tung,et al.  Optimizing Parameters of Software Effort Estimation Models using Directed Artificial Bee Colony Algorithm , 2016, Informatica.

[32]  Simon Fong,et al.  Feature Selection in Life Science Classification: Metaheuristic Swarm Search , 2014, IT Professional.

[33]  Thomas A. Runkler,et al.  Multi-Criteria Ant Feature Selection Using Fuzzy Classifiers , 2009 .