Feature Selection for Classification Using an Ant Colony System

Many applications, such as pattern recognition, require selecting a subset of the input features that can represent the whole feature set. The aim of feature selection is to remove irrelevant or redundant features while keeping the most informative ones. This paper presents an ant colony system approach to feature selection for classification. The proposed algorithm was tested on artificial and real-world datasets. Across all datasets used, the results are promising in terms of both classifier accuracy and the number of selected features. They also compare favorably with other results available in the literature.
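The abstract does not spell out the algorithm, so the following is only a minimal sketch of how an ant colony system could be applied to feature selection; it is not the authors' method. It assumes one pheromone value per feature, a fixed subset size, no heuristic term, and a user-supplied `score` function returning a value in [0, 1] (for example, cross-validated classifier accuracy). The function name `acs_feature_selection`, all parameter names and defaults, and the toy scorer are hypothetical.

```python
import random

def acs_feature_selection(n_features, score, n_ants=10, n_iters=30,
                          subset_size=3, q0=0.9, rho=0.1, tau0=0.1, seed=0):
    """Illustrative ACS-style search over feature subsets (assumptions above)."""
    rng = random.Random(seed)
    tau = [tau0] * n_features              # one pheromone value per feature
    best_subset, best_score = None, float("-inf")
    for _ in range(n_iters):
        for _ in range(n_ants):
            subset, candidates = [], list(range(n_features))
            while candidates and len(subset) < subset_size:
                if rng.random() < q0:
                    # Exploitation: take the candidate with the most pheromone.
                    choice = max(candidates, key=lambda f: tau[f])
                else:
                    # Exploration: sample in proportion to pheromone.
                    weights = [tau[f] for f in candidates]
                    choice = rng.choices(candidates, weights=weights)[0]
                subset.append(choice)
                candidates.remove(choice)
                # Local update: pull the chosen feature back toward tau0.
                tau[choice] = (1 - rho) * tau[choice] + rho * tau0
            s = score(subset)
            if s > best_score:
                best_subset, best_score = sorted(subset), s
        # Global update: only the best-so-far subset is reinforced.
        for f in best_subset:
            tau[f] = (1 - rho) * tau[f] + rho * best_score
    return best_subset, best_score

# Toy scorer (hypothetical): fraction of "informative" features recovered.
INFORMATIVE = {2, 5, 7}

def toy_score(subset):
    return len(set(subset) & INFORMATIVE) / len(INFORMATIVE)

subset, quality = acs_feature_selection(10, toy_score)
print(subset, quality)
```

In practice `toy_score` would be replaced by a classifier evaluation on the candidate features, which dominates the running time; the values of `q0` and `rho` control the balance between exploiting the current best subset and exploring new ones.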
