Instance Selection with Ant Colony Optimization

Abstract Classification is a supervised learning task where a training set is used to construct a classifi- cation model, which is then used to predict the class of unforeseen test instances. It is often beneficial to use only a subset of the full training set to construct the classification model, and Instance Selection is the task of selecting the most appropriate subset of the training set. In many cases, the classification model induced from the reduced training set can have bet- ter predictive accuracy on test instances. ADR-Miner is a recently introduced Ant Colony Optimization algorithm for Instance Selection that aims to produce classification models with improved test set predictive accuracy. In this paper, we present an extension of ADR-Miner, where one classification algorithm is employed in the instance selection process, and potentially a different algorithm is employed in the final model construction phase. We evaluate perfor- mance using 37 UCI datasets, and we note the combinations of algorithms which produce the best results.

[1]  Alex Alves Freitas,et al.  Multiple pheromone types and other extensions to the Ant-Miner classification rule discovery algorithm , 2011, Swarm Intelligence.

[2]  Alex Alves Freitas,et al.  Comprehensible classification models: a position paper , 2014, SKDD.

[3]  Khalid M. Salama,et al.  ADR-Miner: An ant-based data reduction algorithm for classification , 2015, 2015 IEEE Congress on Evolutionary Computation (CEC).

[4]  Thomas Stützle,et al.  MAX-MIN Ant System , 2000, Future Gener. Comput. Syst..

[5]  Thomas Stützle,et al.  Ant Colony Optimization for Mixed-Variable Optimization Problems , 2014, IEEE Transactions on Evolutionary Computation.

[6]  I. Tomek An Experiment with the Edited Nearest-Neighbor Rule , 1976 .

[7]  Tony R. Martinez,et al.  Reduction Techniques for Instance-Based Learning Algorithms , 2000, Machine Learning.

[8]  Dennis L. Wilson,et al.  Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..

[9]  Khalid M. Salama,et al.  A Novel Ant Colony Algorithm for Building Neural Network Topologies , 2014, ANTS Conference.

[10]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[11]  Alex Alves Freitas,et al.  A New Sequential Covering Strategy for Inducing Classification Rules With Ant Colony Algorithms , 2013, IEEE Transactions on Evolutionary Computation.

[12]  D. Kibler,et al.  Instance-based learning algorithms , 2004, Machine Learning.

[13]  Alex Alves Freitas,et al.  Ant colony algorithms for constructing Bayesian multi-net classifiers , 2015, Intell. Data Anal..

[14]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[15]  Alex Alves Freitas,et al.  Learning Bayesian network classifiers using ant colony optimization , 2013, Swarm Intelligence.

[16]  Fernando E. B. Otero,et al.  Learning Multi-tree Classification Models with Ant Colony Optimization , 2014, IJCCI.

[17]  Chris Mellish,et al.  Advances in Instance Selection for Instance-Based Learning Algorithms , 2002, Data Mining and Knowledge Discovery.