Applying Ant Colony Optimization in configuring stacking ensemble

A stacking ensemble is a collective decision making system employing some strategy to combine the predictions of learned classifiers to generate its prediction on new instances. The early research has proved that a stacking ensemble is usually more accurate than any individual component classifiers both empirically and theoretically. Though many ensemble methods are proposed, it is still not an easy task to find a suitable ensemble configuration for a specific dataset. In some early works, the ensemble is selected manually according to the experience of the specialists. Metaheuristic methods can be alternative solutions to find configurations. Ant Colony Optimization (ACO) is one popular approach among the metaheuristics. In this paper, we propose a new ensemble construction method which applies ACO in the Stacking ensemble construction process to generate domain-specific configurations. Different kinds of local information are applied in facilitating the learning process. A number of experiments are performed to compare the proposed approach with some well-known ensemble methods on 18 benchmark datasets. The experiment results show that the new approach can generate better stacking ensembles.

[1]  Walter J. Gutjahr,et al.  ACO algorithms with guaranteed convergence to the optimal solution , 2002, Inf. Process. Lett..

[2]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[3]  Ricardo Aler,et al.  Heuristic Search-Based Stacking of Classifiers , 2002 .

[4]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[5]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[6]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Xindong Wu,et al.  Ensemble pruning via individual contribution ordering , 2010, KDD.

[9]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[10]  Alexander K. Seewald,et al.  How to Make Stacking Better and Faster While Also Taking Care of an Unknown Weakness , 2002, International Conference on Machine Learning.

[11]  Bernard Zenko,et al.  Stacking with Multi-response Model Trees , 2002, Multiple Classifier Systems.

[12]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[13]  Araceli Sanchis,et al.  Genetic Approach for Optimizing Ensembles of Classifiers , 2008, FLAIRS.

[14]  Christopher J. Merz,et al.  Using Correspondence Analysis to Combine Classifiers , 1999, Machine Learning.

[15]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.