Forming classifier ensembles with multimodal evolutionary algorithms

Ensemble classifiers have become popular in recent years owing to their ability to produce robust predictive models that generalise well to previously unseen data. In principle, Evolutionary Algorithms (EAs) are well suited to ensemble generation since they result in a pool of trained classifiers. However, in practice they are infrequently used for this purpose. Current research trends in the EA community focus on relatively complex mechanisms for building ensembles, such as co-evolution and multi-objective optimisation. In this paper, we take a back-to-basics approach, studying whether conventional EAs, augmented with simple niching strategies, can be used to form accurate ensembles. We focus on crowding for this, considering both deterministic and probabilistic variants. We also consider the effect of different similarity measures. Our results suggest that simple niching methods can lead to accurate ensemble classifiers and that the choice of similarity measure is not a significant factor. A further study using heterogeneous classifier models within the population showed no added benefit.
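The two crowding variants named above differ only in how a child competes with the parent it most resembles. The following is a minimal sketch of that replacement step, not the authors' implementation. It assumes candidate solutions are encoded as equal-length tuples, that fitness is non-negative, and that Hamming distance stands in for whichever similarity measure is under study. All function names are hypothetical.

```python
import random


def hamming(a, b):
    """Illustrative similarity measure: number of differing positions."""
    return sum(x != y for x, y in zip(a, b))


def pair_up(p1, p2, c1, c2, distance):
    """Match each child with its more similar parent (smaller total distance)."""
    straight = distance(p1, c1) + distance(p2, c2)
    crossed = distance(p1, c2) + distance(p2, c1)
    if straight <= crossed:
        return [(p1, c1), (p2, c2)]
    return [(p1, c2), (p2, c1)]


def deterministic_winner(parent, child, fitness):
    """Deterministic crowding: the child survives only if it is at least as fit."""
    return child if fitness(child) >= fitness(parent) else parent


def probabilistic_winner(parent, child, fitness, rng):
    """Probabilistic crowding: the child survives with probability
    f(child) / (f(child) + f(parent)). Assumes non-negative fitness."""
    f_parent, f_child = fitness(parent), fitness(child)
    total = f_parent + f_child
    if total == 0:
        # Both fitnesses are zero, so pick either with equal probability.
        return rng.choice([parent, child])
    return child if rng.random() < f_child / total else parent


def crowding_step(p1, p2, c1, c2, fitness, distance, rng=None):
    """Return the two survivors of one family tournament.

    Passing an rng selects the probabilistic rule; otherwise the
    deterministic rule is used.
    """
    survivors = []
    for parent, child in pair_up(p1, p2, c1, c2, distance):
        if rng is None:
            survivors.append(deterministic_winner(parent, child, fitness))
        else:
            survivors.append(probabilistic_winner(parent, child, fitness, rng))
    return survivors
```

Because each child competes only against a similar parent, distinct regions of the search space can keep their own representatives, and the final population can then be treated as a pool of candidate ensemble members. In this sketch the similarity measure enters only through the `distance` argument, so a different measure can be swapped in without touching the replacement rules.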
