Obtaining Pareto Front in Instance Selection with Ensembles and Populations

Collective computational intelligence can be applied in several ways, for example by reaching a joint decision with a bagging ensemble or by searching for solutions with a multi-objective evolutionary algorithm. In this paper we examine and compare these two approaches applied to instance selection, using each to construct the Pareto front of selected subsets, where the two objectives are classification accuracy and data size reduction. The bagging ensemble members are DROP5 algorithms; the evolutionary algorithm is based on NSGA-II. We find that the evolutionary approach is faster (contrary to popular belief) and usually provides better-quality solutions, with some exceptions where the outcome of the DROP5 ensemble is better.
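
To make the notion of a Pareto front over the two objectives concrete, the following is a minimal sketch and not the paper's implementation. It assumes each candidate subset has already been scored as an (accuracy, reduction) pair with both values to be maximized; the names `dominates`, `pareto_front`, and the sample scores are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

# Hypothetical representation: (classification accuracy, data size reduction),
# both in [0, 1] and both to be maximized.
Point = Tuple[float, float]


def dominates(a: Point, b: Point) -> bool:
    """Return True if a is no worse than b in both objectives and strictly better in at least one."""
    return a[0] >= b[0] and a[1] >= b[1] and (a[0] > b[0] or a[1] > b[1])


def pareto_front(points: List[Point]) -> List[Point]:
    """Return the non-dominated points, deduplicated and sorted by increasing reduction."""
    front = [p for p in points if not any(dominates(q, p) for q in points)]
    return sorted(set(front), key=lambda p: p[1])


# Illustrative scores only, not results from the paper.
candidates = [(0.92, 0.50), (0.90, 0.70), (0.85, 0.65), (0.80, 0.90), (0.92, 0.40)]
front = pareto_front(candidates)
print(front)
```

In this made-up example, (0.85, 0.65) is dominated by (0.90, 0.70) and (0.92, 0.40) by (0.92, 0.50), so three candidates remain on the front. The quadratic pairwise check is adequate for small candidate sets; NSGA-II itself uses a faster non-dominated sorting procedure.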
