Integration of classifier diversity measures for feature selection-based classifier ensemble reduction

A classifier ensemble combines a set of individual classifier’s predictions to produce more accurate results than that of any single classifier system. However, one classifier ensemble with too many classifiers may consume a large amount of computational time. This paper proposes a new ensemble subset evaluation method that integrates classifier diversity measures into a novel classifier ensemble reduction framework. The framework converts the ensemble reduction into an optimization problem and uses the harmony search algorithm to find the optimized classifier ensemble. Both pairwise and non-pairwise diversity measure algorithms are applied by the subset evaluation method. For the pairwise diversity measure, three conventional diversity algorithms and one new diversity measure method are used to calculate the diversity’s merits. For the non-pairwise diversity measure, three classical algorithms are used. The proposed subset evaluation methods are demonstrated by the experimental data. In comparison with other classifier ensemble methods, the method implemented by the measurement of the interrater agreement exhibits a high accuracy prediction rate against the current ensembles’ performance. In addition, the framework with the new diversity measure achieves relatively good performance with less computational time.

[1]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[2]  Xin Gu,et al.  Response model based on weighted bagging GMDH , 2014, Soft Computing.

[3]  Derek Partridge,et al.  Software Diversity: Practical Statistics for Its Measurement and Exploitation | Draft Currently under Revision , 1996 .

[4]  Xin Yao,et al.  Diversity creation methods: a survey and categorisation , 2004, Inf. Fusion.

[5]  Ian Witten,et al.  Data Mining , 2000 .

[6]  Min Jiang,et al.  A reduced classifier ensemble approach to human gesture classification for robotic Chinese handwriting , 2014, 2014 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[7]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[8]  Ling Zheng,et al.  Self-adjusting harmony search-based feature selection , 2014, Soft Computing.

[9]  Ron Kohavi,et al.  Bias Plus Variance Decomposition for Zero-One Loss Functions , 1996, ICML.

[10]  Indrajit Mandal,et al.  A novel approach for predicting DNA splice junctions using hybrid machine learning algorithms , 2015, Soft Comput..

[11]  Belhadri Messabih,et al.  Effect of simple ensemble methods on protein secondary structure prediction , 2015, Soft Comput..

[12]  Qiang Shen,et al.  Feature Selection With Harmony Search , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Alois Knoll,et al.  Action recognition using ensemble weighted multi-instance learning , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[14]  B Sun Diversity measures in ensemble learning , 2014 .

[15]  Yu-Jun Zheng,et al.  Biogeographic harmony search for emergency air transportation , 2016, Soft Comput..

[16]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[17]  Wenjia Wang,et al.  A Novel Ensemble of Distance Measures for Feature Evaluation: Application to Sonar Imagery , 2011, IDEAL.

[18]  Witold Pedrycz,et al.  A Tabu–Harmony Search-Based Approach to Fuzzy Linear Regression , 2011, IEEE Transactions on Fuzzy Systems.

[19]  Lawrence O. Hall,et al.  Ensemble diversity measures and their application to thinning , 2004, Inf. Fusion.

[20]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[21]  Zong Woo Geem,et al.  Recent Advances In Harmony Search Algorithm , 2010, Recent Advances In Harmony Search Algorithm.

[22]  Min Jiang,et al.  Integrate classifier diversity evaluation to feature selection based classifier ensemble reduction , 2014, 2014 14th UK Workshop on Computational Intelligence (UKCI).

[23]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[24]  Xin Yao,et al.  An analysis of diversity measures , 2006, Machine Learning.

[25]  Josef Kittler,et al.  Multilabel classification using heterogeneous ensemble of multi-label classifiers , 2012, Pattern Recognit. Lett..

[26]  Fabio Roli,et al.  Design of effective neural network ensembles for image classification purposes , 2001, Image Vis. Comput..

[27]  José Salvador Sánchez,et al.  Two-level classifier ensembles for credit risk assessment , 2012, Expert Syst. Appl..

[28]  João Paulo Papa,et al.  A novel algorithm for feature selection using Harmony Search and its application for non-technical losses detection , 2011, Comput. Electr. Eng..

[29]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[30]  Pan Su,et al.  A hierarchical fuzzy cluster ensemble approach and its application to big data clustering , 2015, J. Intell. Fuzzy Syst..

[31]  Fei Chao,et al.  Feature Selection Inspired Classifier Ensemble Reduction , 2014, IEEE Transactions on Cybernetics.

[32]  Jakub Wroblewski,et al.  Ensembles of Classifiers Based on Approximate Reducts , 2001, Fundam. Informaticae.

[33]  Padraig Cunningham,et al.  Diversity versus Quality in Classification Ensembles Based on Feature Selection , 2000, ECML.

[34]  Loris Nanni,et al.  Ensemblator: An ensemble of classifiers for reliable classification of biological data , 2007, Pattern Recognit. Lett..