A stochastic approach to handle knapsack problems in the creation of ensembles

Ensemble-based methods are popular approaches that increase the accuracy of a decision by aggregating the opinions of individual voters. The usual goal is to maximize accuracy; however, a natural limitation arises when each voter also carries an incremental cost. We therefore investigate how to create ensembles under an additional constraint on the total cost of the members. This task can be formulated as a knapsack problem in which the energy (the quantity to be maximized) is the ensemble accuracy produced by an aggregation rule. However, the commonly applied aggregation rules lead to a nonseparable energy function, which rules out standard solution tools such as dynamic programming. We introduce a novel stochastic approach that treats the energy as the joint probability function of the member accuracies. This knowledge can be efficiently incorporated into a stochastic search process as a stopping rule, since it provides the expected accuracy or, alternatively, the probability of finding more accurate ensembles. Experimental analyses of the created ensembles of pattern classifiers and object detectors confirm the efficiency of our approach. Moreover, we propose a novel stochastic search strategy that fits the energy better than general approaches such as simulated annealing.
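
To make the formulation concrete, the following is a minimal sketch and not the method of the paper. It assumes majority voting as the aggregation rule and statistically independent voters; under those assumptions the ensemble accuracy is the probability that a strict majority of the members is correct. The function names, the include-with-probability-0.5 sampling scheme, and the fixed iteration count are all hypothetical choices made for illustration. The sketch uses plain random sampling of feasible subsets and omits the stopping rule and the search strategy described above.

```python
import random


def majority_accuracy(accuracies):
    """Probability that a strict majority of independent voters is correct.

    Assumption: voters err independently, so the number of correct votes
    follows a Poisson binomial distribution.
    """
    # dist[k] = probability that exactly k of the voters seen so far are correct
    dist = [1.0]
    for p in accuracies:
        nxt = [0.0] * (len(dist) + 1)
        for k, q in enumerate(dist):
            nxt[k] += q * (1.0 - p)   # this voter is wrong
            nxt[k + 1] += q * p       # this voter is correct
        dist = nxt
    n = len(accuracies)
    return sum(dist[n // 2 + 1:])     # strictly more than half correct


def random_search(accuracies, costs, budget, iterations=2000, seed=0):
    """Sample random feasible subsets and keep the most accurate one.

    Hypothetical baseline: each candidate is included with probability 0.5
    if it still fits within the budget.
    """
    rng = random.Random(seed)
    indices = list(range(len(accuracies)))
    best_subset, best_energy = [], 0.0
    for _ in range(iterations):
        rng.shuffle(indices)
        subset, total = [], 0
        for i in indices:
            if total + costs[i] <= budget and rng.random() < 0.5:
                subset.append(i)
                total += costs[i]
        energy = majority_accuracy([accuracies[i] for i in subset])
        if energy > best_energy:
            best_subset, best_energy = sorted(subset), energy
    return best_subset, best_energy


if __name__ == "__main__":
    # Illustrative numbers only, not data from the paper.
    acc = [0.90, 0.85, 0.80, 0.75, 0.70]
    cost = [5, 4, 3, 2, 1]
    print(random_search(acc, cost, budget=9))
```

The energy here depends on the chosen members jointly rather than as a sum of per-member terms, which is the nonseparability that prevents a standard item-by-item knapsack recursion from applying directly.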
