Multiobjective optimization of classifiers by means of 3D convex-hull-based evolutionary algorithms

The receiver operating characteristic (ROC) and detection error tradeoff (DET) curves are frequently used in the machine learning community to analyze the performance of binary classifiers. Recently, the convex-hull-based multiobjective genetic programming algorithm was proposed and successfully applied to maximize the convex hull area for binary classification problems by minimizing false positive rate and maximizing true positive rate at the same time using indicator-based evolutionary algorithms. The area under the ROC curve was used for the performance assessment and to guide the search. Here we extend this research and propose two major advancements: Firstly we formulate the algorithm in detection error tradeoff space, minimizing false positives and false negatives, with the advantage that misclassification cost tradeoff can be assessed directly. Secondly, we add complexity as an objective function, which gives rise to a 3D objective space (as opposed to a 2D previous ROC space). A domain specific performance indicator for 3D Pareto front approximations, the volume above DET surface, is introduced, and used to guide the indicator-based evolutionary algorithm to find optimal approximation sets. We assess the performance of the new algorithm on designed theoretical problems with different geometries of Pareto fronts and DET surfaces, and two application-oriented benchmarks: (1) Designing spam filters with low numbers of false rejects, false accepts, and low computational cost using rule ensembles, and (2) finding sparse neural networks for binary classification of test data from the UCI machine learning benchmark. The results show a high performance of the new algorithm as compared to conventional methods for multicriteria optimization.

[1]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[2]  Qingfu Zhang,et al.  Adaptive Replacement Strategies for MOEA/D , 2016, IEEE Transactions on Cybernetics.

[3]  Tom Fawcett,et al.  Using rule sets to maximize ROC performance , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[4]  Yves Lecourtier,et al.  A multi-model selection framework for unknown and/or evolutive misclassification cost problems , 2010, Pattern Recognit..

[5]  Antonio J. Nebro,et al.  jMetal: A Java framework for multi-objective optimization , 2011, Adv. Eng. Softw..

[6]  Beatrice Lazzerini,et al.  A new multi-objective evolutionary algorithm based on convex hull for binary classifier optimization , 2007, 2007 IEEE Congress on Evolutionary Computation.

[7]  øöö Blockinøø Well-Trained PETs : Improving Probability Estimation , 2000 .

[8]  Francisco Herrera,et al.  On the use of MapReduce for imbalanced big data using Random Forest , 2014, Inf. Sci..

[9]  Robert P. W. Duin,et al.  A simplified extension of the Area under the ROC to the multiclass domain , 2006 .

[10]  Iryna Yevseyeva,et al.  Optimization of Anti-Spam Systems with Multiobjective Evolutionary Algorithms , 2013, Inf. Resour. Manag. J..

[11]  Iryna Yevseyeva,et al.  Optimising anti-spam filters with evolutionary algorithms , 2013, Expert Syst. Appl..

[12]  Christian Igel,et al.  Operator adaptation in evolutionary computation and its application to structure optimization of neural networks , 2003, Neurocomputing.

[13]  Enrique Alba,et al.  The jMetal framework for multi-objective optimization: Design and architecture , 2010, IEEE Congress on Evolutionary Computation.

[14]  Huimin Zhao,et al.  A multi-objective genetic programming approach to developing Pareto optimal decision trees , 2007, Decis. Support Syst..

[15]  Peter H. Morgan Differential evolution and sparse neural networks , 2008, Expert Syst. J. Knowl. Eng..

[16]  David J. Hand,et al.  A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems , 2001, Machine Learning.

[17]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[18]  Robert P. W. Duin,et al.  Efficient Multiclass ROC Approximation by Decomposition via Confusion Matrix Perturbation Analysis , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Yen-Liang Chen,et al.  Time-constrained cost-sensitive decision tree induction , 2016, Inf. Sci..

[20]  Mark Johnston,et al.  Multi-Objective Genetic Programming for Classification with Unbalanced Data , 2009, Australasian Conference on Artificial Intelligence.

[21]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[22]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[23]  Ji Shan-Fan,et al.  The Multi-objective Differential Evolution Algorithm Based on Quick Convex Hull Algorithms , 2009, 2009 Fifth International Conference on Natural Computation.

[24]  Licheng Jiao,et al.  A Sparse Spectral Clustering Framework via Multiobjective Evolutionary Algorithm , 2016, IEEE Transactions on Evolutionary Computation.

[25]  Tom Fawcett PRIE: a system for generating rulelists to maximize ROC performance , 2008, Data Mining and Knowledge Discovery.

[26]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[27]  Robert M. Nishikawa,et al.  The hypervolume under the ROC hypersurface of "Near-Guessing" and "Near-Perfect" observers in N-class classification tasks , 2005, IEEE Transactions on Medical Imaging.

[28]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[29]  James P. Egan,et al.  Signal detection theory and ROC analysis , 1975 .

[30]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[31]  Nicola Beume,et al.  SMS-EMOA: Multiobjective selection based on dominated hypervolume , 2007, Eur. J. Oper. Res..

[32]  Mark Johnston,et al.  Evolving Diverse Ensembles Using Genetic Programming for Classification With Unbalanced Data , 2013, IEEE Transactions on Evolutionary Computation.

[33]  Bernhard Sendhoff,et al.  Pareto-Based Multiobjective Machine Learning: An Overview and Case Studies , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[34]  Jouni Lampinen,et al.  GDE3: the third evolution step of generalized differential evolution , 2005, 2005 IEEE Congress on Evolutionary Computation.

[35]  Robert E. Schapire,et al.  On reoptimizing multi-class classifiers , 2008, Machine Learning.

[36]  Xin Yao,et al.  An Evolutionary Multiobjective Approach to Sparse Reconstruction , 2014, IEEE Transactions on Evolutionary Computation.

[37]  Shan-Fan Ji,et al.  The Multi-objective Differential Evolution Algorithm Based on Quick Convex Hull Algorithms , 2009, ICNC.

[38]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[39]  Fang Liu,et al.  A novel selection evolutionary strategy for constrained optimization , 2013, Inf. Sci..

[40]  Ujjwal Maulik,et al.  A Survey of Multiobjective Evolutionary Algorithms for Data Mining: Part I , 2014, IEEE Transactions on Evolutionary Computation.

[41]  Jonathan E. Fieldsend,et al.  Multi-class ROC analysis from a multi-objective optimisation perspective , 2006, Pattern Recognit. Lett..

[42]  Antonio J. Nebro,et al.  On the Effect of Applying a Steady-State Selection Scheme in the Multi-Objective Genetic Algorithm NSGA-II , 2009, Nature-Inspired Algorithms for Optimisation.

[43]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[44]  Xin Yao,et al.  Multiobjective genetic programming for maximizing ROC performance , 2014, Neurocomputing.

[45]  G. Chapman,et al.  [Medical decision making]. , 1976, Lakartidningen.

[46]  José Hernández-Orallo,et al.  Volume under the ROC Surface for Multi-class Problems , 2003, ECML.

[47]  Alvin F. Martin,et al.  The DET curve in assessment of detection task performance , 1997, EUROSPEECH.

[48]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[49]  Jafar Rezaei,et al.  Convex hull ranking algorithm for multi-objective evolutionary algorithms , 2011, Sci. Iran..

[50]  Fang Liu,et al.  A modified objective function method with feasible-guiding strategy to solve constrained multi-objective optimization problems , 2014, Appl. Soft Comput..

[51]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[52]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[53]  R. Austria Declaration , 1987 .

[54]  Yaochu Jin,et al.  Evolutionary Multiobjective Image Feature Extraction in the Presence of Noise , 2015, IEEE Transactions on Cybernetics.

[55]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[56]  Xin Yao,et al.  Two_Arch2: An Improved Two-Archive Algorithm for Many-Objective Optimization , 2015, IEEE Transactions on Evolutionary Computation.

[57]  Witold Pedrycz,et al.  A variable reduction strategy for evolutionary algorithms handling equality constraints , 2015, Appl. Soft Comput..

[58]  Shlomo Yitzhaki,et al.  Relative Deprivation and the Gini Coefficient , 1979 .

[59]  Tianshi Chen,et al.  Towards Maximizing the Area Under the ROC Curve for Multi-Class Classification Problems , 2011, AAAI.

[60]  M. Anastasio,et al.  Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves , 1999, IEEE Transactions on Medical Imaging.

[61]  L. Bottou Stochastic Gradient Learning in Neural Networks , 1991 .

[62]  Bernhard Sendhoff,et al.  Generalization Improvement in Multi-Objective Learning , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[63]  Xin Yao,et al.  Evolving Neural Networks with Maximum AUC for Imbalanced Data Classification , 2010, HAIS.

[64]  Beatrice Lazzerini,et al.  Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets , 2010, Soft Comput..

[65]  Xin Yao,et al.  Convex Hull-Based Multiobjective Genetic Programming for Maximizing Receiver Operating Characteristic Performance , 2015, IEEE Transactions on Evolutionary Computation.

[66]  Peter A. Flach,et al.  ROCCER: An Algorithm for Rule Learning Based on ROC Analysis , 2005, IJCAI.

[67]  Rahul Gupta,et al.  Training ensemble of diverse classifiers on feature subsets , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[68]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[69]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[70]  M. Binder,et al.  Comparing Three-class Diagnostic Tests by Three-way ROC Analysis , 2000, Medical decision making : an international journal of the Society for Medical Decision Making.

[71]  Kurt Mehlhorn,et al.  Four Results on Randomized Incremental Constructions , 1992, Comput. Geom..

[72]  D. Mossman Three-way ROCs , 1999, Medical decision making : an international journal of the Society for Medical Decision Making.

[73]  Alvaro A. Cárdenas,et al.  Optimal ROC Curve for a Combination of Classifiers , 2007, NIPS.