Cost-sensitive business failure prediction when misclassification costs are uncertain: A heterogeneous ensemble selection approach

Abstract In order to assess risks associated with establishing relationships with corporate partners such as clients, suppliers, debtors or contractors, decision makers often turn to business failure prediction models. While a large body of literature has focused on optimizing and evaluating novel methods in terms of classification accuracy, recent research has acknowledged the existence of asymmetric misclassification costs associated with prediction errors and thus, advocates the usage of alternative evaluation metrics. However, these papers often assume a misclassification cost matrix to be known and fixed for both the training and the evaluation of models, whereas in reality these costs are often uncertain. This paper presents a methodological framework based upon heterogeneous ensemble selection and multi-objective optimization for cost-sensitive business failure prediction that accommodates uncertainty at the level of misclassification costs. The framework assumes unknown costs during model training and accommodates varying degrees of uncertainty during model deployment. Specifically, NSGA-II is deployed to optimize cost space resulting in a set of pareto-optimal ensemble classifiers where every learner minimizes expected misclassification cost for a specific range of cost ratios. An extensive set of experiments evaluates the method on multiple data sets and for different scenarios that reflect the extent to which cost ratios are known during model deployment. Results clearly demonstrate the ability of our method to minimize cost under the absence of exact knowledge of misclassification costs.

[1]  Peter A. Flach,et al.  Brier Curves: a New Cost-Based Visualisation of Classifier Performance , 2011, ICML.

[2]  Grigorios Tsoumakas,et al.  Pruning an ensemble of classifiers via reinforcement learning , 2009, Neurocomputing.

[3]  Halil Ibrahim Erdal,et al.  Forecasting Bank Failure: Base Learners, Ensembles and Hybrid Ensembles , 2017 .

[4]  Yves Lecourtier,et al.  A multi-model selection framework for unknown and/or evolutive misclassification cost problems , 2010, Pattern Recognit..

[5]  Pedro M. Domingos MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[6]  Daniel Hernández-Lobato,et al.  An Analysis of Ensemble Pruning Techniques Based on Ordered Aggregation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  Gary M. Weiss Mining with rarity: a unifying framework , 2004, SKDD.

[9]  Antanas Verikas,et al.  Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: a survey , 2010, Soft Comput..

[10]  Nitesh V. Chawla,et al.  Evolutionary Ensemble Creation and Thinning , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[11]  P. Pendharkar,et al.  Misclassification cost minimizing fitness functions for genetic algorithm-based artificial neural network classifiers , 2009, J. Oper. Res. Soc..

[12]  Hui Li,et al.  Listed companies' financial distress prediction based on weighted majority voting combination of multiple classifiers , 2008, Expert Syst. Appl..

[13]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[14]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[15]  Francisco Javier García Castellano,et al.  Expert Systems With Applications , 2022 .

[16]  Guido Dedene,et al.  Cost-sensitive learning and decision making revisited , 2005, Eur. J. Oper. Res..

[17]  Nitesh V. Chawla,et al.  Optimizing Classifiers for Hypothetical Scenarios , 2015, PAKDD.

[18]  Rich Caruana,et al.  Ensemble selection from libraries of models , 2004, ICML.

[19]  Rich Caruana,et al.  Getting the Most Out of Ensemble Selection , 2006, Sixth International Conference on Data Mining (ICDM'06).

[20]  Zhi-Hua Zhou,et al.  Learning with cost intervals , 2010, KDD '10.

[21]  J. Friedman Stochastic gradient boosting , 2002 .

[22]  Elizabeth Tashjian,et al.  Spreading the Misery? Sources of Bankruptcy Spillover in the Supply Chain , 2015 .

[23]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[24]  Yufei Xia,et al.  A novel heterogeneous ensemble credit scoring model based on bstacking approach , 2018, Expert Syst. Appl..

[25]  Niall M. Adams,et al.  Comparing classifiers when the misallocation costs are uncertain , 1999, Pattern Recognit..

[26]  Constantin Zopounidis,et al.  A survey of business failures with an emphasis on prediction methods and industrial applications , 1996 .

[27]  Juan José Rodríguez Diez,et al.  Rotation Forest: A New Classifier Ensemble Method , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Fang Liu,et al.  3D fast convex-hull-based evolutionary multiobjective optimization algorithm , 2018, Appl. Soft Comput..

[29]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[31]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[32]  Licheng Jiao,et al.  Multiobjective optimization of classifiers by means of 3D convex-hull-based evolutionary algorithms , 2014, Inf. Sci..

[33]  Daniel Martin,et al.  Early warning of bank failure: A logit regression approach , 1977 .

[34]  Shanlin Yang,et al.  Heterogeneous Ensemble for Default Prediction of Peer-to-Peer Lending in China , 2018, IEEE Access.

[35]  Marek Kurzynski,et al.  A probabilistic model of classifier competence for dynamic ensemble selection , 2011, Pattern Recognit..

[36]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[37]  Gleb Lanine,et al.  Failure prediction in the Russian bank sector with logit and trait recognition models , 2006, Expert Syst. Appl..

[38]  Ingoo Han,et al.  Bankruptcy prediction using case-based reasoning, neural networks, and discriminant analysis , 1997 .

[39]  Pedro M. Domingos,et al.  Tree Induction for Probability-Based Ranking , 2003, Machine Learning.

[40]  Tom Heskes,et al.  Clustering ensembles of neural network models , 2003, Neural Networks.

[41]  Nojun Kwak,et al.  Feature extraction for classification problems and its application to face recognition , 2008, Pattern Recognit..

[42]  Xingyi Zhang,et al.  Multi-objective evolutionary algorithm for optimizing the partial area under the ROC curve , 2019, Knowledge-Based Systems.

[43]  Terry Windeatt,et al.  Pruning of Error Correcting Output Codes by optimization of accuracy–diversity trade off , 2014, Machine Learning.

[44]  Parag C. Pendharkar,et al.  A threshold-varying artificial neural network approach for classification and its application to bankruptcy prediction problem , 2005, Comput. Oper. Res..

[45]  E. F. Brigham,et al.  Financial Management: Theory and Practice , 1990 .

[46]  Christian Gagné,et al.  Multi-objective evolutionary optimization for generating ensembles of classifiers in the ROC space , 2012, GECCO '12.

[47]  P.D. Yoo,et al.  A Semiparametric Model Approach to Financial Bankruptcy Prediction , 2006, 2006 IEEE International Conference on Engineering of Intelligent Systems.

[48]  Denis Larocque,et al.  Discrete-time survival trees and forests with time-varying covariates , 2009 .

[49]  Sharon A. DeVaney,et al.  Predicting Business Failure of Retail Firms: An Analysis Using Mixed Industry Models , 1998 .

[50]  Sergio Davalos,et al.  Designing an If-Then Rules Based Ensemble of Heterogeneous Bankruptcy Classifiers: A Genetic Algorithm Approach , 2014, Intell. Syst. Account. Finance Manag..

[51]  Petr Hájek,et al.  Two-stage consumer credit risk modelling using heterogeneous ensemble learning , 2019, Decis. Support Syst..

[52]  H. Frydman,et al.  Introducing Recursive Partitioning for Financial Classification: The Case of Financial Distress , 1985 .

[53]  Chih-Fong Tsai,et al.  Feature selection in bankruptcy prediction , 2009, Knowl. Based Syst..

[54]  Robert Sabourin,et al.  From dynamic classifier selection to dynamic ensemble selection , 2008, Pattern Recognit..

[55]  Ignacio Olmeda,et al.  Hybrid Classifiers for Financial Multicriteria Decision Making: The Case of Bankruptcy Prediction , 1997 .

[56]  Constantin Zopounidis,et al.  Corporate failure prediction in the European energy sector: A multicriteria approach and the effect of country characteristics , 2017, Eur. J. Oper. Res..

[57]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[58]  Ning Chen,et al.  Comparative study of classifier ensembles for cost-sensitive credit risk assessment , 2015, Intell. Data Anal..

[59]  R. Jarrow,et al.  Bankruptcy Prediction With Industry Effects , 2004 .

[60]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[61]  Hui Li,et al.  Principal component case-based reasoning ensemble for business failure prediction , 2011, Inf. Manag..

[62]  Luiz Eduardo Soares de Oliveira,et al.  Dynamic selection of classifiers - A comprehensive review , 2014, Pattern Recognit..

[63]  Qinghua Huang,et al.  Predicting financial distress and corporate failure: A review from the state-of-the-art definitions, modeling, sampling, and featuring approaches , 2014, Knowl. Based Syst..

[64]  Jianjun Li A two-step rejection procedure for testing multiple hypotheses , 2008 .

[65]  Juan José Rodríguez Diez,et al.  An Experimental Study on Rotation Forest Ensembles , 2007, MCS.

[66]  Efstathios Kirkos Assessing methodologies for intelligent bankruptcy prediction , 2012, Artificial Intelligence Review.

[67]  James A. Ohlson FINANCIAL RATIOS AND THE PROBABILISTIC PREDICTION OF BANKRUPTCY , 1980 .

[68]  J. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research , 2015, Eur. J. Oper. Res..

[69]  Vadlamani Ravi,et al.  Soft computing system for bank performance prediction , 2008, Appl. Soft Comput..

[70]  Bianca Zadrozny,et al.  Learning and making decisions when costs and probabilities are both unknown , 2001, KDD '01.

[71]  Zhongsheng Hua,et al.  Predicting corporate financial distress based on integration of support vector machine and logistic regression , 2007, Expert Syst. Appl..

[72]  Thomas G. Dietterich,et al.  Pruning Adaptive Boosting , 1997, ICML.

[73]  Robert C. Holte,et al.  Cost curves: An improved method for visualizing classifier performance , 2006, Machine Learning.

[74]  Prakash P. Shenoy,et al.  Using Bayesian networks for bankruptcy prediction: Some methodological issues , 2007, Eur. J. Oper. Res..

[75]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[76]  Salvatore J. Stolfo,et al.  AdaCost: Misclassification Cost-Sensitive Boosting , 1999, ICML.

[77]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[78]  Vineet Agarwal,et al.  Are hazard models superior to traditional bankruptcy prediction approaches? A comprehensive test , 2014 .

[79]  Sofie Balcaen,et al.  35 years of studies on business failure: an overview of the classic statistical methodologies and their related problems , 2006 .

[80]  Constantin Zopounidis,et al.  Model combination for credit risk assessment: A stacked generalization approach , 2007, Ann. Oper. Res..

[81]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[82]  Sally I. McClean,et al.  A data mining approach to the prediction of corporate failure , 2001, Knowl. Based Syst..

[83]  Francisco Herrera,et al.  Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power , 2010, Inf. Sci..

[84]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[85]  Sotiris B. Kotsiantis,et al.  Selective costing voting for bankruptcy prediction , 2007, Int. J. Knowl. Based Intell. Eng. Syst..

[86]  Hui Li,et al.  On performance of case-based reasoning in Chinese business failure prediction from sensitivity, specificity, positive and negative values , 2011, Appl. Soft Comput..

[87]  Ning Chen,et al.  A Consensus Approach for Combining Multiple Classifiers in Cost-Sensitive Bankruptcy Prediction , 2013, ICANNGA.

[88]  Bianca Zadrozny,et al.  Transforming classifier scores into accurate multiclass probability estimates , 2002, KDD.

[89]  Peter A. Flach,et al.  A Unified View of Performance Metrics: Translating Threshold Choice into Expected Classification Loss C` Esar Ferri , 2012 .

[90]  Robert Sabourin,et al.  A dynamic overproduce-and-choose strategy for the selection of classifier ensembles , 2008, Pattern Recognit..

[91]  Marek Kurzynski,et al.  A measure of competence based on random classification for dynamic ensemble selection , 2012, Inf. Fusion.