A novel classifier ensemble approach for financial distress prediction

Financial distress prediction is very important to financial institutions who must be able to make critical decisions regarding customer loans. Bankruptcy prediction and credit scoring are the two main aspects considered in financial distress prediction. To assist in this determination, thereby lowering the risk borne by the financial institution, it is necessary to develop effective prediction models for prediction of the likelihood of bankruptcy and estimation of credit risk. A number of financial distress prediction models have been constructed, which utilize various machine learning techniques, such as single classifiers and classifier ensembles, but improving the prediction accuracy is the major research issue. In addition, aside from improving the prediction accuracy, there have been very few studies that specifically consider lowering the Type I error. In practice, Type I errors need to receive careful consideration during model construction because they can affect the cost to the financial institution. In this study, we introduce a classifier ensemble approach designed to reduce the misclassification cost. The outputs produced by multiple classifiers are combined by utilizing the unanimous voting (UV) method to find the final prediction result. Experimental results obtained based on four relevant datasets show that our UV ensemble approach outperforms the baseline single classifiers and classifier ensembles. Specifically, the UV ensemble not only provides relatively good prediction accuracy and minimizes Type I/II errors, but also produces the smallest misclassification cost.

[1]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[2]  Antanas Verikas,et al.  Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: a survey , 2010, Soft Comput..

[3]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[4]  Hui Li,et al.  Predicting business failure using classification and regression tree: An empirical comparison with popular classical statistical methods and top classification mining methods , 2010, Expert Syst. Appl..

[5]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[6]  Kyung-shik Shin,et al.  An application of support vector machines in bankruptcy prediction model , 2005, Expert Syst. Appl..

[7]  Jian Ma,et al.  A hybrid ensemble approach for enterprise credit risk assessment based on Support Vector Machine , 2012, Expert Syst. Appl..

[8]  E. N. Ozkan-Gunay,et al.  Prediction of bank failures in emerging financial markets: an ANN approach , 2007 .

[9]  Stephen C. H. Leung,et al.  Vertical bagging decision trees model for credit scoring , 2010, Expert Syst. Appl..

[10]  Melody Y. Kiang,et al.  Managerial Applications of Neural Networks: The Case of Bank Failure Predictions , 1992 .

[11]  W. Beaver Financial Ratios As Predictors Of Failure , 1966 .

[12]  Jonathan N. Crook,et al.  Recent developments in consumer credit risk assessment , 2007, Eur. J. Oper. Res..

[13]  Hongshik Ahn,et al.  A weight-adjusted voting algorithm for ensembles of classifiers , 2011 .

[14]  Jian Ma,et al.  A comparative assessment of ensemble learning for credit scoring , 2011, Expert Syst. Appl..

[15]  Shi Lei,et al.  Financial Data Mining Based on Support Vector Machines and Ensemble Learning , 2010, 2010 International Conference on Intelligent Computation Technology and Automation.

[16]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[17]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[18]  Sofie Balcaen,et al.  35 years of studies on business failure: an overview of the classic statistical methodologies and their related problems , 2006 .

[19]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[20]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[21]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[22]  Ping Yao Credit Scoring Using Ensemble Machine Learning , 2009, 2009 Ninth International Conference on Hybrid Intelligent Systems.

[23]  David West,et al.  Neural network credit scoring models , 2000, Comput. Oper. Res..

[24]  Xin Xu,et al.  Logistic Regression and Boosting for Labeled Bags of Instances , 2004, PAKDD.

[25]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[26]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[27]  Wei-Yang Lin,et al.  Machine Learning in Financial Crisis Prediction: A Survey , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[28]  Thomas G. Dietterich Machine-Learning Research , 1997, AI Mag..

[29]  Jinyong Yang,et al.  AdaBoost based bankruptcy forecasting of Korean construction companies , 2014, Appl. Soft Comput..

[30]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Ruibin Geng,et al.  Prediction of financial distress: An empirical study of listed Chinese companies using data mining , 2015, Eur. J. Oper. Res..

[32]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[33]  J. Efrim Boritz,et al.  Effectiveness of neural network types for prediction of business failure , 1995 .

[34]  James A. Ohlson FINANCIAL RATIOS AND THE PROBABILISTIC PREDICTION OF BANKRUPTCY , 1980 .

[35]  Gianluca Antonini,et al.  Subagging for credit scoring models , 2010, Eur. J. Oper. Res..

[36]  David West,et al.  Neural network ensemble strategies for financial decision applications , 2005, Comput. Oper. Res..

[37]  Herbert Lee,et al.  Bagging and the Bayesian Bootstrap , 2001, AISTATS.

[38]  Chih-Fong Tsai,et al.  Feature selection in bankruptcy prediction , 2009, Knowl. Based Syst..

[39]  Lin Ma,et al.  Empirical analysis of support vector machine ensemble classifiers , 2009, Expert Syst. Appl..

[40]  Lei Xi,et al.  Bagging of Artificial Neural Networks for Bankruptcy Prediction , 2009, 2009 International Conference on Information and Financial Engineering.

[41]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[42]  Randall S. Sexton,et al.  Improving Decision Effectiveness of Artificial Neural Networks: A Modified Genetic Algorithm Approach , 2003, Decis. Sci..