Comparative analysis of data mining methods for bankruptcy prediction

A great deal of research has been devoted to prediction of bankruptcy, to include application of data mining. Neural networks, support vector machines, and other algorithms often fit data well, but because of lack of comprehensibility, they are considered black box technologies. Conversely, decision trees are more comprehensible by human users. However, sometimes far too many rules result in another form of incomprehensibility. The number of rules obtained from decision tree algorithms can be controlled to some degree through setting different minimum support levels. This study applies a variety of data mining tools to bankruptcy data, with the purpose of comparing accuracy and number of rules. For this data, decision trees were found to be relatively more accurate compared to neural networks and support vector machines, but there were more rule nodes than desired. Adjustment of minimum support yielded more tractable rule sets.

[1]  Kyung-shik Shin,et al.  An application of support vector machines in bankruptcy prediction model , 2005, Expert Syst. Appl..

[2]  Daniel Berg Bankruptcy Prediction by Generalized Additive Models , 2006 .

[3]  Yi-Chung Hu,et al.  Comparing four bankruptcy prediction models: Logit, quadratic interval logit, neural and fuzzy neural networks , 2010, Expert Syst. Appl..

[4]  Michael Y. Hu,et al.  Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis , 1999, Eur. J. Oper. Res..

[5]  P. Pendharkar,et al.  Misclassification cost minimizing fitness functions for genetic algorithm-based artificial neural network classifiers , 2009, J. Oper. Res. Soc..

[6]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[7]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[8]  Ian Witten,et al.  Data Mining , 2000 .

[9]  Young U. Ryu,et al.  Firm bankruptcy prediction: experimental comparison of isotonic separation and other classification approaches , 2005, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[10]  Athanasios V. Vasilakos,et al.  Comparison of computational intelligence based classification techniques for remotely sensed optical image classification , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Andrew P. Bradley,et al.  Rule extraction from support vector machines: A review , 2010, Neurocomputing.

[12]  Chris Barry,et al.  Fuzzy, Distributed, Instance Counting, and Default ARTMAP Neural Networks for Financial Diagnosis , 2010, Int. J. Inf. Technol. Decis. Mak..

[13]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[14]  Sungbin Cho,et al.  A hybrid approach based on the combination of variable selection using decision trees and case-based reasoning using the Mahalanobis distance: For bankruptcy prediction , 2010, Expert Syst. Appl..

[15]  Ramesh Sharda,et al.  Bankruptcy prediction using neural networks , 1994, Decis. Support Syst..

[16]  Emil Levi,et al.  Identification of complex systems based on neural and Takagi-Sugeno fuzzy model , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[17]  Chih-Fong Tsai,et al.  Combining multiple feature selection methods for stock prediction: Union, intersection, and multi-intersection approaches , 2010, Decis. Support Syst..

[18]  Gündüz Ulusoy,et al.  A new perspective on the competitiveness of nations , 2008 .

[19]  Monica Lam,et al.  Neural network techniques for financial performance prediction: integrating fundamental and technical analysis , 2004, Decis. Support Syst..

[20]  Wei Sun,et al.  Neural Network Based Self-Learning Control Strategy for Electronic Throttle Valve , 2010, IEEE Transactions on Vehicular Technology.

[21]  Marjorie B. Platt,et al.  Probabilistic Neural Networks in Bankruptcy Prediction , 1999 .

[22]  Armando Vieira,et al.  Improving bankruptcy prediction with Hidden Layer Learning Vector Quantization , 2006 .

[23]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[24]  Bart Baesens,et al.  Predicting going concern opinion with data mining , 2008, Decis. Support Syst..

[25]  Namsik Chang,et al.  Dynamics of Modeling in Data Mining: Interpretive Approach to Bankruptcy Prediction , 1999, J. Manag. Inf. Syst..

[26]  Russell L. Purvis,et al.  Forecasting the NYSE composite index with technical analysis, pattern recognizer, neural network, and genetic algorithm: a case study in romantic decision support , 2002, Decis. Support Syst..

[27]  Xiaoding Wang,et al.  Credit risk measurement and early warning of SMEs: An empirical study of listed SMEs in China , 2010, Decis. Support Syst..

[28]  David A. Elizondo,et al.  Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks , 2008, Decis. Support Syst..

[29]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[30]  Soushan Wu,et al.  Credit rating analysis with support vector machines and neural networks: a market comparative study , 2004, Decis. Support Syst..

[31]  Mark J Funt Financial ratios. , 2009, Pennsylvania dental journal.

[32]  Amir F. Atiya,et al.  Bankruptcy prediction for credit risk using neural networks: A survey and new results , 2001, IEEE Trans. Neural Networks.

[33]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  An Approach to Explain Neural Networks using Symbolic Algorithms , 2002, Int. J. Comput. Intell. Appl..

[34]  Witold Pedrycz,et al.  Heterogeneous fuzzy logic networks: fundamentals and development studies , 2004, IEEE Transactions on Neural Networks.

[35]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[36]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[37]  R Risser,et al.  Using non-linear methods to investigate the criterion validity of traffic-psychological test batteries. , 2008, Accident; analysis and prevention.

[38]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[39]  Huimin Zhao,et al.  Incorporating domain knowledge into data mining classifiers: An application in indirect lending , 2008, Decis. Support Syst..

[40]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[41]  Huimin Zhao,et al.  A multi-objective genetic programming approach to developing Pareto optimal decision trees , 2007, Decis. Support Syst..

[42]  Kang Li,et al.  System oriented neural networks -- problem formulation, methodology and application , 2006, Int. J. Pattern Recognit. Artif. Intell..

[43]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[44]  W. Beaver Financial Ratios As Predictors Of Failure , 1966 .

[45]  Charles T. Stivason,et al.  Improving prediction of neural networks: a study of tow financial prediction tasks , 2004, Adv. Decis. Sci..

[46]  Zhi-Hua Zhou,et al.  Extracting symbolic rules from trained neural network ensembles , 2003, AI Commun..

[47]  Toshiyuki Sueyoshi,et al.  An agent-based decision support system for wholesale electricity market , 2008, Decis. Support Syst..

[48]  David West,et al.  Neural network ensemble strategies for financial decision applications , 2005, Comput. Oper. Res..

[49]  Dursun Delen,et al.  A comparative analysis of machine learning techniques for student retention management , 2010, Decis. Support Syst..

[50]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[51]  Christopher J. Harris,et al.  Structure selection of neurofuzzy networks based on support vector regression , 2002, Int. J. Syst. Sci..

[52]  Chih-Fong Tsai Financial decision support using neural networks and support vector machines , 2008, Expert Syst. J. Knowl. Eng..