Bankruptcy prediction in firms with statistical and intelligent techniques and a comparison of evolutionary computation approaches

In this paper, we compare some traditional statistical methods for predicting financial distress to some more ''unconventional'' methods, such as decision tree classification, neural networks, and evolutionary computation techniques, using data collected from 200 Taiwan Stock Exchange Corporation (TSEC) listed companies. Empirical experiments were conducted using a total of 42 ratios including 33 financial, 8 non-financial and 1 combined macroeconomic index, using principle component analysis (PCA) to extract suitable variables. This paper makes four critical contributions: (1) with nearly 80% fewer financial ratios by the PCA method, the prediction performance is still able to provide highly-accurate forecasts of financial bankruptcy; (2) we show that traditional statistical methods are better able to handle large datasets without sacrificing prediction performance, while intelligent techniques achieve better performance with smaller datasets and would be adversely affected by huge datasets; (3) empirical results show that C5.0 and CART provide the best prediction performance for imminent bankruptcies; and (4) Support Vector Machines (SVMs) with evolutionary computation provide a good balance of high-accuracy short- and long-term performance predictions for healthy and distressed firms. Therefore, the experimental results show that the Particle Swarm Optimization (PSO) integrated with SVM (PSO-SVM) approach could be considered for predicting potential financial distress.

[1]  Toshiyuki Sueyoshi,et al.  DEA as a tool for bankruptcy assessment: A comparative study with logistic regression technique , 2009, Eur. J. Oper. Res..

[2]  N. Campbell,et al.  The Influence Function as an Aid in Outlier Detection in Discriminant Analysis , 1978 .

[3]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[4]  M. Zmijewski METHODOLOGICAL ISSUES RELATED TO THE ESTIMATION OF FINANCIAL DISTRESS PREDICTION MODELS , 1984 .

[5]  H. Frydman,et al.  Introducing Recursive Partitioning for Financial Classification: The Case of Financial Distress , 1985 .

[6]  Sheng-Fa Yuan,et al.  Fault diagnostics based on particle swarm optimisation and support vector machines , 2007 .

[7]  Zhongsheng Hua,et al.  Predicting corporate financial distress based on integration of support vector machine and logistic regression , 2007, Expert Syst. Appl..

[8]  Constantin Zopounidis,et al.  Multicriteria Decision Aid Methods for the Prediction of Business Failure , 1998 .

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[10]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[11]  Obeua S. Persons Using Financial Statement Data To Identify Factors Associated With Fraudulent Financial Reporting , 2011 .

[12]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[13]  Ömer Kaan Baykan,et al.  Predicting bank financial failures using neural networks, support vector machines and multivariate statistical methods: A comparative analysis in the sample of savings deposit insurance fund (SDIF) transferred banks in Turkey , 2009, Expert Syst. Appl..

[14]  H. Kaiser The Application of Electronic Computers to Factor Analysis , 1960 .

[15]  Sulin Pang,et al.  C5.0 Classification Algorithm and Application on Individual Credit Evaluation of Banks , 2009 .

[16]  David E. Booth,et al.  A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms , 2005, Expert Syst. Appl..

[17]  Olli Simula,et al.  Combining linear equalization and self-organizing adaptation in dynamic discrete-signal detection , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[18]  Paula A. Tkac,et al.  The Financial Crisis of 2008 in Fixed Income Markets , 2009 .

[19]  Simon Parsons,et al.  Soft computing: fuzzy logic, neural networks and distributed artificial intelligence by F. Aminzadeh and M. Jamshidi (Eds.), PTR Prentice Hall, Englewood Cliffs, NJ, pp 301, ISBN 0-13-146234-2 , 1996, Knowl. Eng. Rev..

[20]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[21]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[22]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory, Third Edition , 1989, Springer Series in Information Sciences.

[23]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[24]  W. Beaver Financial Ratios As Predictors Of Failure , 1966 .

[25]  Kyoung-jae Kim,et al.  Financial time series forecasting using support vector machines , 2003, Neurocomputing.

[26]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[27]  Lijuan Cao,et al.  Support vector machines experts for time series forecasting , 2003, Neurocomputing.

[28]  E. Laitinen,et al.  Bankruptcy prediction: Application of the Taylor's expansion in logistic regression , 2000 .

[29]  Yannis Manolopoulos,et al.  Data Mining techniques for the detection of fraudulent financial statements , 2007, Expert Syst. Appl..

[30]  Wei-Sen Chen,et al.  Using neural networks and data mining techniques for the financial distress prediction model , 2009, Expert Syst. Appl..

[31]  Richard G. Mathieu,et al.  Kanban setting through artificial intelligence: a comparative study of artificial neural networks and decision trees , 2000 .

[32]  William R. Kinney,et al.  Characteristics of firms correcting previously reported quarterly earnings , 1989 .

[33]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[34]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[35]  B. Tabachnick,et al.  Using Multivariate Statistics , 1983 .

[36]  Heien-Kun Chiang,et al.  Comparing extended classifier system and genetic programming for financial forecasting: an empirical study , 2007, Soft Comput..

[37]  J. Patell,et al.  The Experimental Design of Classification Models: An Application of Recursive Partitioning and Bootstrapping to Commercial Bank Loan Classifications , 1984 .

[38]  James A. Ohlson FINANCIAL RATIOS AND THE PROBABILISTIC PREDICTION OF BANKRUPTCY , 1980 .

[39]  Young S. Kwon,et al.  A practical approach to bankruptcy prediction for small businesses: Substituting the unavailable financial data for credit card sales information , 2010, Expert Syst. Appl..

[40]  Russell C. Eberhart,et al.  A new optimizer using particle swarm theory , 1995, MHS'95. Proceedings of the Sixth International Symposium on Micro Machine and Human Science.

[41]  Ingoo Han,et al.  Hybrid genetic algorithms and support vector machines for bankruptcy prediction , 2006, Expert Syst. Appl..

[42]  Sungbin Cho,et al.  A hybrid approach based on the combination of variable selection using decision trees and case-based reasoning using the Mahalanobis distance: For bankruptcy prediction , 2010, Expert Syst. Appl..

[43]  C. Zopounidis,et al.  Detecting falsified financial statements: a comparative study using multicriteria analysis and multivariate statistical techniques , 2002 .

[44]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[45]  Thomas E. McKee,et al.  Genetic programming and rough sets: A hybrid approach to bankruptcy classification , 2002, Eur. J. Oper. Res..

[46]  W. Fung The influence of observations on misclassification probability in multiple discriminant analysis , 1996 .

[47]  W. Loh,et al.  SPLIT SELECTION METHODS FOR CLASSIFICATION TREES , 1997 .

[48]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[49]  Kurt Fanning,et al.  Neural Network Detection of Management Fraud Using Published Financial Data , 1998 .