Empirical research of hybridizing principal component analysis with multivariate discriminant analysis and logistic regression for business failure prediction

Research highlights? The present research improves predictive performance of statistical models by integrating principal component analysis (PCA) with multivariate discriminant analysis and logistic regression in business failure prediction. ? Several filer approaches for feature selection are integrated with PCA, the best one of which is used in helping produce predictions. ? Empirical research indicates that the new approach is useful in improving performance of statistical models in business failure prediction. Predicting business failure of listed companies is a hot topic because of the emergency of financial crisis in developed countries recently. The two classical statistical methods of multivariate discriminant analysis (MDA) and logistic regression (logit) have taken a key role in the area of business failure prediction (BFP). However, they are frequently criticized for the relative low predictive performance compared with some newly developed predictive methods. The present research contributes to the construction of a new hybrid method for BFP by integrating principal component analysis (PCA) with MDA and logit to help them produce dominating predictive performance, a pilot study demonstrating the effectiveness of employing PCA with MDA and logit for BFP, even a pilot study demonstrating the usefulness of PCA in BFP. The hybrid method is implemented by using stepwise method of MDA, stepwise method of logit, and independent sample t test as the preprocessing procedure, on the basis of which PCA is further used to extract features for MDA and logit. The optimal feature set for a specific task of BFP is determined by an empirical means of splitting all available data for thirty times to obtain the optimal preprocessing procedure for PCA. In the empirical research, this study attempts to investigate whether or not the new hybrid method can produce dominating performance in short-term BFP of Chinese listed companies. For comparison, the most preferred filter approach of stepwise method of MDA for short-term BFP in China was employed to select optimal features for MDA and logit. Meanwhile, the use of PCA on all available data to extract features for MDA and logit to make predictions is also employed to make a comparison. Empirical results indicate that the new method, namely: the hybridization of PCA with MDA and logit, can produce dominating predictive performance in short-term BFP of Chinese listed companies. The preferred preprocessing procedure of PCA for short-term BFP of Chinese listed companies is stepwise method of MDA and corresponding performance of MDA and logit respectively outperforms all the other methods. Meanwhile, predictive performance of MDA and logit with the optimal PCA procedure is not significantly different.

[1]  Hui Li,et al.  Gaussian case-based reasoning for business failure prediction with empirical data in China , 2009, Inf. Sci..

[2]  Ingoo Han,et al.  Hybrid genetic algorithms and support vector machines for bankruptcy prediction , 2006, Expert Syst. Appl..

[3]  Hui Li,et al.  Ranking-order case-based reasoning for financial distress prediction , 2008, Knowl. Based Syst..

[4]  Sungbin Cho,et al.  A hybrid approach based on the combination of variable selection using decision trees and case-based reasoning using the Mahalanobis distance: For bankruptcy prediction , 2010, Expert Syst. Appl..

[5]  Marjorie B. Platt,et al.  Probabilistic Neural Networks in Bankruptcy Prediction , 1999 .

[6]  Chi-Bin Cheng,et al.  Financial distress prediction by a radial basis function network with logit analysis learning , 2006, Comput. Math. Appl..

[7]  David A. Hensher,et al.  Modelling corporate failure: A multinomial nested logit analysis for unordered outcomes , 2007 .

[8]  Peter J. A. Shaw,et al.  Multivariate Statistics for the Environmental Sciences , 2003 .

[9]  Yu-Chiang Hu,et al.  Retail default prediction by using sequential minimal optimization technique , 2009 .

[10]  Robert W. Ingram,et al.  Tests of the generalizability of Altman's bankruptcy prediction model , 2001 .

[11]  Sally I. McClean,et al.  A data mining approach to the prediction of corporate failure , 2001, Knowl. Based Syst..

[12]  A. Saunders,et al.  Credit risk measurement: Developments over the last 20 years , 1997 .

[13]  Hui Li,et al.  Data mining method for listed companies' financial distress prediction , 2008, Knowl. Based Syst..

[14]  Hojjat Adeli,et al.  Hybridizing principles of TOPSIS with case-based reasoning for business failure prediction , 2011, Comput. Oper. Res..

[15]  Dae-Ki Kang,et al.  Ensemble with neural networks for bankruptcy prediction , 2010, Expert Syst. Appl..

[16]  Yi-Chung Hu,et al.  Comparing four bankruptcy prediction models: Logit, quadratic interval logit, neural and fuzzy neural networks , 2010, Expert Syst. Appl..

[17]  Hui Li,et al.  Listed companies' financial distress prediction based on weighted majority voting combination of multiple classifiers , 2008, Expert Syst. Appl..

[18]  Engin Avci,et al.  An intelligent diagnosis system based on principle component analysis and ANFIS for the heart valve diseases , 2009, Expert Syst. Appl..

[19]  Yuh-Jye Lee,et al.  Variable selection and oversampling in the use of smooth support vector machines for predicting the default risk of companies , 2009 .

[20]  Hui Li,et al.  Predicting business failure using classification and regression tree: An empirical comparison with popular classical statistical methods and top classification mining methods , 2010, Expert Syst. Appl..

[21]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[22]  Vadlamani Ravi,et al.  Financial distress prediction in banks using Group Method of Data Handling neural network, counter propagation neural network and fuzzy ARTMAP , 2010, Knowl. Based Syst..

[23]  Stewart Jones,et al.  An Error Component Logit Analysis of Corporate Bankruptcy and Insolvency Risk in Australia , 2007 .

[24]  Zhongsheng Hua,et al.  Predicting corporate financial distress based on integration of support vector machine and logistic regression , 2007, Expert Syst. Appl..

[25]  Hui Li,et al.  Financial distress prediction based on OR-CBR in the principle of k-nearest neighbors , 2009, Expert Syst. Appl..

[26]  Sharon A. DeVaney,et al.  Predicting Business Failure of Retail Firms: An Analysis Using Mixed Industry Models , 1998 .

[27]  Kyoung-jae Kim,et al.  Bankruptcy prediction modeling with hybrid case-based reasoning and genetic algorithms approach , 2009, Appl. Soft Comput..

[28]  Hui Li,et al.  On performance of case-based reasoning in Chinese business failure prediction from sensitivity, specificity, positive and negative values , 2011, Appl. Soft Comput..

[29]  Chih-Fong Tsai,et al.  Feature selection in bankruptcy prediction , 2009, Knowl. Based Syst..

[30]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[31]  Kemal Polat,et al.  Principles component analysis, fuzzy weighting pre-processing and artificial immune recognition system based diagnostic system for diagnosis of lung cancer , 2008, Expert Syst. Appl..

[32]  Fang-Mei Tseng,et al.  A quadratic interval logit model for forecasting bankruptcy , 2005 .

[33]  Juliana Yim,et al.  Comparison of country risk models: hybrid neural networks, logit models, discriminant analysis and cluster techniques , 2005, Expert Syst. Appl..

[34]  Young-Chan Lee,et al.  Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters , 2005, Expert Syst. Appl..

[35]  Ching-Chiang Yeh,et al.  The use of hybrid manifold learning and support vector machines in the prediction of business failure , 2011, Knowl. Based Syst..

[36]  E. Laitinen,et al.  Bankruptcy prediction: Application of the Taylor's expansion in logistic regression , 2000 .

[37]  Hui Li,et al.  Forecasting business failure in China using case‐based reasoning with hybrid case respresentation , 2009 .

[38]  Ingoo Han,et al.  Bankruptcy prediction using case-based reasoning, neural networks, and discriminant analysis , 1997 .

[39]  Hui Li,et al.  Financial distress early warning based on group decision making , 2009, Comput. Oper. Res..

[40]  James A. Ohlson FINANCIAL RATIOS AND THE PROBABILISTIC PREDICTION OF BANKRUPTCY , 1980 .

[41]  Vadlamani Ravi,et al.  Differential evolution trained wavelet neural networks: Application to bankruptcy prediction in banks , 2009, Expert Syst. Appl..