Deep learning models for bankruptcy prediction using textual disclosures

Abstract This study introduces deep learning models for corporate bankruptcy forecasting using textual disclosures. Although textual data are common, it is rarely considered in the financial decision support models. Deep learning uses layers of neural networks to extract features from textual data for prediction. We construct a comprehensive bankruptcy database of 11,827 U.S. public companies and show that deep learning models yield superior prediction performance in forecasting bankruptcy using textual disclosures. When textual data are used in conjunction with traditional accounting-based ratio and market-based variables, deep learning models can further improve the prediction accuracy. We also investigate the effectiveness of two deep learning architectures. Interestingly, our empirical results show that simpler models such as averaging embedding are more effective than convolutional neural networks. Our results provide the first large-sample evidence for the predictive power of textual disclosures.

[1]  Toshiyuki Sueyoshi,et al.  DEA-DA for bankruptcy-based performance assessment: Misclassification analysis of Japanese construction industry , 2009, Eur. J. Oper. Res..

[2]  Ming-Fu Hsu,et al.  A hybrid approach of DEA, rough set and support vector machines for business failure prediction , 2010, Expert systems with applications.

[3]  Praveen Pathak,et al.  Making words work: Using financial text as a predictor of financial events , 2010, Decis. Support Syst..

[4]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[5]  Hossein Etemadi,et al.  A Genetic Programming Model for Bankruptcy Prediction: Empirical Evidence from Iran , 2009, Expert Syst. Appl..

[6]  Yoshua Bengio,et al.  Practical Recommendations for Gradient-Based Training of Deep Architectures , 2012, Neural Networks: Tricks of the Trade.

[7]  H. Müller CHANGE-POINTS IN NONPARAMETRIC REGRESSION ANALYSIS' , 1992 .

[8]  Dae-Ki Kang,et al.  Ensemble with neural networks for bankruptcy prediction , 2010, Expert Syst. Appl..

[9]  David L. Olson,et al.  Comparative analysis of data mining methods for bankruptcy prediction , 2012, Decis. Support Syst..

[10]  Ramesh Sharda,et al.  Bankruptcy prediction using neural networks , 1994, Decis. Support Syst..

[11]  Feng Li Annual Report Readability, Current Earnings, and Earnings Persistence , 2008 .

[12]  I. Hasan,et al.  Financial Crises and Bank Failures: A Review of Prediction Methods , 2009 .

[13]  B. Bernanke Bankruptcy, Liquidity, and Recession , 1981 .

[14]  Pamela K. Coats,et al.  A neural network for classifying the financial health of a firm , 1995 .

[15]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[16]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[17]  Ning Chen,et al.  A genetic algorithm-based approach to cost-sensitive bankruptcy prediction , 2011, Expert Syst. Appl..

[18]  Mark Lang,et al.  Textual Analysis and International Financial Reporting: Large Sample Evidence , 2015 .

[19]  Philippe du Jardin,et al.  Bankruptcy prediction using terminal failure processes , 2015, Eur. J. Oper. Res..

[20]  Jake M. Hofman,et al.  Prediction and explanation in social systems , 2017, Science.

[21]  Dirk Tasche,et al.  Measuring the Discriminative Power of Rating Systems , 2003, SSRN Electronic Journal.

[22]  Hsinchun Chen,et al.  The information content of mandatory risk factor disclosures in corporate filings , 2010 .

[23]  Tim Loughran,et al.  When is a Liability not a Liability? Textual Analysis, Dictionaries, and 10-Ks , 2010 .

[24]  Hui Li,et al.  Hybridizing principles of the Electre method with case-based reasoning for data mining: Electre-CBR-I and Electre-CBR-II , 2009, Eur. J. Oper. Res..

[25]  Donald P. Cram,et al.  Assessing the Probability of Bankruptcy , 2002 .

[26]  Ingoo Han,et al.  Hybrid neural network models for bankruptcy predictions , 1996, Decis. Support Syst..

[27]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[28]  Gregor Dorfleitner,et al.  Description-text related soft information in peer-to-peer lending – Evidence from two leading European platforms , 2016 .

[29]  Deron Liang,et al.  Financial ratios and corporate governance indicators in bankruptcy prediction: A comprehensive study , 2016, Eur. J. Oper. Res..

[30]  Sumit Agarwal,et al.  The Information Value of Credit Rating Action Reports: A Textual Analysis , 2016 .

[31]  Vadlamani Ravi,et al.  Differential evolution trained wavelet neural networks: Application to bankruptcy prediction in banks , 2009, Expert Syst. Appl..

[32]  Thorsten Joachims,et al.  Text Categorization with Support Vector Machines: Learning with Many Relevant Features , 1998, ECML.

[33]  Steven Bird,et al.  NLTK: The Natural Language Toolkit , 2002, ACL.

[34]  David C. Yen,et al.  A hybrid financial analysis model for business failure prediction , 2008, Expert Syst. Appl..

[35]  Ioannis E. Tsolas,et al.  Evaluation of credit risk based on firm performance , 2010, Eur. J. Oper. Res..

[36]  Ruibin Geng,et al.  Prediction of financial distress: An empirical study of listed Chinese companies using data mining , 2015, Eur. J. Oper. Res..

[37]  Zahn Bozanic,et al.  Qualitative Disclosure and Changes in Sell‐Side Financial Analysts' Information Environment , 2015 .

[38]  Ömer Kaan Baykan,et al.  Predicting bank financial failures using neural networks, support vector machines and multivariate statistical methods: A comparative analysis in the sample of savings deposit insurance fund (SDIF) transferred banks in Turkey , 2009, Expert Syst. Appl..

[39]  Constantin Zopounidis,et al.  Assessing Bank Soundness with Classification Techniques , 2009 .

[40]  Vadlamani Ravi,et al.  Failure prediction of dotcom companies using hybrid intelligent techniques , 2009, Expert Syst. Appl..

[41]  Mu-Yen Chen,et al.  Predicting corporate financial distress based on integration of decision tree classification and logistic regression , 2011, Expert Syst. Appl..

[42]  Edward I. Altman,et al.  FINANCIAL RATIOS, DISCRIMINANT ANALYSIS AND THE PREDICTION OF CORPORATE BANKRUPTCY , 1968 .

[43]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[44]  Aapo Hyvärinen,et al.  Noise-Contrastive Estimation of Unnormalized Statistical Models, with Applications to Natural Image Statistics , 2012, J. Mach. Learn. Res..

[45]  Philippe du Jardin,et al.  A two-stage classification technique for bankruptcy prediction , 2016, Eur. J. Oper. Res..

[46]  Raktim Pal,et al.  Predicting the survival or failure of click-and-mortar corporations: A knowledge discovery approach , 2006, Eur. J. Oper. Res..

[47]  Jonathan Crook,et al.  Chinese companies distress prediction: an application of data envelopment analysis , 2014, J. Oper. Res. Soc..

[48]  Carlos Serrano-Cinca,et al.  Partial Least Square Discriminant Analysis for bankruptcy prediction , 2013, Decis. Support Syst..

[49]  Yoshua Bengio,et al.  Deep Sparse Rectifier Neural Networks , 2011, AISTATS.

[50]  Constantin Zopounidis,et al.  Corporate failure prediction in the European energy sector: A multicriteria approach and the effect of country characteristics , 2017, Eur. J. Oper. Res..

[51]  Peter F. Wanke,et al.  Financial distress drivers in Brazilian banks: A dynamic slacks approach , 2015, Eur. J. Oper. Res..

[52]  David A. Elizondo,et al.  Bankruptcy forecasting: An empirical comparison of AdaBoost and neural networks , 2008, Decis. Support Syst..

[53]  Silvia Angela Osmetti,et al.  The effectiveness of TARP-CPP on the US banking industry: A new copula-based approach , 2016, Eur. J. Oper. Res..

[54]  P. Schönbucher Credit Derivatives Pricing Models: Models, Pricing and Implementation , 2003 .

[55]  Toshiyuki Sueyoshi,et al.  DEA as a tool for bankruptcy assessment: A comparative study with logistic regression technique , 2009, Eur. J. Oper. Res..

[56]  William J. Mayew,et al.  MD&A Disclosure and the Firm's Ability to Continue as a Going Concern , 2015 .

[57]  W. Beaver Financial Ratios As Predictors Of Failure , 1966 .

[58]  C. J. Stone,et al.  Additive Regression and Other Nonparametric Models , 1985 .

[59]  Bo K. Wong,et al.  Neural network applications in finance: A review and analysis of literature (1990-1996) , 1998, Inf. Manag..

[60]  Yan Yu,et al.  Variable selection and corporate bankruptcy forecasts , 2015 .

[61]  Francisco Javier de Cos Juez,et al.  A hybrid device for the solution of sampling bias problems in the forecasting of firms' bankruptcy , 2012, Expert Syst. Appl..

[62]  J. Campbell,et al.  In Search of Distress Risk , 2006, SSRN Electronic Journal.

[63]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[64]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[65]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[66]  Tyler Shumway Forecasting Bankruptcy More Accurately: A Simple Hazard Model , 1999 .

[67]  Michael Y. Hu,et al.  Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis , 1999, Eur. J. Oper. Res..

[68]  Hui Li,et al.  Financial distress prediction based on OR-CBR in the principle of k-nearest neighbors , 2009, Expert Syst. Appl..

[69]  David J. Hand,et al.  Measuring classifier performance: a coherent alternative to the area under the ROC curve , 2009, Machine Learning.

[70]  Yan Yu,et al.  A Class of Discrete Transformation Survival Models With Application to Default Probability Prediction , 2012 .

[71]  Francisco Javier de Cos Juez,et al.  Bankruptcy forecasting: A hybrid approach using Fuzzy c-means clustering and Multivariate Adaptive Regression Splines (MARS) , 2011, Expert Syst. Appl..

[72]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[73]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.