Improving the prediction of going concern of Taiwanese listed companies using a hybrid of LASSO with data mining techniques

The purpose of this study is to establish rigorous and reliable going concern doubt (GCD) prediction models. This study first uses the least absolute shrinkage and selection operator (LASSO) to select variables and then applies data mining techniques to establish prediction models, such as neural network (NN), classification and regression tree (CART), and support vector machine (SVM). The samples of this study include 48 GCD listed companies and 124 NGCD (non-GCD) listed companies from 2002 to 2013 in the TEJ database. We conduct fivefold cross validation in order to identify the prediction accuracy. According to the empirical results, the prediction accuracy of the LASSO–NN model is 88.96 % (Type I error rate is 12.22 %; Type II error rate is 7.50 %), the prediction accuracy of the LASSO–CART model is 88.75 % (Type I error rate is 13.61 %; Type II error rate is 14.17 %), and the prediction accuracy of the LASSO–SVM model is 89.79 % (Type I error rate is 10.00 %; Type II error rate is 15.83 %).

[1]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[2]  Bryan K. Church,et al.  Default on Debt Obligations and the Issuance of Going-concern Opinions , 1992 .

[3]  Gregory R. Madey,et al.  The Application of Neural Networks and a Qualitative Response Model to the Auditor's Going Concern Uncertainty Decision* , 1995 .

[4]  D. Cormier,et al.  The Auditor's Consideration of the Going Concern Assumption: A Diagnostic Model , 1995 .

[5]  Bryan K. Church,et al.  Going Concern Opinions and the Market's Reaction to Bankruptcy Filings , 1996 .

[6]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[7]  Jane F. Mutchler,et al.  The influence of contrary information and mitigating factors on audit opinion decisions on bankrupt companies , 1997 .

[8]  Jon Woodroof,et al.  An Analysis of the Usefulness of Debt Defaults and Going Concern Opinions in Bankruptcy Risk Assessment , 1998 .

[9]  Daniel E. O’Leary Using neural networks to predict corporate failure , 1998 .

[10]  Murugan Anandarajan,et al.  A comparison of machine learning techniques with a qualitative response model for auditor’s going concern reporting , 1999 .

[11]  Joseph V. Carcello,et al.  Audit Committee Composition and Auditor Reporting , 2000 .

[12]  Kip R. Krumwiede,et al.  Further Evidence on the Auditor's Going‐Concern Report: The Influence of Management Plans , 2001 .

[13]  George J. Benston,et al.  Enron: what happened and what we can learn from it , 2002 .

[14]  Sunil J Rao,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2003 .

[15]  Dasaratha V. Rama,et al.  Audit fees, non-audit fees, and auditor reporting on UK stressed companies , 2003 .

[16]  Anthony Brabazon,et al.  A hybrid genetic model for the prediction of corporate failure , 2004, Comput. Manag. Sci..

[17]  Hian Chye Koh,et al.  Going concern prediction using data mining techniques , 2004 .

[18]  R. Tibshirani,et al.  Sparsity and smoothness via the fused lasso , 2005 .

[19]  Michael Doumpos,et al.  Probabilistic neural networks for the identification of qualified audit opinions , 2007, Expert Syst. Appl..

[20]  Yannis Manolopoulos,et al.  Identifying Qualified Auditors\' Opinions: A Data Mining Approach , 2007 .

[21]  Yannis Manolopoulos,et al.  Data Mining techniques for the detection of fraudulent financial statements , 2007, Expert Syst. Appl..

[22]  Chih-Fong Tsai,et al.  Using neural network ensembles for bankruptcy prediction and credit scoring , 2008, Expert Syst. Appl..

[23]  Bart Baesens,et al.  Predicting going concern opinion with data mining , 2008, Decis. Support Syst..

[24]  Hui Li,et al.  Data mining method for listed companies' financial distress prediction , 2008, Knowl. Based Syst..

[25]  Chih-Fong Tsai Financial decision support using neural networks and support vector machines , 2008, Expert Syst. J. Knowl. Eng..

[26]  Hui Li,et al.  Predicting business failure using multiple case-based reasoning combined with support vector machine , 2009, Expert Syst. Appl..

[27]  Tzong-Huei Lin,et al.  A cross model study of corporate financial distress prediction in Taiwan: Multiple discriminant analysis, logit, probit and neural networks models , 2009, Neurocomputing.

[28]  Tanya Y. Tang,et al.  Can Book-Tax Differences Capture Earnings Management and Tax Management? Empirical Evidence from China , 2010 .

[29]  Ming-Fu Hsu,et al.  A hybrid approach of DEA, rough set and support vector machines for business failure prediction , 2010, Expert systems with applications.

[30]  Jie Sun,et al.  SFFS-PC-NN optimized by genetic algorithm for dynamic prediction of financial distress with longitudinal data streams , 2011, Knowl. Based Syst..

[31]  F. Mokhatab Rafiei,et al.  Financial health prediction models using artificial neural networks, genetic algorithm and multivariate discriminant analysis: Iranian evidence , 2011, Expert Syst. Appl..

[32]  Ahsan Habib,et al.  Split-share reform and earnings management: Evidence from China , 2012 .

[33]  Kin Keung Lai,et al.  Empirical models based on features ranking techniques for corporate financial distress prediction , 2012, Comput. Math. Appl..

[34]  Ashish Patil,et al.  Increased tRNA modification and gene-specific codon usage regulate cell cycle progression during the DNA damage response , 2012, Cell cycle.

[35]  Mahdi Salehi,et al.  Data Mining Approach to Prediction of Going Concern Using Classification and Regression Tree (CART) , 2013 .

[36]  A Legarra,et al.  Application of Bayesian least absolute shrinkage and selection operator (LASSO) and BayesCπ methods for genomic selection in French Holstein and Montbéliarde breeds. , 2013, Journal of dairy science.

[37]  Tomasz Korol Early warning models against bankruptcy risk for Central European and Latin American enterprises , 2013 .

[38]  Ching-Chiang Yeh,et al.  Going-concern prediction using hybrid random forests and rough set approach , 2014, Inf. Sci..

[39]  Masashi Sugiyama,et al.  High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso , 2012, Neural Computation.

[40]  S. Kim,et al.  Predicting restaurant financial distress using decision tree and AdaBoosted decision tree models , 2014 .

[41]  Suduan Chen,et al.  A Hybrid Approach of Stepwise Regression, Logistic Regression, Support Vector Machine, and Decision Tree for Forecasting Fraudulent Financial Statements , 2014, TheScientificWorldJournal.

[42]  Vinoo Alluri,et al.  Capturing the musical brain with Lasso: Dynamic decoding of musical features from fMRI data , 2014, NeuroImage.

[43]  Thomas P. Trappenberg,et al.  A biological mechanism for Bayesian feature selection: Weight decay and raising the LASSO , 2015, Neural Networks.

[44]  Christophe Marsala,et al.  Rank discrimination measures for enforcing monotonicity in decision tree induction , 2015, Inf. Sci..

[45]  Fu-Hsiang Chen,et al.  Detecting biotechnology industry's earnings management using Bayesian network, principal component analysis, back propagation neural network, and decision tree , 2015 .

[46]  Suduan Chen,et al.  GOING CONCERN PREDICTION USING DATA MINING , 2015 .