Developing a decision support system to detect material weaknesses in internal control

Abstract Wells Fargo employees set up 3.5 million fraudulent accounts over several years. Wells Fargo ended up paying $4.5 billion in fines. The Wells Fargo scandal highlights the importance of management actions to prevent misstatements and potential frauds on a timely basis. This study utilized the design science research paradigm to develop a predictive framework (IT artifact) in order to stratify firms into multiple risk groups for disclosing material weakness(es) in internal control (MWIC). The proposed methodology employed a hybrid heuristic optimization-based machine learning methodology. Synthetic minority over-sampling technique (SMOTE) was utilized to handle the learning problem with the imbalanced data. The best performing model was the proposed hybrid Genetic Algorithms (GA) and Support Vector Machines (SVM). The proposed methodology was internally validated via k-fold cross validation, and then externally validated using several separate datasets. The (GA) selected variables were ranked from the most important to the least through Information fusion (IF) sensitivity. A web-based decision support system was built to predict the firm-specific MWIC risk category. The web-based tool can be used to create an early warning system for predicting MWIC(s).

[1]  Margaret F. Shipley,et al.  Assessing risks due to threats to internal control in a computer-based accounting information system: a pragmatic approach based on fuzzy set theory , 2004, Intell. Syst. Account. Finance Manag..

[2]  Yongsheng Ding,et al.  Forecasting financial condition of Chinese listed companies based on support vector machine , 2008, Expert Syst. Appl..

[3]  Cemil Kuzey,et al.  Analyzing initial public offerings' short-term performance using decision trees and SVMs , 2015, Decis. Support Syst..

[4]  Yong Shi,et al.  Predicting Material Weaknesses In Internal Control Systems After The Sarbanes-Oxley Act Using Multiple Criteria Linear Programming And Other Data Mining Approaches , 2011 .

[5]  Tim Loughran,et al.  Textual Analysis in Accounting and Finance: A Survey: TEXTUAL ANALYSIS IN ACCOUNTING AND FINANCE , 2016 .

[6]  Vadlamani Ravi,et al.  Bankruptcy prediction in banks and firms via statistical and intelligent techniques - A review , 2007, Eur. J. Oper. Res..

[7]  David Martens,et al.  Bankruptcy prediction for SMEs using relational data , 2017, Decis. Support Syst..

[8]  Iván Pastor Sanz,et al.  Bankruptcy visualization and prediction using neural networks: A study of U.S. commercial banks , 2015, Expert Syst. Appl..

[9]  Philippe du Jardin,et al.  A two-stage classification technique for bankruptcy prediction , 2016, Eur. J. Oper. Res..

[10]  Kyung-shik Shin,et al.  An application of support vector machines in bankruptcy prediction model , 2005, Expert Syst. Appl..

[11]  Weili Ge,et al.  Determinants of Weaknesses in Internal Control over Financial Reporting , 2006 .

[12]  Jean C. Bedard,et al.  Detection and Severity Classifications of Sarbanes-Oxley Section 404 Internal Control Deficiencies , 2011 .

[13]  Ruibin Geng,et al.  Prediction of financial distress: An empirical study of listed Chinese companies using data mining , 2015, Eur. J. Oper. Res..

[14]  Vadlamani Ravi,et al.  Detection of financial statement fraud and feature selection using data mining techniques , 2011, Decis. Support Syst..

[15]  Asil Oztekin,et al.  A data analytic approach to forecasting daily stock returns in an emerging market , 2016, Eur. J. Oper. Res..

[16]  Thomas Fischer,et al.  Deep learning with long short-term memory networks for financial market predictions , 2017, Eur. J. Oper. Res..

[17]  Ingoo Han,et al.  Hybrid genetic algorithms and support vector machines for bankruptcy prediction , 2006, Expert Syst. Appl..

[18]  Sarah C. Rice,et al.  How Effective is Internal Control Reporting Under SOX 404? Determinants of the (Non-)Disclosure of Existing Material Weaknesses , 2011 .

[19]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[20]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[21]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[22]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[23]  Daniel Wangerin,et al.  Internal Control Over Financial Reporting and Managerial Rent Extraction: Evidence from the Profitability of Insider Trading , 2013 .

[24]  Ali Dag,et al.  Stratifying no-show patients into multiple risk groups via a holistic data analytics-based framework , 2020, Decis. Support Syst..

[25]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[26]  Sotiris Kotsiantis,et al.  Forecasting Fraudulent Financial Statements using Data Mining , 2007 .

[27]  William R. Kinney,et al.  The Discovery and Reporting of Internal Control Deficiencies Prior to SOX-Mandated Audits , 2007 .

[28]  So Young Sohn,et al.  Support vector machines for default prediction of SMEs based on technology credit , 2010, Eur. J. Oper. Res..

[29]  E. Miller Risk, Uncertainty, and Divergence of Opinion , 1977 .

[30]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[31]  Erik Cambria,et al.  Natural language based financial forecasting: a survey , 2017, Artificial Intelligence Review.

[32]  Chih-Hsun Chou,et al.  Hybrid genetic algorithm and fuzzy clustering for bankruptcy prediction , 2017, Appl. Soft Comput..

[33]  Mei Feng,et al.  Does Ineffective Internal Control over Financial Reporting Affect a Firm’s Operations? Evidence from Firms’ Inventory Management , 2014 .

[34]  Hongnian Yu,et al.  A combination selection algorithm on forecasting , 2014, Eur. J. Oper. Res..

[35]  E. Myllymäki The Persistence in the Association between Section 404 Material Weaknesses and Financial Reporting Quality , 2014 .

[36]  John J. Cheh,et al.  Determinants of Internal Control Weaknesses , 2010 .

[37]  Marlene Plumlee,et al.  An Analysis of the Underlying Causes Attributed to Restatements , 2009 .

[38]  Alan R. Hevner,et al.  Design Science in Information Systems Research , 2004, MIS Q..

[39]  J. Popp,et al.  Sample size planning for classification models. , 2012, Analytica chimica acta.

[40]  Ali Dag,et al.  A Bayesian Belief Network-based probabilistic mechanism to determine patient no-show risk categories , 2020 .

[41]  Rudrajeet Pal,et al.  Business health characterization: A hybrid regression and support vector machine analysis , 2016, Expert Syst. Appl..

[42]  Vikas Kumar,et al.  Systematic review of bankruptcy prediction models: Towards a framework for tool selection , 2018, Expert Syst. Appl..

[43]  Ting Sun,et al.  The Incremental Informativeness of the Sentiment of Conference Calls for Internal Control Material Weaknesses , 2018 .

[44]  Leslie M. Collins,et al.  Financial fraud detection using vocal, linguistic and financial cues , 2015, Decis. Support Syst..

[45]  Guido Caldarelli,et al.  S 1 Appendix , 2016 .

[46]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[47]  Herbert Kimura,et al.  Machine learning models and bankruptcy prediction , 2017, Expert Syst. Appl..

[48]  L. Su,et al.  Do customers respond to the disclosure of internal control weakness , 2014 .

[49]  Weili Ge,et al.  Accruals Quality and Internal Control Over Financial Reporting , 2007 .

[50]  Marjorie B. Platt,et al.  Probabilistic Neural Networks in Bankruptcy Prediction , 1999 .

[51]  Ali Dag,et al.  A Bayesian Approach to Detect the Firms with Material Weakness in Internal Control , 2019 .

[52]  Johan L. Perols Financial Statement Fraud Detection: An Analysis of Statistical and Machine Learning Algorithms , 2011 .

[53]  Emilia Grzegorzewska,et al.  Bankruptcy prediction in the agribusiness sector: Lessons from quantitative and qualitative approaches , 2018, Journal of Business Research.

[54]  Erkam Güresen,et al.  Developing an early warning system to predict currency crises , 2014, Eur. J. Oper. Res..

[55]  Federico Girosi,et al.  Support Vector Machines: Training and Applications , 1997 .

[56]  Chuan-Ju Wang,et al.  On the risk prediction and analysis of soft information in finance reports , 2017, Eur. J. Oper. Res..

[57]  D. Donelson,et al.  Internal Control Weaknesses and Financial Reporting Fraud , 2017 .

[58]  Regina Wittenberg Moerman,et al.  The Impact of Financial Reporting Quality on Debt Contracting: Evidence from Internal Control Weakness Reports , 2010 .

[59]  Deron Liang,et al.  Financial ratios and corporate governance indicators in bankruptcy prediction: A comprehensive study , 2016, Eur. J. Oper. Res..

[60]  Alan R. Hevner,et al.  POSITIONING AND PRESENTING DESIGN SCIENCE RESEARCH FOR MAXIMUM IMPACT 1 , 2013 .

[61]  Ching-Chiang Yeh,et al.  Going-concern prediction using hybrid random forests and rough set approach , 2014, Inf. Sci..

[62]  Dan W. Patterson,et al.  Artificial Neural Networks: Theory and Applications , 1998 .

[63]  Patricia M. Dechow,et al.  Predicting Material Accounting Misstatements*: Predicting Material Accounting Misstatements , 2011 .

[64]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.