Forecasting Fraudulent Financial Statements using Data Mining

This paper explores the effectiveness of machine learning techniques in detecting firms that issue fraudulent financial statements (FFS) and deals with the identification of factors associated to FFS. To this end, a number of experiments have been conducted using representative learning algorithms, which were trained using a data set of 164 fraud and non-fraud Greek firms in the recent period 2001-2002. The decision of which particular method to choose is a complicated problem. A good alternative to choosing only one method is to create a hybrid forecasting system incorporating a number of possible solution methods as components (an ensemble of classifiers). For this purpose, we have implemented a hybrid decision support system that combines the representative algorithms using a stacking variant methodology and achieves better performance than any examined simple and ensemble method. To sum up, this study indicates that the investigation of financial information can be used in the identification of FFS and underline the importance of financial ratios. Keywords—machine learning, stacking, classifier

[1]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[2]  Kenneth O. Cogger,et al.  Neural network detection of management fraud using published financial data , 1998, Intell. Syst. Account. Finance Manag..

[3]  Alexander K. Seewald,et al.  How to Make Stacking Better and Faster While Also Taking Care of an Unknown Weakness , 2002, International Conference on Machine Learning.

[4]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[5]  C. Zopounidis,et al.  Detecting falsified financial statements: a comparative study using multicriteria analysis and multivariate statistical techniques , 2002 .

[6]  Kurt Fanning,et al.  Neural Network Detection of Management Fraud Using Published Financial Data , 1998 .

[7]  Charalambos Spathis Detecting false financial statements using published data: some evidence from Greece , 2002 .

[8]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[9]  B. Green,et al.  Assessing the risk of management fraud through neural network technology , 1997 .

[10]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[11]  David W. Aha,et al.  Lazy Learning , 1997, Springer Netherlands.

[12]  Joseph V. Carcello,et al.  A Decision Aid for Assessing the Likelihood of Fraudulent Financial Reporting , 2000 .

[13]  J. Coffee A Theory of Corporate Scandals: Why the U.S. And Europe Differ , 2005 .

[14]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[15]  Ian Witten,et al.  Data Mining , 2000 .

[16]  JOHANNES FÜRNKRANZ,et al.  Separate-and-Conquer Rule Learning , 1999, Artificial Intelligence Review.

[17]  Ross L. Watts,et al.  Positive Accounting Theory , 2006 .

[18]  Glen D. Moyes,et al.  An empirical analysis of the likelihood of detecting fraud in New Zealand , 2002 .

[19]  Johannes Fürnkranz,et al.  An Evaluation of Grading Classifiers , 2001, IDA.

[20]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[21]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[22]  David Corderre Fraud Detection: Using Data Analysis Techniques to Detect Fraud , 2000 .

[23]  Yannis Manolopoulos,et al.  Data Mining techniques for the detection of fraudulent financial statements , 2007, Expert Syst. Appl..

[24]  Finn Verner Jensen,et al.  Introduction to Bayesian Networks , 2008, Innovations in Bayesian Networks.

[25]  Sreerama K. Murthy,et al.  Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey , 1998, Data Mining and Knowledge Discovery.

[26]  Thomas G. Calderon,et al.  A roadmap for future neural networks research in auditing and risk assessment , 2002, Int. J. Account. Inf. Syst..

[27]  Ian H. Witten,et al.  Induction of model trees for predicting continuous classes , 1996 .

[28]  Roger Meuwissen,et al.  Classification and Analysis of Major European Business Failures , 2005 .