Association Rules and Machine Learning for Enhancing Undeclared Work Detection

Undeclared work is, by definition, a multi-faceted phenomenon that needs to be detected. In welfare states, undeclared work results in loss of public revenue and thus resources critical for welfare mechanisms’ funding, lack of worker protection and, last but not least, unfair competition for legitimate businesses. Yet, little to no studies have proposed the use of sophisticated machine learning methods in tackling this severe socioeconomic problem. In this study, we demonstrate the application of an advanced data analysis method, the association rule mining, which has significant advantages over rule-based systems, in classifying employers likely to engage in undeclared work. Indeed, the results of this pilot project proved divulging, even to the most experienced labour inspectors, offering insights in patterns of employers’ illegal behaviour, that were previously unidentified.

[1]  Vadlamani Ravi,et al.  Detection of financial statement fraud and feature selection using data mining techniques , 2011, Decis. Support Syst..

[2]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[3]  Darshan M. Tank,et al.  Improved Apriori Algorithm for Mining Association Rules , 2014 .

[4]  Gehao Sheng,et al.  A Novel Association Rule Mining Method of Big Data for Power Transformers State Parameters Based on Probabilistic Graph Model , 2018, IEEE Transactions on Smart Grid.

[5]  Sotirios Chatzis,et al.  Gated Mixture Variational Autoencoders for Value Added Tax audit case selection , 2020, Knowl. Based Syst..

[6]  C. Williams,et al.  Evaluating the policy approaches for tackling undeclared work in the European Union , 2017 .

[7]  Manish Gupta,et al.  Audit Selection Strategy for Improving Tax Compliance – Application of Data Mining Techniques , 2007 .

[8]  C. Williams,et al.  An Evaluation of the Scale of Undeclared Work in the European Union and Its Structural Determinants: Estimates Using the Labour Input Method , 2017 .

[9]  Yin Shan,et al.  Mining Medical Specialist Billing Patterns for Health Service Management , 2008, AusDM.

[10]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[11]  C. Williams Diagnostic Report on undeclared work in Greece , 2016 .

[12]  Maumita Bhattacharya,et al.  Intelligent Financial Fraud Detection: A Comprehensive Review , 2015 .

[13]  C. Williams Tackling Undeclared Work In The European Union: An Evaluation Of Government Policy Approaches , 2019 .

[14]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[15]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[16]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[17]  She-I Chang,et al.  Using data mining technique to enhance tax evasion detection performance , 2012, Expert Syst. Appl..

[18]  Hiroki Arimura,et al.  LCM: An Efficient Algorithm for Enumerating Frequent Closed Item Sets , 2003, FIMI.

[19]  Rakhi Garg,et al.  Mining Association Rules in Various Computing Environments: A Survey , 2019, ArXiv.

[20]  F. Schneider,et al.  Shadow Economies: Size, Causes, and Consequences , 2000 .

[21]  José Maria Monteiro,et al.  An Empirical Method for Discovering Tax Fraudsters: A Real Case Study of Brazilian Fiscal Evasion , 2015, IDEAS.

[22]  Chia-Wen Liao,et al.  Designing of dynamic labor inspection system for construction industry , 2012, Expert Syst. Appl..

[23]  Xiaojing Wang,et al.  Mining algorithm for association rules in big data based on Hadoop , 2018 .

[24]  Seyed Mohssen Ghafari,et al.  A survey on association rules mining using heuristics , 2019, WIREs Data Mining Knowl. Discov..