Corporate residence fraud detection

With the globalisation of the world's economies and ever-evolving financial structures, fraud has become one of the main dissipaters of government wealth and perhaps even a major contributor in the slowing down of economies in general. Although corporate residence fraud is known to be a major factor, data availability and high sensitivity have caused this domain to be largely untouched by academia. The current Belgian government has pledged to tackle this issue at large by using a variety of in-house approaches and cooperations with institutions such as academia, the ultimate goal being a fair and efficient taxation system. This is the first data mining application specifically aimed at finding corporate residence fraud, where we show the predictive value of using both structured and fine-grained invoicing data. We further describe the problems involved in building such a fraud detection system, which are mainly data-related (e.g. data asymmetry, quality, volume, variety and velocity) and deployment-related (e.g. the need for explanations of the predictions made).

[1]  J. M. Serrano,et al.  Association rules applied to credit card fraud detection , 2009, Expert Syst. Appl..

[2]  Corinna Cortes,et al.  Communities of interest , 2001, Intell. Data Anal..

[3]  D. Hand,et al.  Unsupervised Profiling Methods for Fraud Detection , 2002 .

[4]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[5]  Mihaela Aghenitei,et al.  THE FIGHT AGAINST TAX FRAUD AND TAX EVASION , 2013 .

[6]  Foster Provost,et al.  Suspicion scoring of networked entities based on guilt-by-association, collective inference, and focused data access 1 , 2005 .

[7]  Foster J. Provost,et al.  Using co-visitation networks for detecting large scale online display advertising exchange fraud , 2013, KDD.

[8]  She-I Chang,et al.  Using data mining technique to enhance tax evasion detection performance , 2012, Expert Syst. Appl..

[9]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[10]  Jean-Jacques Rousseau,et al.  Du Contrat social ou Principes du droit politique , 1839 .

[11]  S HilasConstantinos,et al.  An application of supervised and unsupervised learning approaches to telecommunications fraud detection , 2008 .

[12]  A. Hasan,et al.  Organisation for Economic Co-operation and Development , 2007 .

[13]  Paris A. Mastorocostas,et al.  An application of supervised and unsupervised learning approaches to telecommunications fraud detection , 2008, Knowl. Based Syst..

[14]  Miriam H. Baer Linkage and the Deterrence of Corporate Fraud , 2008 .

[15]  Cynthia Rudin,et al.  The P-Norm Push: A Simple Convex Ranking Algorithm that Concentrates at the Top of the List , 2009, J. Mach. Learn. Res..

[16]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[17]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[18]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[19]  Foster J. Provost,et al.  Explaining Data-Driven Document Classifications , 2013, MIS Q..

[20]  Juan D. Velásquez,et al.  Characterization and detection of taxpayers with false invoices using data mining techniques , 2013, Expert Syst. Appl..

[21]  A. Savvopoulos Consumer Credit Models: Pricing, Profit and Portfolios , 2010 .

[22]  Jonathan N. Crook,et al.  Credit Scoring and Its Applications , 2002, SIAM monographs on mathematical modeling and computation.

[23]  Niall M. Adams,et al.  Transaction aggregation as a strategy for credit card fraud detection , 2009, Data Mining and Knowledge Discovery.

[24]  COMMUNICATION FROM THE COMMISSION TO THE EUROPEAN PARLIAMENT AND THE COUNCIL State of progress of the Galileo programme , 2002 .

[25]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[26]  Rüdiger W. Brause,et al.  Neural data mining for credit card fraud detection , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[27]  L. Thomas Consumer credit models: pricing, profit and portfolios , 2009 .

[28]  Yannis Manolopoulos,et al.  Data Mining techniques for the detection of fraudulent financial statements , 2007, Expert Syst. Appl..

[29]  Niall M. Adams,et al.  Off-the-peg and bespoke classifiers for fraud detection , 2008, Comput. Stat. Data Anal..

[30]  Praveen Pathak,et al.  Detecting Management Fraud in Public Companies , 2010, Manag. Sci..

[31]  Tom Fawcett,et al.  Adaptive Fraud Detection , 1997, Data Mining and Knowledge Discovery.

[32]  Kate Smith-Miles,et al.  A Comprehensive Survey of Data Mining-based Fraud Detection Research , 2010, ArXiv.

[33]  Troy Raeder,et al.  Using Co-Visitation Networks For Classifying Non-Intentional Traffic , 2013 .

[34]  Foster Provost,et al.  Suspicion scoring based on guilt-by-association, colle ctive inference, and focused data access 1 , 2005 .

[35]  David J. Hand,et al.  Statistical fraud detection: A review , 2002 .

[36]  David Martens,et al.  DEPARTMENT OF ENGINEERING MANAGEMENT Classification over bipartite graphs through projection , 2015 .

[37]  Joseph P. DeMarco,et al.  The Social Contract or Principles of Political Right , 1975 .

[38]  Foster J. Provost,et al.  Distribution-based aggregation for relational learning with identifier attributes , 2006, Machine Learning.

[39]  Carla E. Brodley,et al.  Class Imbalance, Redux , 2011, 2011 IEEE 11th International Conference on Data Mining.

[40]  Tom Fawcett,et al.  Combining Data Mining and Machine Learning for Effective User Profiling , 1996, KDD.

[41]  Foster J. Provost,et al.  Predictive Modeling With Big Data: Is Bigger Really Better? , 2013, Big Data.

[42]  Ekrem Duman,et al.  Detecting credit card fraud by decision trees and support vector machines , 2011 .

[43]  Foster Provost,et al.  A Simple Relational Classifier , 2003 .

[44]  Dino Pedreschi,et al.  High Quality True-Positive Prediction for Fiscal Fraud Detection , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[45]  Alfons Söllner,et al.  Jean-Jacques Rousseau, Du Contrat Social ou Principes du Droit Politique, Amsterdam 1762 , 2007 .