A Novel Tax Evasion Detection Framework via Fused Transaction Network Representation

Tax evasion usually refers to the false declaration of taxpayers to reduce their tax obligations; this type of behavior leads to the loss of taxes and damage to the fair principle of taxation. Tax evasion detection plays a crucial role in reducing tax revenue loss. Currently, efficient auditing methods mainly include traditional data-mining-oriented methods, which cannot be well adapted to the increasingly complicated transaction relationships between taxpayers. Driven by this requirement, recent studies have been conducted by establishing a transaction network and applying the graphical pattern matching algorithm for tax evasion identification. However, such methods rely on expert experience to extract the tax evasion chart pattern, which is time-consuming and labor-intensive. More importantly, taxpayers' basic attributes are not considered and the dual identity of the taxpayer in the transaction network is not well retained. To address this issue, we have proposed a novel tax evasion detection framework via fused transaction network representation (TED-TNR), to detecting tax evasion based on fused transaction network representation, which jointly embeds transaction network topological information and basic taxpayer attributes into low-dimensional vector space, and considers the dual identity of the taxpayer in the transaction network. Finally, we conducted experimental tests on real-world tax data, revealing the superiority of our method, compared with state-of-the-art models.

[1]  Qinghua Zheng,et al.  IRTED-TL: An Inter-Region Tax Evasion Detection Method Based on Transfer Learning , 2018, 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/ 12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE).

[2]  Zhenisbek Assylbekov,et al.  Detecting Value-Added Tax Evasion by Business Entities of Kazakhstan , 2016, KES-IDT.

[3]  Xiaolong Jin,et al.  Predict Anchor Links across Social Networks via an Embedding Approach , 2016, IJCAI.

[4]  Eduardo Tapia,et al.  An Agent-Based Model of Tax Compliance: an Application to the Spanish Case , 2013, Adv. Complex Syst..

[5]  You-Shyang Chen,et al.  A Delphi-based rough sets fusion model for extracting payment rules of vehicle license tax in the government sector , 2010, Expert Syst. Appl..

[6]  Qinghua Zheng,et al.  Mining Suspicious Tax Evasion Groups in a Corporate Governance Network , 2017, ICA3PP.

[7]  Cheng Li,et al.  DeepCas: An End-to-end Predictor of Information Cascades , 2016, WWW.

[8]  She-I Chang,et al.  Using data mining technique to enhance tax evasion detection performance , 2012, Expert Syst. Appl..

[9]  Eduardo Tapia,et al.  Tax Compliance, Rational Choice, and Social Influence: An Agent-Based Model , 2014 .

[10]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[11]  Wei Lu,et al.  Deep Neural Networks for Learning Graph Representations , 2016, AAAI.

[12]  Erik Hemberg,et al.  Modeling tax evasion with genetic algorithms , 2014 .

[13]  Xiao Huang,et al.  Accelerated Attributed Network Embedding , 2017, SDM.

[14]  Dimitrios Hristu-Varsakelis,et al.  A decision support model for tax revenue collection in Greece , 2012, Decis. Support Syst..

[15]  Zachary Eyler-Walker,et al.  Closing the gap: automated screening of tax returns to identify egregious tax shelters , 2006, SKDD.

[16]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[17]  Jian Pei,et al.  A Survey on Network Embedding , 2017, IEEE Transactions on Knowledge and Data Engineering.

[18]  Una-May O'Reilly,et al.  Detecting tax evasion: a co-evolutionary approach , 2016, Artificial Intelligence and Law.

[19]  Andrés Moreno,et al.  Tax Fraud Detection for Under-Reporting Declarations Using an Unsupervised Machine Learning Approach , 2018, KDD.

[20]  Xiaoqing Liu,et al.  Application of Hierarchical Clustering in Tax Inspection Case-Selecting , 2010, 2010 International Conference on Computational Intelligence and Software Engineering.

[21]  Jian Pei,et al.  Asymmetric Transitivity Preserving Graph Embedding , 2016, KDD.

[22]  Juan D. Velásquez,et al.  Characterization and detection of taxpayers with false invoices using data mining techniques , 2013, Expert Syst. Appl..

[23]  Manish Gupta,et al.  Audit Selection Strategy for Improving Tax Compliance – Application of Data Mining Techniques , 2007 .

[24]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[25]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[26]  Naoki Abe,et al.  Optimizing debt collections using constrained reinforcement learning , 2010, KDD.

[27]  João Balsa,et al.  Agents that collude to evade taxes , 2007, AAMAS '07.

[28]  Heng Huang,et al.  Deep Attributed Network Embedding , 2018, IJCAI.

[29]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[30]  Una-May O'Reilly,et al.  Tax non-compliance detection using co-evolution of tax evasion risk and audit likelihood , 2015, ICAIL.

[31]  Qinghua Zheng,et al.  Mining Suspicious Tax Evasion Groups in Big Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[32]  Qinghua Zheng,et al.  Identifying suspicious groups of affiliated-transaction-based tax evasion in big data , 2019, Inf. Sci..

[33]  Foster J. Provost,et al.  Corporate residence fraud detection , 2014, KDD.

[34]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[35]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[36]  Graham J. Williams,et al.  Exploratory Multilevel Hot Spot Analysis: Australian Taxation Office Case Study , 2007, AusDM.

[37]  Jian Pei,et al.  Community Preserving Network Embedding , 2017, AAAI.