Taxpayer compliance classification using C4.5, SVM, KNN, Naive Bayes and MLP

Tax revenue has a very important role to fund the State's finances. In order for the optimal tax revenue, the tax authorities must perform tax supervision to the taxpayers optimally. By using the self-assessment taxation system that is taxpayers calculation, pay and report their own tax obligations added with the data of other parties will create a very large data. Therefore, the tax authorities are required to immediately know the taxpayer non-compliance for further audit. This research uses the classification algorithm C4.5, SVM (Support Vector Machine), KNN (K-Nearest Neighbor), Naive Bayes and MLP (Multilayer Perceptron) to classify the level of taxpayer compliance with four goals that are corporate taxpayers comply formally and materially required, corporate taxpayers comply formally required, corporate taxpayers comply materially required and corporate taxpayers not comply formally and materially required. The classification results of each algorithm are compared and the best algorithm chosen based on criteria F-Score, Accuracy and Time taken to build the model by using fuzzy TOPSIS method. The final result shows that C4.5 algorithm is the best algorithm to classify taxpayer compliance level compared to other algorithms.

[1]  Mohsen Naderpour,et al.  Decision making on adoption of cloud computing in e-commerce using fuzzy TOPSIS , 2017, 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE).

[2]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[3]  Rana Alaa El-Deen Ahmeda,et al.  Performance Study of Classification Algorithms for Consumer Online Shopping Attitudes and Behavior Using Data Mining , 2015, 2015 Fifth International Conference on Communication Systems and Network Technologies.

[4]  Gurpreet Singh,et al.  Multi-layer perceptron (MLP) neural network technique for offline handwritten Gurmukhi character recognition , 2014, 2014 IEEE International Conference on Computational Intelligence and Computing Research.

[5]  Riyanarto Sarno,et al.  Personality classification based on Twitter text using Naive Bayes, KNN and SVM , 2015, 2015 International Conference on Data and Software Engineering (ICoDSE).

[6]  Y. Anwar,et al.  Corporate Governance, Earnings Management and Tax Management☆ , 2015 .

[7]  M. Mayilvaganan,et al.  Comparison of classification techniques for predicting the performance of students academic environment , 2014, 2014 International Conference on Communication and Network Technologies.

[8]  She-I Chang,et al.  Using data mining technique to enhance tax evasion detection performance , 2012, Expert Syst. Appl..

[9]  Cheng Peng,et al.  Novel naïve Bayes classification models for predicting the carcinogenicity of chemicals. , 2016, Food and chemical toxicology : an international journal published for the British Industrial Biological Research Association.

[10]  Riyanarto Sarno,et al.  Priority Determination for Higher Education Strategic Planning Using Balanced Scorecard, FAHP and TOPSIS (Case study: XYZ University) , 2016 .

[11]  Mehdi Ghazanfari,et al.  Detecting corporate tax evasion using a hybrid intelligent system: A case study of Iran , 2017, Int. J. Account. Inf. Syst..

[12]  Palli Suryachandra,et al.  Comparison of machine learning algorithms for breast cancer , 2016, 2016 International Conference on Inventive Computation Technologies (ICICT).

[13]  Dariusz Walczak,et al.  Project rankings for participatory budget based on the fuzzy TOPSIS method , 2017, Eur. J. Oper. Res..

[14]  Serkan Yavuz,et al.  Weapon selection using the AHP and TOPSIS methods under fuzzy environment , 2009, Expert Syst. Appl..

[15]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[16]  Eşref Adalı,et al.  Multilayer perceptron neural network technique for fraud detection , 2017, 2017 International Conference on Computer Science and Engineering (UBMK).

[17]  Satyendra Prasad Singh,et al.  Performance Analysis of Classification Tree Learning Algorithms , 2012 .