A Comparative Study of Machine Learning Techniques for Credit Card Fraud Detection Based on Time Variance

This paper proposes a comparative performance of ten different machine learning algorithms, done on a credit card fraud detection application. The machine learning methods have been classified into two groups namely classification algorithms and ensemble learning group. Each group is comprised of five different algorithms. Besides, the 'Time' feature is introduced in the data set and performances of the algorithms are studied with and without the 'Time' feature. Two algorithms of the ensemble learning group have been found to perform better when the used dataset does not include the 'Time' feature. However, for the classification algorithms group, three classifiers are found to show better predictive accuracies when all attributes are included in the used dataset. The rest of the machine learning models have approximate similar scores between these datasets.

[1]  Arti Mohanpurkar,et al.  Credit card fraud detection using Hidden Markov Model , 2011, 2011 World Congress on Information and Communication Technologies.

[2]  Marijana Zekić-Sušac,et al.  Predicting company growth using logistic regression and neural networks , 2016 .

[3]  S. Siva Prakash,et al.  Credit Card Fraud Detection using Adaboost and Majority Voting , 2019 .

[4]  Jorge Cadima,et al.  Principal component analysis: a review and recent developments , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[5]  Weili. Ong,et al.  Real time credit card fraud detection using computational intelligence , 2011 .

[6]  Reid A. Johnson,et al.  Calibrating Probability with Undersampling for Unbalanced Classification , 2015, 2015 IEEE Symposium Series on Computational Intelligence.

[7]  Christopher Krügel,et al.  Understanding fraudulent activities in online ad exchanges , 2011, IMC '11.

[8]  D. Cox The Regression Analysis of Binary Sequences , 2017 .

[9]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[10]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[11]  Sauchi Stephen Lee Noisy replication in skewed binary classification , 2000 .

[12]  Gianluca Bontempi,et al.  Learned lessons in credit card fraud detection from a practitioner perspective , 2014, Expert Syst. Appl..

[13]  Samuel A. Oluwadare,et al.  Credit card fraud detection using machine learning techniques: A comparative analysis , 2017, 2017 International Conference on Computing Networking and Informatics (ICCNI).

[14]  Ashish Rastogi,et al.  A Machine Learning Approach for Detection of Fraud based on SVM , 2013 .

[15]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[16]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[17]  Véronique Van Vlasselaer,et al.  Fraud Analytics : Using Descriptive, Predictive, and Social Network Techniques:A Guide to Data Science for Fraud Detection , 2015 .

[18]  R. Polikar,et al.  Bootstrap - Inspired Techniques in Computation Intelligence , 2007, IEEE Signal Processing Magazine.

[19]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[20]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[21]  Francisca Nonyelum Ogwueleka DATA MINING APPLICATION IN CREDIT CARD FRAUD DETECTION SYSTEM , 2011 .