Fraud Detection in Banking Data by Machine Learning Techniques

As technology advanced and e-commerce services expanded, credit cards became one of the most popular payment methods, increasing the volume of banking transactions. The accompanying rise in fraud imposes high costs on banking transactions, so detecting fraudulent activity has become an active research topic. In this study, we use class weight-tuning hyperparameters to control the relative weight of fraudulent and legitimate transactions. In particular, we use Bayesian optimization to tune the hyperparameters while accounting for practical issues such as unbalanced data. We propose weight-tuning as a preprocessing step for unbalanced data, and we combine CatBoost and XGBoost with LightGBM through a voting mechanism to improve on the performance of LightGBM alone. Finally, to improve performance further, we use deep learning to fine-tune the hyperparameters, particularly the proposed weight-tuning one. We test the proposed methods in experiments on real-world data. Because ROC-AUC alone can be misleading on unbalanced datasets, we report precision and recall metrics in addition to the standard ROC-AUC. CatBoost, LightGBM, and XGBoost are evaluated separately using 5-fold cross-validation, and a majority-voting ensemble is used to assess the performance of the combined algorithms. According to the results, LightGBM and XGBoost achieve the best scores: ROC-AUC = 0.95, precision = 0.79, recall = 0.80, F1 score = 0.79, and MCC = 0.79. Using deep learning and Bayesian optimization to tune the hyperparameters, we obtain ROC-AUC = 0.94, precision = 0.80, recall = 0.82, F1 score = 0.81, and MCC = 0.81. This is a significant improvement over the state-of-the-art methods we compared against.
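The majority-voting step and the imbalance-aware metrics named in the abstract (precision, recall, F1 score, MCC) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `majority_vote` and `imbalance_metrics` are hypothetical, and the three prediction lists are made-up toy values standing in for the outputs of three already-trained classifiers such as CatBoost, LightGBM, and XGBoost.

```python
import math


def majority_vote(*model_preds):
    """Combine binary predictions (0 = legitimate, 1 = fraud) by majority vote.

    Each argument is one model's list of predictions for the same transactions.
    A transaction is flagged as fraud when more than half of the models say so.
    """
    n_models = len(model_preds)
    return [1 if 2 * sum(votes) > n_models else 0 for votes in zip(*model_preds)]


def imbalance_metrics(y_true, y_pred):
    """Compute precision, recall, F1 and MCC from binary labels and predictions."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    # Guard every ratio against an empty denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "mcc": mcc}


# Toy, made-up data: 10 transactions, 3 of them fraudulent (label 1).
y_true = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]
preds_a = [0, 0, 0, 0, 0, 0, 1, 1, 1, 0]  # hypothetical model A
preds_b = [0, 0, 0, 0, 0, 0, 0, 1, 0, 1]  # hypothetical model B
preds_c = [0, 0, 0, 0, 0, 1, 0, 1, 1, 1]  # hypothetical model C

ensemble = majority_vote(preds_a, preds_b, preds_c)

for name, preds in [("A", preds_a), ("B", preds_b), ("C", preds_c),
                    ("ensemble", ensemble)]:
    print(name, imbalance_metrics(y_true, preds))
```

In this toy case each model makes different mistakes, so the vote outscores every individual model. Real classifiers tend to make correlated errors, and the gain from voting is usually smaller.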
