Enhanced credit card fraud detection based on SVM-recursive feature elimination and hyper-parameters optimization

Abstract With the growth of online shopping, Credit Card Fraud (CCF) comes out as a serious menace. For this end, the automatic and real-time fraud detection field calls for several studies. The recent ones use many Machine Learning (ML) methods and techniques due to their beneficial characteristics to build a good fitting model to catch fraudulent transactions. The purpose of this paper is to develop a new model based on a hybrid approach for Credit Card Fraud Detection (CCFD). The proposed model, compared to previous studies, shows its strong ability to identify fraud transactions. Precisely, the robustness of our model is built by combining the strength of three sub-methods; the Recursive Feature Elimination (RFE) for selecting the most useful predictive features, the GridSearchCV for Hyper-Parameters Optimization (HPO) and the Synthetic Minority Oversampling (SMOTE) to overcome the imbalanced data problem. The experimentations of our model, on many real-world data sets, gives the best results in terms of efficiency and effectiveness.

[1]  Taghi M. Khoshgoftaar,et al.  Survey on deep learning with class imbalance , 2019, J. Big Data.

[2]  Francisca Nonyelum Ogwueleka DATA MINING APPLICATION IN CREDIT CARD FRAUD DETECTION SYSTEM , 2011 .

[3]  Mohamad Zamini,et al.  Credit Card Fraud Detection using autoencoder based clustering , 2018, 2018 9th International Symposium on Telecommunications (IST).

[4]  Fernando Jiménez,et al.  Multi-objective evolutionary feature selection for online sales forecasting , 2017, Neurocomputing.

[5]  Huan Liu,et al.  Feature Selection for Clustering: A Review , 2018, Data Clustering: Algorithms and Applications.

[6]  Chee Peng Lim,et al.  Credit Card Fraud Detection Using AdaBoost and Majority Voting , 2019, IEEE Access.

[7]  K. Ramakalyani,et al.  Fraud Detection of Credit Card Payment System by Genetic Algorithm , 2012 .

[8]  David J. Hand,et al.  Statistical fraud detection: A review , 2002 .

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Niall M. Adams,et al.  Plastic card fraud detection using peer group analysis , 2008, Adv. Data Anal. Classif..

[11]  Phimphaka Taninpong,et al.  Anomaly Detection based on GS-OCSVM Classification , 2020, 2020 12th International Conference on Knowledge and Smart Technology (KST).

[12]  Bogdan Zagajewski,et al.  Comparison of support vector machine, random forest and neural network classifiers for tree species classification on airborne hyperspectral APEX images , 2017 .

[13]  Naoufal Rtayli,et al.  Selection Features and Support Vector Machine for Credit Card Risk Identification , 2020 .

[14]  M Syeda,et al.  Parallel granular neural networks for fast credit card fraud detection , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[15]  Robert P. W. Duin,et al.  Uniform Object Generation for Optimizing One-class Classifiers , 2002, J. Mach. Learn. Res..

[16]  David Zhang,et al.  Feature selection and analysis on correlated gas sensor data with recursive feature elimination , 2015 .

[17]  Yue Zhou,et al.  A hybrid semi-supervised approach for financial fraud detection , 2017, 2017 International Conference on Machine Learning and Cybernetics (ICMLC).

[18]  Rob J. Hyndman,et al.  A note on the validity of cross-validation for evaluating autoregressive time series prediction , 2018, Comput. Stat. Data Anal..

[19]  Naomi S. Altman,et al.  Points of Significance: Classification evaluation , 2016, Nature Methods.

[20]  Björn E. Ottersten,et al.  Cost Sensitive Credit Card Fraud Detection Using Bayes Minimum Risk , 2013, 2013 12th International Conference on Machine Learning and Applications.

[21]  Frédéric Oblé,et al.  Combining unsupervised and supervised learning in credit card fraud detection , 2019, Inf. Sci..

[22]  Roberto Saia,et al.  A Linear-dependence-based Approach to Design Proactive Credit Scoring Models , 2016, KDIR.

[23]  Gianluca Bontempi,et al.  Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization , 2018, International Journal of Data Science and Analytics.

[24]  Mehdi Shajari,et al.  Cost-sensitive payment card fraud detection based on dynamic random forest and k-nearest neighbors , 2018, Expert Syst. Appl..

[25]  Anthony Brabazon,et al.  Identifying online credit card fraud using Artificial Immune Systems , 2010, IEEE Congress on Evolutionary Computation.

[26]  Helmut Krcmar,et al.  Reducing false positives in fraud detection: Combining the red flag approach with process mining , 2018, Int. J. Account. Inf. Syst..

[27]  Jun Zhang,et al.  Comprehensive analysis of network traffic data , 2018, Concurr. Comput. Pract. Exp..

[28]  Mohammad Kazem Akbari,et al.  Credit cards fraud detection by negative selection algorithm on hadoop (To reduce the training time) , 2013, The 5th Conference on Information and Knowledge Technology.

[29]  Roberto Saia A Discrete Wavelet Transform Approach to Fraud Detection , 2017, NSS.

[30]  Maumita Bhattacharya,et al.  Intelligent Financial Fraud Detection: A Comprehensive Review , 2015 .

[31]  Vadlamani Ravi,et al.  Credit Card Fraud Detection using Big Data Analytics: Use of PSOAANN based One-Class Classification , 2016, ICIA.

[32]  Varun Goel,et al.  DataCan: Robust Approach for Genome Cancer Data Analysis , 2019, Data Management, Analytics and Innovation.

[33]  Jayesh Chaudhary,et al.  A Survey on Credit Card Fraud Detection Using Machine Learning , 2018, 2018 2nd International Conference on Trends in Electronics and Informatics (ICOEI).

[34]  Syed Zainudeen Mohd Said,et al.  A Subset Feature Elimination Mechanism for Intrusion Detection System , 2016 .

[35]  Shweta Taneja,et al.  Application of Balancing Techniques with Ensemble Approach for Credit Card Fraud Detection , 2019, 2019 International Conference on Computing, Power and Communication Technologies (GUCON).

[36]  Peter Beling,et al.  Horse race analysis in credit card fraud—deep learning, logistic regression, and Gradient Boosted Tree , 2017, 2017 Systems and Information Engineering Design Symposium (SIEDS).

[37]  Longbing Cao,et al.  Effective detection of sophisticated online banking fraud on extremely imbalanced data , 2012, World Wide Web.

[38]  ZainalAnazida,et al.  Fraud detection system , 2016 .

[39]  Jie Yang,et al.  Kernelized support vector machine with deep learning: An efficient approach for extreme multiclass dataset , 2017, Pattern Recognit. Lett..

[40]  Bartosz Krawczyk,et al.  Learning from imbalanced data: open challenges and future directions , 2016, Progress in Artificial Intelligence.

[41]  Anurag Jain,et al.  Adaptive Credit Card Fraud Detection Techniques Based on Feature Selection Method , 2019 .

[42]  Peter Beling,et al.  Deep learning detecting fraud in credit card transactions , 2018, 2018 Systems and Information Engineering Design Symposium (SIEDS).

[43]  Sara Makki,et al.  An Experimental Study With Imbalanced Classification Approaches for Credit Card Fraud Detection , 2019, IEEE Access.

[44]  William N. Robinson,et al.  Sequential fraud detection for prepaid cards using hidden Markov model divergence , 2018, Expert Syst. Appl..

[45]  Priyanka Kumari,et al.  Analysis of Credit Card Fraud Detection Using Fusion Classifiers , 2018, Advances in Intelligent Systems and Computing.

[46]  P. Bentley,et al.  Fuzzy Darwinian Detection of Credit Card Fraud , 2000 .

[47]  Zeeshan Ali Rana,et al.  Evaluating Performance of Software Defect Prediction Models Using Area Under Precision-Recall Curve (AUC-PR) , 2019, 2019 2nd International Conference on Advancements in Computational Sciences (ICACS).

[48]  J. L. Peterson,et al.  Deep Neural Network Initialization With Decision Trees , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[49]  Bernhard Y. Renard,et al.  Recursive feature elimination in random forest classification supports nanomaterial grouping , 2019, NanoImpact.

[50]  Abhinav Srivastava,et al.  Credit Card Fraud Detection Using Hidden Markov Model , 2008, IEEE Transactions on Dependable and Secure Computing.

[51]  Diego Reforgiato Recupero,et al.  Fraud detection for E-commerce transactions by employing a prudential Multiple Consensus model , 2019, J. Inf. Secur. Appl..

[52]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[53]  Pourya Shamsolmoali,et al.  Application of Credit Card Fraud Detection: Based on Bagging Ensemble Classifier , 2015 .

[54]  Djamila Aouada,et al.  Feature engineering strategies for credit card fraud detection , 2016, Expert Syst. Appl..

[55]  Michael Granitzer,et al.  Sequence classification for credit-card fraud detection , 2018, Expert Syst. Appl..

[56]  Naoufal Rtayli,et al.  Credit Card Risk Detection based on Feature-Filter and Fraud Identification , 2019, 2019 Third International Conference on Intelligent Computing in Data Sciences (ICDS).

[57]  Chungang Yan,et al.  Transaction Fraud Detection Based on Total Order Relation and Behavior Diversity , 2018, IEEE Transactions on Computational Social Systems.

[58]  S. Benson Edwin Raj,et al.  Analysis on credit card fraud detection methods , 2011 .

[59]  Pourya Shamsolmoali,et al.  Boosting prediction performance on imbalanced dataset , 2018, Int. J. Inf. Commun. Technol..

[60]  Rameshwar Pratap,et al.  Ensemble learning for credit card fraud detection , 2018, COMAD/CODS.

[61]  Damminda Alahakoon,et al.  Minority report in fraud detection: classification of skewed data , 2004, SKDD.

[62]  Alfredo De Santis,et al.  Using generative adversarial networks for improving classification effectiveness in credit card fraud detection , 2017, Inf. Sci..

[63]  Ekrem Duman,et al.  Detecting credit card fraud by Modified Fisher Discriminant Analysis , 2015, Expert Syst. Appl..

[64]  Roberto Saia,et al.  Evaluating Credit Card Transactions in the Frequency Domain for a Proactive Fraud Detection Approach , 2017, SECRYPT.

[65]  Samrat Kumar Dey,et al.  Flow Based Anomaly Detection in Software Defined Networking: A Deep Learning Approach With Feature Selection Method , 2018, 2018 4th International Conference on Electrical Engineering and Information & Communication Technology (iCEEiCT).

[66]  Pradeep Ray,et al.  Artificial immune systems for the detection of credit card fraud: an architecture, prototype and preliminary results , 2012, Inf. Syst. J..

[67]  Changjun Jiang,et al.  Random forest for credit card fraud detection , 2018, 2018 IEEE 15th International Conference on Networking, Sensing and Control (ICNSC).

[68]  WestJarrod,et al.  Intelligent financial fraud detection , 2016 .

[69]  Jie Yang,et al.  A Novel Strategy for Mining Highly Imbalanced Data in Credit Card Transactions , 2017 .

[70]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[71]  Anazida Zainal,et al.  Fraud detection system: A survey , 2016, J. Netw. Comput. Appl..

[72]  R. C. Suganthe,et al.  Recursive Feature Elimination and Clustering Technique for Network Anomaly Detection , 2018 .

[73]  Matthijs Douze,et al.  Deep Clustering for Unsupervised Learning of Visual Features , 2018, ECCV.

[74]  Prateek Kulkarni,et al.  Credit Card Fraud Prediction And Detection using Artificial Neural Network And Self-Organizing Maps , 2019, 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC).

[75]  Luis Vergara,et al.  New Perspectives of Pattern Recognition for Automatic Credit Card Fraud Detection , 2018 .

[76]  M. Ufuk Çaglayan,et al.  Hybrid approaches for detecting credit card fraud , 2017, Expert Syst. J. Knowl. Eng..

[77]  Adriano M. Pereira,et al.  A customized classification algorithm for credit card fraud detection , 2018, Eng. Appl. Artif. Intell..

[78]  Roger Xu,et al.  Model Selection for Anomaly Detection in Wireless Ad Hoc Networks , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[79]  Ekrem Duman,et al.  Detecting credit card fraud by genetic algorithm and scatter search , 2011, Expert Syst. Appl..

[80]  Cesare Alippi,et al.  Credit card fraud detection and concept-drift adaptation with delayed supervised information , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[81]  Addisson Salazar,et al.  Semi-Supervised Learning For Imbalanced Classification Of Credit Card Transaction , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[82]  Debashree Devi,et al.  A Cost-sensitive weighted Random Forest Technique for Credit Card Fraud Detection , 2019, 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[83]  Ayahiko Niimi,et al.  Detection of fraud use of credit card by extended VFDT , 2011, 2011 World Congress on Internet Security (WorldCIS-2011).

[84]  R.K. Jeyauthmi Rha,et al.  Recursive Feature Elimination and Clustering Technique for Network Anomaly Detection , 2018, 2018 International Conference on Current Trends towards Converging Technologies (ICCTCT).

[85]  Addisson Salazar,et al.  Signal processing on graphs for improving automatic credit card fraud detection , 2017, 2017 International Carnahan Conference on Security Technology (ICCST).