Cost-sensitive Risk Induced Bayesian Inference Bagging (RIBIB) for credit card fraud detection

Abstract Credit card fraud represents one of the biggest threats for organizations due to the probability of huge losses associated with them. This paper presents a cost-sensitive Risk Induced Bayesian Inference Bagging model, RIBIB, for credit card fraud detection. RIBIB proposes a novel bagging architecture incorporating a constrained bag creation method, a Risk Induced Bayesian Inference method as a base learner and a cost-sensitive weighted voting combiner. Experiments on Brazilian Bank data indicate 1.04–1.5 times reduced cost. Experiments on UCSD-FICO data exhibit robustness of the model in handling unseen data without any need for domain specific parameter fine-tuning.

[1]  Sharon Bertsch McGrayne,et al.  The Theory That Would Not Die: How Bayes' Rule Cracked the Enigma Code, Hunted Down Russian Submarines, and Emerged Triumphant from Two Centuries of Controversy , 2011 .

[2]  Francisco Herrera,et al.  An insight into imbalanced Big Data classification: outcomes and challenges , 2017 .

[3]  Yijing Li,et al.  Learning from class-imbalanced data: Review of methods and applications , 2017, Expert Syst. Appl..

[4]  Cesare Alippi,et al.  Credit card fraud detection and concept-drift adaptation with delayed supervised information , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[5]  Douglas L. Reilly,et al.  Credit card fraud detection with a neural-network , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[6]  Bernd Freisleben,et al.  CARDWATCH: a neural network based database mining system for credit card fraud detection , 1997, Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr).

[7]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[8]  D. Hand,et al.  Unsupervised Profiling Methods for Fraud Detection , 2002 .

[9]  U. Srinivasulu Reddy,et al.  Modelling a stable classifier for handling large scale data with noise and imbalance , 2017, 2017 International Conference on Computational Intelligence in Data Science(ICCIDS).

[10]  Xuelong Li,et al.  Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Durga Toshniwal,et al.  Online sparse class imbalance learning on big data , 2016, Neurocomputing.

[12]  Gianluca Bontempi,et al.  Learned lessons in credit card fraud detection from a practitioner perspective , 2014, Expert Syst. Appl..

[13]  Björn E. Ottersten,et al.  Improving Credit Card Fraud Detection with Calibrated Probabilities , 2014, SDM.

[14]  Eric Bauer,et al.  An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants , 1999, Machine Learning.

[15]  Stan Matwin,et al.  Learning When Negative Examples Abound , 1997, ECML.

[16]  Xin Yao,et al.  Diversity analysis on imbalanced data sets by using ensemble models , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[17]  Pourya Shamsolmoali,et al.  Application of Credit Card Fraud Detection: Based on Bagging Ensemble Classifier , 2015 .

[18]  João P. P. Gomes,et al.  Ensemble of Efficient Minimal Learning Machines for Classification and Regression , 2017, Neural Processing Letters.

[19]  Björn E. Ottersten,et al.  Example-dependent cost-sensitive decision trees , 2015, Expert Syst. Appl..

[20]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[21]  H. Kashima,et al.  Roughly balanced bagging for imbalanced data , 2009 .

[22]  Salvatore J. Stolfo,et al.  Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection , 1998, KDD.

[23]  Rüdiger W. Brause,et al.  Neural data mining for credit card fraud detection , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[24]  Mohsen Rohani,et al.  Cost sensitive modeling of credit card fraud using neural network strategy , 2016, 2016 2nd International Conference of Signal Processing and Intelligent Systems (ICSPIS).

[25]  José R. Dorronsoro,et al.  Neural fraud detection in credit card operations , 1997, IEEE Trans. Neural Networks.

[26]  Joaquin Vanschoren,et al.  Case study on bagging stable classifiers for data streams , 2015 .

[27]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[28]  Giorgio Maria Di Nunzio A new decision to take for cost-sensitive Naïve Bayes classifiers , 2014, Inf. Process. Manag..

[29]  Björn E. Ottersten,et al.  Example-Dependent Cost-Sensitive Logistic Regression for Credit Scoring , 2014, 2014 13th International Conference on Machine Learning and Applications.

[30]  Maumita Bhattacharya,et al.  An investigation on experimental issues in financial fraud mining , 2016, 2016 IEEE 11th Conference on Industrial Electronics and Applications (ICIEA).

[31]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[32]  Mohammad Kazem Akbari,et al.  A novel model for credit card fraud detection using Artificial Immune Systems , 2014, Appl. Soft Comput..

[33]  Djamila Aouada,et al.  Feature engineering strategies for credit card fraud detection , 2016, Expert Syst. Appl..

[34]  Salvatore J. Stolfo,et al.  Distributed data mining in credit card fraud detection , 1999, IEEE Intell. Syst..

[35]  Björn E. Ottersten,et al.  Cost Sensitive Credit Card Fraud Detection Using Bayes Minimum Risk , 2013, 2013 12th International Conference on Machine Learning and Applications.

[36]  Alair Pereira do Lago,et al.  Credit Card Fraud Detection with Artificial Immune System , 2008, ICARIS.

[37]  Doaa Hassan,et al.  The Impact of False Negative Cost on the Performance of Cost Sensitive Learning Based on Bayes Minimum Risk: A Case Study in Detecting Fraudulent Transactions , 2017 .

[38]  Masoumeh Zareapoor,et al.  FraudMiner: A Novel Credit Card Fraud Detection Model Based on Frequent Itemset Mining , 2014, TheScientificWorldJournal.