Random forest for credit card fraud detection

Credit card fraud occurs frequently and results in huge financial losses. Criminals can use techniques such as Trojans or phishing to steal other people's credit card information. An effective fraud detection method is therefore important, since it can identify fraud promptly when a criminal uses a stolen card to make purchases. One approach is to apply machine learning techniques to historical transaction data, including both normal and fraudulent transactions, to obtain normal and fraud behavior features, and then to use these features to check whether a transaction is fraudulent. In this paper, two kinds of random forests are used to learn the behavior features of normal and abnormal transactions. We compare the two random forests, which differ in their base classifiers, and analyze their performance on credit card fraud detection. The data used in our experiments come from an e-commerce company in China.
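The comparison described above can be illustrated with a minimal, standard-library Python sketch. It is not the authors' implementation: the abstract does not name the two base classifiers, so the sketch assumes, purely for illustration, that they differ in split criterion (Gini impurity versus entropy). The three features (amount, hour of day, transactions in the last day), the synthetic data generator, and all function names are hypothetical stand-ins for the real e-commerce data.

```python
import math
import random
from collections import Counter

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def entropy(labels):
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def build_tree(rows, labels, impurity, rng, max_depth=4, n_features=2):
    """Grow one tree; every node considers a random subset of the features."""
    majority = Counter(labels).most_common(1)[0][0]
    if max_depth == 0 or len(set(labels)) == 1:
        return majority  # leaf: predicted class label
    best = None
    for f in rng.sample(range(len(rows[0])), n_features):
        for t in sorted({r[f] for r in rows}):
            left = [i for i, r in enumerate(rows) if r[f] <= t]
            right = [i for i, r in enumerate(rows) if r[f] > t]
            if not left or not right:
                continue
            # Weighted impurity of the two children; lower is better.
            score = (len(left) * impurity([labels[i] for i in left])
                     + len(right) * impurity([labels[i] for i in right])) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t, left, right)
    if best is None:
        return majority
    _, f, t, left, right = best
    grow = lambda idx: build_tree([rows[i] for i in idx], [labels[i] for i in idx],
                                  impurity, rng, max_depth - 1, n_features)
    return (f, t, grow(left), grow(right))

def predict_tree(node, row):
    while isinstance(node, tuple):  # internal nodes are tuples, leaves are labels
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def train_forest(rows, labels, impurity, n_trees=11, seed=0):
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]  # bootstrap sample
        forest.append(build_tree([rows[i] for i in idx],
                                 [labels[i] for i in idx], impurity, rng))
    return forest

def predict_forest(forest, row):
    # Majority vote over the trees.
    return Counter(predict_tree(t, row) for t in forest).most_common(1)[0][0]

def evaluate(forest, rows, labels):
    preds = [predict_forest(forest, r) for r in rows]
    tp = sum(p == 1 and y == 1 for p, y in zip(preds, labels))
    fp = sum(p == 1 and y == 0 for p, y in zip(preds, labels))
    fn = sum(p == 0 and y == 1 for p, y in zip(preds, labels))
    accuracy = sum(p == y for p, y in zip(preds, labels)) / len(labels)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall

def make_synthetic(n, seed):
    """Hypothetical data: (amount, hour, tx_last_day); label 1 = fraud."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n):
        fraud = rng.random() < 0.2
        rows.append((rng.gauss(900, 150) if fraud else rng.gauss(100, 50),
                     rng.gauss(3, 1.5) if fraud else rng.gauss(14, 4),
                     rng.gauss(8, 2) if fraud else rng.gauss(2, 1)))
        labels.append(int(fraud))
    return rows, labels

train_rows, train_labels = make_synthetic(120, seed=1)
test_rows, test_labels = make_synthetic(80, seed=2)

results = {}
for name, impurity in (("gini", gini), ("entropy", entropy)):
    forest = train_forest(train_rows, train_labels, impurity)
    results[name] = evaluate(forest, test_rows, test_labels)
    print(name, "accuracy=%.3f precision=%.3f recall=%.3f" % results[name])
```

Because fraud is usually rare relative to normal transactions, the sketch reports precision and recall on the fraud class alongside accuracy; accuracy alone can look high even when most fraud is missed. The synthetic classes here are deliberately easy to separate, so the numbers say nothing about performance on real transaction data.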
