A data mining based system for credit-card fraud detection in e-tail

Credit-card fraud leads to billions of dollars in losses for online merchants. With the development of machine learning algorithms, researchers have been finding increasingly sophisticated ways to detect fraud, but practical implementations are rarely reported. We describe the development and deployment of a fraud detection system in a large e-tail merchant. The paper explores the combination of manual and automatic classification, gives insights into the complete development process and compares different machine learning methods. The paper can thus help researchers and practitioners to design and implement data mining based systems for fraud detection or similar problems. This project has contributed not only with an automatic system, but also with insights to the fraud analysts for improving their manual revision process, which resulted in an overall superior performance. A case study of credit-card fraud detection in an e-tail company is presented.The design and implementation of a fraud detection system is reported.A practical perspective on the complete development process is given.The combination of an automatic classifier with manual revision is explored.Different supervised learning methods are compared.

[1]  Corinna Cortes,et al.  Communities of interest , 2001, Intell. Data Anal..

[2]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[3]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[4]  Douglas L. Reilly,et al.  Credit card fraud detection with a neural-network , 1994, 1994 Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences.

[5]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[6]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[7]  K. Ramakalyani,et al.  Fraud Detection of Credit Card Payment System by Genetic Algorithm , 2012 .

[8]  John Elder,et al.  Handbook of Statistical Analysis and Data Mining Applications , 2009 .

[9]  M. Krivko,et al.  A hybrid model for plastic card fraud detection systems , 2010, Expert Syst. Appl..

[10]  Tej Paul Bhatla,et al.  Understanding Credit Card Frauds , 2003 .

[11]  Salvatore J. Stolfo,et al.  Toward Scalable Learning with Non-Uniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection , 1998, KDD.

[12]  Anthony Brabazon,et al.  Identifying online credit card fraud using Artificial Immune Systems , 2010, IEEE Congress on Evolutionary Computation.

[13]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[14]  Joos Vandewalle,et al.  A hybrid system for fraud detection in mobile communications , 1999, ESANN.

[15]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[16]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[17]  Jon T. S. Quah,et al.  Real-time credit card fraud detection using computational intelligence , 2008, Expert Syst. Appl..

[18]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[19]  Niall M. Adams,et al.  Transaction aggregation as a strategy for credit card fraud detection , 2009, Data Mining and Knowledge Discovery.

[20]  Shiguo Wang,et al.  A Comprehensive Survey of Data Mining-Based Accounting-Fraud Detection Research , 2010, 2010 International Conference on Intelligent Computation Technology and Automation.

[21]  Kurt Hornik,et al.  The support vector machine under test , 2003, Neurocomputing.

[22]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[23]  Ekrem Duman,et al.  Detecting credit card fraud by genetic algorithm and scatter search , 2011, Expert Syst. Appl..

[24]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[25]  Gianluca Bontempi,et al.  Learned lessons in credit card fraud detection from a practitioner perspective , 2014, Expert Syst. Appl..

[26]  David A. Montague Essentials of Online payment Security and Fraud Prevention , 2010 .

[27]  Tom Fawcett,et al.  Adaptive Fraud Detection , 1997, Data Mining and Knowledge Discovery.

[28]  Abhinav Srivastava,et al.  Credit Card Fraud Detection Using Hidden Markov Model , 2008, IEEE Transactions on Dependable and Secure Computing.

[29]  J. Christopher Westland,et al.  Employing transaction aggregation strategy to detect credit card fraud , 2012, Expert Syst. Appl..