Credit card fraud detection through parenclitic network analysis

The detection of frauds in credit card transactions is a major topic in financial research, of profound economic implications. While this has hitherto been tackled through data analysis techniques, the resemblances between this and other problems, like the design of recommendation systems and of diagnostic/prognostic medical tools, suggest that a complex network approach may yield important benefits. In this paper we present a first hybrid data mining/complex network classification algorithm, able to detect illegal instances in a real card transaction data set. It is based on a recently proposed network reconstruction algorithm that allows creating representations of the deviation of one instance from a reference group. We show how the inclusion of features extracted from the network data representation improves the score obtained by a standard, neural network-based classification algorithm and additionally how this combined approach can outperform a commercial fraud detection system in specific operation niches. Beyond these specific results, this contribution represents a new example on how complex networks and data mining can be integrated as complementary tools, with the former providing a view to data beyond the capabilities of the latter.

[1]  Tung-Shou Chen,et al.  A new binary support vector system for increasing detection rate of credit card fraud , 2006, Int. J. Pattern Recognit. Artif. Intell..

[2]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[3]  Rudolf Kruse,et al.  Fuzzy Systems , 2010, Encyclopedia of Machine Learning.

[4]  Reza Ebrahimi Atani,et al.  A Survey of Credit Card Fraud Detection Techniques: Data and Technique Oriented Perspective , 2016, ArXiv.

[5]  Bernd Freisleben,et al.  CARDWATCH: a neural network based database mining system for credit card fraud detection , 1997, Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering (CIFEr).

[6]  P. Bentley,et al.  Fuzzy Darwinian Detection of Credit Card Fraud , 2000 .

[7]  Tej Paul Bhatla,et al.  Understanding Credit Card Frauds , 2003 .

[8]  Rüdiger W. Brause,et al.  Neural data mining for credit card fraud detection , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[9]  Shilpa Chakravartula,et al.  Complex Networks: Structure and Dynamics , 2014 .

[10]  Vladimir Zaslavsky,et al.  Credit Card Fraud Detection Using Self-Organizing Maps , 2006 .

[11]  Massimiliano Zanin,et al.  Optimizing Functional Network Representation of Multivariate Time Series , 2012, Scientific Reports.

[12]  Ekrem Duman,et al.  Detecting credit card fraud by decision trees and support vector machines , 2011 .

[13]  K. Ramakalyani,et al.  Fraud Detection of Credit Card Payment System by Genetic Algorithm , 2012 .

[14]  Manoj Kumar,et al.  Genetic Algorithm: Review and Application , 2010 .

[15]  Carlos Serrano-Cinca,et al.  Self organizing neural networks for financial diagnosis , 1996, Decision Support Systems.

[16]  Saeed Badshah,et al.  Vibration Analysis of an Ocean Current Turbine Blade , 2012 .

[17]  Ekrem Duman,et al.  Detecting credit card fraud by genetic algorithm and scatter search , 2011, Expert Syst. Appl..

[18]  Douglas M. Hawkins,et al.  The Problem of Overfitting , 2004, J. Chem. Inf. Model..

[19]  Jacek M. Zurada,et al.  Introduction to artificial neural systems , 1992 .

[20]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[21]  Ernestina Menasalvas Ruiz,et al.  Information content: Assessing meso-scale structures in complex networks , 2014, ArXiv.

[22]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[23]  Maumita Bhattacharya,et al.  Intelligent Financial Fraud Detection: A Comprehensive Review , 2015 .

[24]  Emanuel Mineda Carneiro,et al.  Cluster Analysis and Artificial Neural Networks: A Case Study in Credit Card Fraud Detection , 2015, 2015 12th International Conference on Information Technology - New Generations.

[25]  Teuvo Kohonen,et al.  The self-organizing map , 1990, Neurocomputing.

[26]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[27]  C. Wilson,et al.  Terrorist Capabilities for Cyberattack: Overview and Policy Issues , 2005 .

[28]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[29]  R. Patidar,et al.  Credit Card Fraud Detection Using Neural Network , 2011 .

[30]  Massimiliano Zanin,et al.  The topology of card transaction money flows , 2016, 1605.04938.

[31]  Paula Fritzsche Tools in Artificial Intelligence , 2008 .

[32]  Qibei Lu,et al.  Research on Credit Card Fraud Detection Model Based on Class Weighted Support Vector Machine , 2011 .

[33]  Ernestina Menasalvas Ruiz,et al.  Combining complex networks and data mining: why and how , 2016, bioRxiv.

[34]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[35]  M Syeda,et al.  Parallel granular neural networks for fast credit card fraud detection , 2002, 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE'02. Proceedings (Cat. No.02CH37291).

[36]  V Latora,et al.  Efficient behavior of small-world networks. , 2001, Physical review letters.

[37]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[38]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[39]  Neha Sethi,et al.  A Revived Survey of Various Credit Card Fraud Detection Techniques , 2014 .

[40]  System Sciences , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[41]  Massimiliano Zanin,et al.  Parenclitic networks: uncovering new functions in biological data , 2014, Scientific Reports.

[42]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[43]  Chonghui Guo,et al.  Entropy optimization of scale-free networks’ robustness to random failures , 2005, cond-mat/0506725.

[44]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[45]  Thorsten Meinl,et al.  KNIME - the Konstanz information miner: version 2.0 and beyond , 2009, SKDD.

[46]  S. Boccaletti,et al.  Complex networks analysis of obstructive nephropathy data. , 2011, Chaos.

[47]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[48]  S. Strogatz Exploring complex networks , 2001, Nature.

[49]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[50]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..