Generating Personalized Algorithms to Learn Bayesian Network Classifiers for Fraud Detection in Web Transactions

The volume of electronic transactions has raised a lot in last years, mainly due to the popularization of e-commerce. We also observe a significant increase in the number of fraud cases, resulting in billions of dollars losses each year worldwide. Therefore, it is essential to develop and apply techniques that can assist in fraud detection. In this direction, we propose an evolutionary algorithm to automatically build Bayesian Network Classifiers (BNCs) tailored to solve the problem of detecting fraudulent transactions. BNCs are powerful classification models that can deal well with data features, missing data and uncertainty. In order to evaluate the techniques, we adopt an economic efficiency metric and apply them to our real dataset. Our results show good performance in fraud detection, presenting gains up to 17%, compared to the actual scenario of the company.

[1]  Adriano M. Pereira,et al.  Characterizing and Evaluating Fraud in Electronic Transactions , 2012, 2012 Eighth Latin American Web Congress.

[2]  Erland Jonsson,et al.  How to systematically classify computer security intrusions , 1997, S&P 1997.

[3]  Alex Guimarães Cardoso de Sá Evolução automática de algoritmos de redes bayesianas de classificação , 2014 .

[4]  David Heckerman,et al.  Bayesian Networks for Data Mining , 2004, Data Mining and Knowledge Discovery.

[5]  Ioana Vasiu,et al.  Dissecting computer fraud: from definitional issues to a taxonomy , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[6]  B Thomas,et al.  A COMPARISON OF CONVENTIONAL AND ONLINE FRAUD , 2004 .

[7]  Alex Alves Freitas,et al.  Extending the ABC-Miner Bayesian Classification Algorithm , 2013, NICSO.

[8]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[9]  Risto Miikkulainen,et al.  Evolving Neural Networks through Augmenting Topologies , 2002, Evolutionary Computation.

[10]  Krzysztof J. Cios,et al.  New synthesis of bayesian network classifiers and cardiac spect image interpretation , 1999 .

[11]  Christos Faloutsos,et al.  Netprobe: a fast and scalable system for fraud detection in online auction networks , 2007, WWW '07.

[12]  Alex Alves Freitas,et al.  Automatic Design of Decision-Tree Algorithms with Evolutionary Algorithms , 2013, Evolutionary Computation.

[13]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[14]  Ian Witten,et al.  Data Mining , 2000 .

[15]  Vinicius Almendra,et al.  Finding the needle: A risk-based ranking of product listings at online auction sites for non-delivery fraud prediction , 2013, Expert Syst. Appl..

[16]  Gisele L. Pappa,et al.  Towards a method for automatically evolving bayesian network classifiers , 2013, GECCO.

[17]  Qiang Shen,et al.  Learning Bayesian networks: approaches and issues , 2011, The Knowledge Engineering Review.

[18]  Marden Neubert,et al.  Fraud detection in reputation systems in e-markets using logistic regression and stepwise optimization , 2010 .

[19]  Kevin Leyton-Brown,et al.  Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.

[20]  Gisele L. Pappa,et al.  Automating the Design of Data Mining Algorithms: An Evolutionary Computation Approach , 2009 .

[21]  Xin Yao,et al.  Evolving artificial neural networks , 1999, Proc. IEEE.

[22]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[23]  Gonzalo Álvarez,et al.  A new taxonomy of Web attacks suitable for efficient encoding , 2003, Comput. Secur..

[24]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[25]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..