Unsupervised learning for robust Bitcoin fraud detection

The rampant absorption of Bitcoin as a cryptographic currency, along with rising cybercrime activities, warrants utilization of anomaly detection to identify potential fraud. Anomaly detection plays a pivotal role in data mining since most outlying points contain crucial information for further investigation. In the financial world which the Bitcoin network is part of by default, anomaly detection amounts to fraud detection. This paper investigates the use of trimmed k-means, that is capable of simultaneous clustering of objects and fraud detection in a multivariate setup, to detect fraudulent activity in Bitcoin transactions. The proposed approach detects more fraudulent transactions than similar studies or reports on the same dataset.

[1]  J. A. Cuesta-Albertos,et al.  Trimmed $k$-means: an attempt to robustify quantizers , 1997 .

[2]  A. Gordaliza,et al.  Robustness Properties of k Means and Trimmed k Means , 1999 .

[3]  Adrian E. Raftery,et al.  MCLUST Version 3: An R Package for Normal Mixture Modeling and Model-Based Clustering , 2006 .

[4]  S. Nakamoto,et al.  Bitcoin: A Peer-to-Peer Electronic Cash System , 2008 .

[5]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[6]  Christos Faloutsos,et al.  oddball: Spotting Anomalies in Weighted Graphs , 2010, PAKDD.

[7]  Luis Angel García-Escudero,et al.  Exploring the number of groups in robust model-based clustering , 2011, Stat. Comput..

[8]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[9]  Yong Hu,et al.  The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature , 2011, Decis. Support Syst..

[10]  Anthony C. Davison,et al.  High-Dimensional Bayesian Clustering with Variable Selection: The R Package bclust , 2012 .

[11]  Luis Angel García-Escudero,et al.  tclust: An R Package for a Trimming Approach to Cluster Analysis , 2012 .

[12]  Deepak Zambre Analysis of Bitcoin Network Dataset for Fraud , 2013 .

[13]  Ajay Rana,et al.  K-means with Three different Distance Metrics , 2013 .

[14]  A. Blundell-Wignall The Bitcoin Question , 2014 .

[15]  David A. Clifton,et al.  A review of novelty detection , 2014, Signal Process..

[16]  Danai Koutra,et al.  Graph based anomaly detection and description: a survey , 2014, Data Mining and Knowledge Discovery.

[17]  Dr. M. Prabakaran,et al.  An Multi-Variant Relational Model for Money Laundering Identification using Time Series Data Set , 2014 .

[18]  Jeremy Clark,et al.  SoK: Research Perspectives and Challenges for Bitcoin and Cryptocurrencies , 2015, 2015 IEEE Symposium on Security and Privacy.

[19]  R. C. Maloumby-Baka,et al.  The Quest to Lower High Remittance Costs to Africa: A Brief Review of the Use of Mobile Banking and Bitcoins , 2015 .

[20]  Kirsten Schuettler,et al.  Migration and remittances : recent developments and outlook , 2015 .

[21]  Steven Lee,et al.  Anomaly Detection in Bitcoin Network Using Unsupervised Learning Methods , 2016, ArXiv.

[22]  Steven Lee,et al.  Anomaly Detection in the Bitcoin System - A Network Perspective , 2016, ArXiv.