The Complex Community Structure of the Bitcoin Address Correspondence Network

Bitcoin is built on a blockchain, an immutable decentralized ledger that allows entities (users) to exchange Bitcoins in a pseudonymous manner. Bitcoins are associated with alpha-numeric addresses and are transferred via transactions. Each transaction is composed of a set of input addresses (associated with unspent outputs received from previous transactions) and a set of output addresses (to which Bitcoins are transferred). Despite Bitcoin was designed with anonymity in mind, different heuristic approaches exist to detect which addresses in a specific transaction belong to the same entity. By applying these heuristics, we build an Address Correspondence Network: in this representation, addresses are nodes are connected with edges if at least one heuristic detects them as belonging to the same entity. In this paper, we analyze for the first time the Address Correspondence Network and show it is characterized by a complex topology, signaled by a broad, skewed degree distribution and a power-law component size distribution. Using a large-scale dataset of addresses for which the controlling entities are known, we show that a combination of external data coupled with standard community detection algorithms can reliably identify entities. The complex nature of the Address Correspondence Network reveals that usage patterns of individual entities create statistical regularities; and that these regularities can be leveraged to more accurately identify entities and gain a deeper understanding of the Bitcoin economy as a whole.

[1]  Arvind Narayanan,et al.  BlockSci: Design and applications of a blockchain analysis platform , 2017, USENIX Security Symposium.

[2]  Zhao Yang,et al.  A Comparative Analysis of Community Detection Algorithms on Artificial Networks , 2016, Scientific Reports.

[3]  S. Nakamoto,et al.  Bitcoin: A Peer-to-Peer Electronic Cash System , 2008 .

[4]  Tālis J. Putniņš,et al.  Sex, Drugs, and Bitcoin: How Much Illegal Activity Is Financed through Cryptocurrencies? , 2019, The Review of Financial Studies.

[5]  S Vairachilai A Comparative Analysis of Community Detection Algorithms on Complex Network Based on Modularity , 2018 .

[6]  Christoph Fretter,et al.  The Unreasonable Effectiveness of Address Clustering , 2016, 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld).

[7]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[9]  L. Hubert,et al.  Comparing partitions , 1985 .

[10]  Matthijs J. Warrens,et al.  On the Equivalence of Cohen’s Kappa and the Hubert-Arabie Adjusted Rand Index , 2008, J. Classif..

[11]  Xin Liu,et al.  Evaluation of Community Detection Methods , 2018, IEEE Transactions on Knowledge and Data Engineering.

[12]  Thomas M. Cover,et al.  Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing) , 2006 .

[13]  Tālis J. Putniņš,et al.  Sex, Drugs, and Bitcoin: How Much Illegal Activity Is Financed Through Cryptocurrencies? , 2018, The Review of Financial Studies.

[14]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[16]  Jonas David Nick,et al.  Data-Driven De-Anonymization in Bitcoin , 2015 .

[17]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[18]  István Csabai,et al.  Do the Rich Get Richer? An Empirical Analysis of the Bitcoin Transaction Network , 2013, PloS one.

[19]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[20]  Michael S. Kester,et al.  Bitcoin Transaction Graph Analysis , 2015, ArXiv.

[21]  S A R A H M E I K L E J O H N,et al.  A Fistful of Bitcoins Characterizing Payments Among Men with No Names , 2013 .

[22]  Yan Luo,et al.  Do Bitcoin Users Really Care About Anonymity? An Analysis of the Bitcoin Transaction Graph , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[23]  Marco Alberto Javarone,et al.  From Bitcoin to Bitcoin Cash: a network analysis , 2018, CRYBLOCK@MobiSys.

[24]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[25]  César A. Hidalgo,et al.  Scale-free networks , 2008, Scholarpedia.

[26]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[27]  Alex Biryukov,et al.  Deanonymization and Linkability of Cryptocurrency Transactions Based on Network Analysis , 2019, 2019 IEEE European Symposium on Security and Privacy (EuroS&P).

[28]  Jie Luo,et al.  Heuristic-Based Address Clustering in Bitcoin , 2020, IEEE Access.

[29]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[30]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[31]  C. Tessone,et al.  Bitcoin Transaction Networks: An Overview of Recent Results , 2020, Frontiers in Physics.

[32]  T. Squartini,et al.  The evolving liaisons between the transaction networks of Bitcoin and its price dynamics , 2019, 1907.03577.

[33]  Julia Hirschberg,et al.  V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure , 2007, EMNLP.

[34]  Albert-László Barabási,et al.  Scale-free networks , 2008, Scholarpedia.

[35]  Bitcoin Proof of Stake: A Peer-to-Peer Electronic Cash System , 2020 .

[36]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[37]  Ravikiran Vatrapu,et al.  Breaking Bad: De-Anonymising Entity Types on the Bitcoin Blockchain Using Supervised Machine Learning , 2018, HICSS.

[38]  Alessandra Conversi,et al.  Comparative Analysis , 2009, Encyclopedia of Database Systems.

[39]  Yash Patel Deanonymizing Bitcoin Transaction: An Investigative Study On Large-Scale Graph Clustering , 2018 .

[40]  James Bailey,et al.  Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance , 2010, J. Mach. Learn. Res..

[41]  Maxim Panov,et al.  Automatic Bitcoin Address Clustering , 2017, 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA).