SteemOps: Extracting and Analyzing Key Operations in Steemit Blockchain-based Social Media Platform

Advancements in distributed ledger technologies are driving the rise of blockchain-based social media platforms such as Steemit, where users interact with each other in similar ways as conventional social networks. These platforms are autonomously managed by users using decentralized consensus protocols in a cryptocurrency ecosystem. The deep integration of social networks and blockchains in these platforms provides potential for numerous cross-domain research studies that are of interest to both the research communities. However, it is challenging to process and analyze large volumes of raw Steemit data as it requires specialized skills in both software engineering and blockchain systems and involves substantial efforts in extracting and filtering various types of operations. To tackle this challenge, we collect over 38 million blocks generated in Steemit during a 45 month time period from 2016/03 to 2019/11 and extract ten key types of operations performed by the users. The results generate SteemOps, a new dataset that organizes more than 900 million operations from Steemit into three sub-datasets namely (i) social-network operation dataset (SOD), (ii) witness-election operation dataset (WOD) and (iii) value-transfer operation dataset (VOD). We describe the dataset schema and its usage in detail and outline possible future research studies using SteemOps. SteemOps is designed to facilitate future research aimed at providing deeper insights on emerging blockchain-based social media platforms.

[1]  Filippo Menczer,et al.  The rise of social bots , 2014, Commun. ACM.

[2]  Jack Hessel,et al.  Science, AskScience, and BadScience: On the Coexistence of Highly Related Communities , 2016, ICWSM.

[3]  JooSeok Song,et al.  Trend of centralization in Bitcoin's distributed network , 2015, 2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD).

[4]  Ittay Eyal,et al.  The Miner's Dilemma , 2014, 2015 IEEE Symposium on Security and Privacy.

[5]  Gang Wang,et al.  Wisdom in the social crowd: an analysis of quora , 2013, WWW.

[6]  S. Nakamoto,et al.  Bitcoin: A Peer-to-Peer Electronic Cash System , 2008 .

[7]  Yongdae Kim,et al.  Impossibility of Full Decentralization in Permissionless Blockchains , 2019, AFT.

[8]  Tim Weninger,et al.  Consumers and Curators: Browsing and Voting Patterns on Reddit , 2017, IEEE Transactions on Computational Social Systems.

[9]  Beng Chin Ooi,et al.  BLOCKBENCH: A Framework for Analyzing Private Blockchains , 2017, SIGMOD Conference.

[10]  Balaji Palanisamy,et al.  Comparison of Decentralization in DPoS and PoW Blockchains , 2020, ICBC.

[11]  Seungwon (Eugene) Jeong Centralized Decentralization: Does Voting Matter? Simple Economics of the DPoS Blockchain Governance , 2020 .

[12]  Emin Gün Sirer,et al.  Majority Is Not Enough: Bitcoin Mining Is Vulnerable , 2013, Financial Cryptography.

[13]  Philipp Jovanovic,et al.  OmniLedger: A Secure, Scale-Out, Decentralized Ledger via Sharding , 2018, 2018 IEEE Symposium on Security and Privacy (SP).

[14]  Seungwon Shin,et al.  Cybercriminal Minds: An investigative study of cryptocurrency abuses in the Dark Web , 2019, NDSS.

[15]  Sarah Meiklejohn,et al.  Tracing Transactions Across Cryptocurrency Ledgers , 2018, USENIX Security Symposium.

[16]  Balaji Palanisamy,et al.  Incentivized Blockchain-based Social Media Platforms: A Case Study of Steemit , 2019, WebSci.

[17]  Kristina Lerman,et al.  Evidence of Online Performance Deterioration in User Sessions on Reddit , 2016, PloS one.

[18]  Hao Wang,et al.  Monoxide: Scale out Blockchains with Asynchronous Consensus Zones , 2019, NSDI.

[19]  J. Chung,et al.  Sustainable Growth and Token Economy Design: The Case of Steemit , 2018, Sustainability.

[20]  Emin Gün Sirer,et al.  Decentralization in Bitcoin and Ethereum Networks , 2018, Financial Cryptography.

[21]  Zibin Zheng,et al.  Traveling the token world: A graph analysis of Ethereum ERC20 token ecosystem , 2020, WWW.

[22]  Aggelos Kiayias,et al.  A Puff of Steem: Security Analysis of Decentralized Content Curation , 2018, Tokenomics.

[23]  Vitalik Buterin A NEXT GENERATION SMART CONTRACT & DECENTRALIZED APPLICATION PLATFORM , 2015 .

[24]  Jure Leskovec,et al.  Discovering value from community activity on focused question answering sites: a case study of stack overflow , 2012, KDD.

[25]  Mike Thelwall,et al.  Can social news websites pay for content and curation? The SteemIt cryptocurrency model , 2018, J. Inf. Sci..

[26]  Xifeng Zhao,et al.  Measuring Decentralization in Bitcoin and Ethereum using Multiple Metrics and Granularities , 2021, 2021 IEEE 37th International Conference on Data Engineering Workshops (ICDEW).

[27]  Greg Stoddard,et al.  Popularity and Quality in Social News Aggregators: A Study of Reddit and Hacker News , 2015, WWW.

[28]  Lillian Lee,et al.  All Who Wander: On the Prevalence and Characteristics of Multi-community Engagement , 2015, WWW.

[29]  Zibin Zheng,et al.  Market Manipulation of Bitcoin: Evidence from Mining the Mt. Gox Transaction Network , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.