Improving Robustness and Accuracy of Ponzi Scheme Detection on Ethereum Using Time-Dependent Features

The rapid development of blockchain has led to more and more funding pouring into the cryptocurrency market, which also attracted cybercriminals' interest in recent years. The Ponzi scheme, an old-fashioned fraud, is now popular on the blockchain, causing considerable financial losses to many crypto-investors. A few Ponzi detection methods have been proposed in the literature, most of which detect a Ponzi scheme based on its smart contract source code or opcode. The contract-code-based approach, while achieving very high accuracy, is not robust: first, the source codes of a majority of contracts on Ethereum are not available, and second, a Ponzi developer can fool a contract-code-based detection model by obfuscating the opcode or inventing a new profit distribution logic that cannot be detected (since these models were trained on existing Ponzi logics only). A transaction-based approach could improve the robustness of detection because transactions, unlike smart contracts, are harder to be manipulated. However, the current transaction-based detection models achieve fairly low accuracy. We address this gap in the literature by developing new detection models that rely only on the transactions, hence guaranteeing the robustness, and moreover, achieve considerably higher Accuracy, Precision, Recall, and F1-score than existing transaction-based models. This is made possible thanks to the introduction of novel time-dependent features that capture Ponzi behaviours characteristics derived from our comprehensive data analyses on Ponzi and non-Ponzi data from the XBlock-ETH repository

[1]  Yanmei Zhang,et al.  Detecting Ethereum Ponzi Schemes Based on Improved LightGBM Algorithm , 2022, IEEE Transactions on Computational Social Systems.

[2]  Shaojing Fu,et al.  Al-SPSD: Anti-leakage smart Ponzi schemes detection in blockchain , 2021, Inf. Process. Manag..

[3]  Xiapu Luo,et al.  SADPonzi: Detecting and Characterizing Ponzi Schemes in Ethereum Smart Contracts , 2021, Proc. ACM Meas. Anal. Comput. Syst..

[4]  Marco Di Francesco,et al.  Evaluating Machine-Learning Techniques for Detecting Smart Ponzi Schemes , 2021, 2021 IEEE/ACM 4th International Workshop on Emerging Trends in Software Engineering for Blockchain (WETSEB).

[5]  Zijiang Yang,et al.  Early Detection of Smart Ponzi Scheme Contracts Based on Behavior Forest Similarity , 2020, 2020 IEEE 20th International Conference on Software Quality, Reliability and Security (QRS).

[6]  Thar Baker,et al.  Blockchain-based privacy-preserving remote data integrity checking scheme for IoT information systems , 2020, Inf. Process. Manag..

[7]  Tsan-Ming Choi,et al.  Blockchain technology in supply chain operations: Applications, challenges and research opportunities , 2020, Transportation Research Part E: Logistics and Transportation Review.

[8]  V. Urovi,et al.  A Consent Model for Blockchain-Based Health Data Sharing Platforms , 2020, IEEE Access.

[9]  Shaojing Fu,et al.  Expose Your Mask: Smart Ponzi Schemes Detection on Blockchain , 2020, 2020 International Joint Conference on Neural Networks (IJCNN).

[10]  Primavera De Filippi,et al.  Blockchain as a confidence machine: The problem of trust & challenges of governance , 2020, Technology in Society.

[11]  Satchidananda Dehuri,et al.  Artificial Neural Network , 2020, Encyclopedia of GIS.

[12]  Jose A. Lozano,et al.  A Review on Outlier/Anomaly Detection in Time Series Data , 2020, ACM Comput. Surv..

[13]  Debiao He,et al.  DCAP: A Secure and Efficient Decentralized Conditional Anonymous Payment System Based on Blockchain , 2020, IEEE Transactions on Information Forensics and Security.

[14]  David Lizcano,et al.  A method for outlier detection based on cluster analysis and visual expert criteria , 2019, Expert Syst. J. Knowl. Eng..

[15]  Peilin Zheng,et al.  XBlock-ETH: Extracting and Exploring Blockchain Data From Ethereum , 2019, IEEE Open Journal of the Computer Society.

[16]  Anders Drachen,et al.  Ethereum Crypto-Games: Mechanics, Prevalence, and Gambling Similarities , 2019, CHI PLAY.

[17]  Yunjie Ge,et al.  Data Mining-Based Ethereum Fraud Detection , 2019, 2019 IEEE International Conference on Blockchain (Blockchain).

[18]  Catarina Ferreira Da Silva,et al.  Towards Blockchain Interoperability: Improving Video Games Data Exchange , 2019, 2019 IEEE International Conference on Blockchain and Cryptocurrency (ICBC).

[19]  Jeannette Paschen,et al.  How blockchain technologies impact your business model , 2019, Business Horizons.

[20]  Zibin Zheng,et al.  Exploiting Blockchain Data to Detect Smart Ponzi Schemes on Ethereum , 2019, IEEE Access.

[21]  Dimitrios Tzovaras,et al.  On the Design of a Blockchain-Based System to Facilitate Healthcare Data Sharing , 2018, 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/ 12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE).

[22]  Antonio Puliafito,et al.  Blockchain and IoT Integration: A Systematic Survey , 2018, Sensors.

[23]  Bernd Bischl,et al.  Time series anomaly detection based on shapelet learning , 2018, Comput. Stat..

[24]  Zibin Zheng,et al.  Detecting Ponzi Schemes on Ethereum: Towards Healthier Blockchain Technology , 2018, WWW.

[25]  Oscar Novo,et al.  Blockchain Meets IoT: An Architecture for Scalable Access Management in IoT , 2018, IEEE Internet of Things Journal.

[26]  Massimo Bartoletti,et al.  Data Mining for Detecting Bitcoin Ponzi Schemes , 2018, 2018 Crypto Valley Conference on Blockchain Technology (CVCBT).

[27]  Tyler Moore,et al.  Analyzing the Bitcoin Ponzi Scheme Ecosystem , 2018, Financial Cryptography Workshops.

[28]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[29]  Peng Jiang,et al.  A Survey on the Security of Blockchain Systems , 2017, Future Gener. Comput. Syst..

[30]  Gang Chen,et al.  Untangling Blockchain: A Data Processing View of Blockchain Systems , 2017, IEEE Transactions on Knowledge and Data Engineering.

[31]  Salil S. Kanhere,et al.  Towards an Optimized BlockChain for IoT , 2017, 2017 IEEE/ACM Second International Conference on Internet-of-Things Design and Implementation (IoTDI).

[32]  Massimo Bartoletti,et al.  Dissecting Ponzi schemes on Ethereum: identification, analysis, and impact , 2017, Future Gener. Comput. Syst..

[33]  Tomi Dahlberg,et al.  Digital Supply Chain Transformation toward Blockchain Integration , 2017, HICSS.

[34]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[35]  Rob J. Hyndman,et al.  Large-Scale Unusual Time Series Detection , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[36]  Tyler Moore,et al.  There's No Free Lunch, Even Using Bitcoin: Tracking the Popularity and Profits of Virtual Currency Scams , 2015, Financial Cryptography.

[37]  Nick S. Jones,et al.  Highly Comparative Feature-Based Time-Series Classification , 2014, IEEE Transactions on Knowledge and Data Engineering.

[38]  George C. Runger,et al.  A time series forest for classification and feature extraction , 2013, Inf. Sci..

[39]  Siddhartha Bhattacharyya,et al.  Data mining for credit card fraud: A comparative study , 2011, Decis. Support Syst..

[40]  Marc Artzrouni,et al.  The mathematics of Ponzi schemes , 2009, Math. Soc. Sci..

[41]  Pavlos Protopapas,et al.  Finding anomalous periodic time series , 2009, Machine Learning.

[42]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[43]  Xiaozhe Wang,et al.  Characteristic-Based Clustering for Time Series Data , 2006, Data Mining and Knowledge Discovery.

[44]  Hui Han,et al.  Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning , 2005, ICIC.

[45]  Robert P. Sheridan,et al.  Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling , 2003, J. Chem. Inf. Comput. Sci..

[46]  Zibin Zheng,et al.  Ponzi scheme detection via oversampling-based Long Short-Term Memory for smart contracts , 2021, Knowl. Based Syst..

[47]  David Kotz,et al.  Amanuensis: Information provenance for health-data systems , 2021, Inf. Process. Manag..

[48]  Elif Derya Übeyli,et al.  Recurrent Neural Networks , 2018 .

[49]  Juan M. Corchado,et al.  How blockchain improves the supply chain: case study alimentary supply chain , 2018, FNC/MobiSPC.

[50]  Yi Zhou,et al.  Erays: Reverse Engineering Ethereum's Opaque Smart Contracts , 2018, USENIX Security Symposium.

[51]  Nick Szabo,et al.  Smart Contracts: Building Blocks for Digital Markets , 2018 .

[52]  Tie-Yan Liu,et al.  A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS 2017.

[53]  Lior Rokach,et al.  Decision forest: Twenty years of research , 2016, Inf. Fusion.

[54]  Daniel Davis Wood ETHEREUM: A SECURE DECENTRALISED GENERALISED TRANSACTION LEDGER , 2014 .

[55]  S. Nakamoto,et al.  Bitcoin: A Peer-to-Peer Electronic Cash System , 2008 .

[56]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.