A Marketplace for Trading AI Models based on Blockchain and Incentives for IoT Data

As Machine Learning (ML) models are becoming increasingly complex, one of the central challenges is their deployment at scale, such that companies and organizations can create value through Artificial Intelligence (AI). An emerging paradigm in ML is a federated approach where the learning model is delivered to a group of heterogeneous agents partially, allowing agents to train the model locally with their own data. However, the problem of valuation of models, as well the questions of incentives for collaborative training and trading of data/models, have received a limited treatment in the literature. In this paper, a new ecosystem of ML model trading over a trusted Blockchainbased network is proposed. The buyer can acquire the model of interest from the ML market, and interested sellers spend local computations on their data to enhance that model’s quality. In doing so, the proportional relation between the local data and the quality of trained models is considered, and the valuations of seller’s data in training the models are estimated through the distributed Data Shapley Value (DSV). At the same time, the trustworthiness of the entire trading process is provided by the Distributed Ledger Technology (DLT). Extensive experimental evaluation of the proposed approach shows a competitive runtime performance, with a 15% drop in the cost of execution, and fairness in terms of incentives for the participants.

[1]  Israel Leyva Mayorga,et al.  Trusted Wireless Monitoring Based on Distributed Ledgers over NB-IoT Connectivity , 2020, IEEE Communications Magazine.

[2]  S. Nakamoto,et al.  Bitcoin: A Peer-to-Peer Electronic Cash System , 2008 .

[3]  Choong Seon Hong,et al.  A Crowdsourcing Framework for On-Device Federated Learning , 2020, IEEE Transactions on Wireless Communications.

[4]  James Y. Zou,et al.  Data Shapley: Equitable Valuation of Data for Machine Learning , 2019, ICML.

[5]  Deying Li,et al.  An Incentive Mechanism for Building a Secure Blockchain-Based Internet of Things , 2021, IEEE Transactions on Network Science and Engineering.

[6]  Jakub Konecný,et al.  On the Outsized Importance of Learning Rates in Local Update Methods , 2020, ArXiv.

[7]  Qing Yang,et al.  Ethna: Analyzing the Underlying Peer-to-Peer Network of the Ethereum Blockchain , 2020, ArXiv.

[8]  Joseph Dureau,et al.  Federated Learning for Keyword Spotting , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Yong Meng Teo,et al.  Dynamic Resource Pricing on Federated Clouds , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[10]  Paolo Missier,et al.  Toward a Decentralized, Trust-Less Marketplace for Brokered IoT Data Trading Using Blockchain , 2019, 2019 IEEE International Conference on Blockchain (Blockchain).

[11]  Nils Gruschka,et al.  Privacy Issues and Data Protection in Big Data: A Case Study Analysis under GDPR , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[12]  Dan Suciu,et al.  Price-Optimal Querying with Data APIs , 2016, Proc. VLDB Endow..

[13]  Yong Xiang,et al.  Decentralized Privacy Using Blockchain-Enabled Federated Learning in Fog Computing , 2020, IEEE Internet of Things Journal.

[14]  Munther A. Dahleh,et al.  A Marketplace for Data: An Algorithmic Solution , 2018, EC.

[15]  Yanxin Zhang,et al.  A decentralized solution for IoT data trusted exchange based-on blockchain , 2017, 2017 3rd IEEE International Conference on Computer and Communications (ICCC).

[16]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[17]  Dawn Song,et al.  Towards Practical Differentially Private Convex Optimization , 2019, 2019 IEEE Symposium on Security and Privacy (SP).

[18]  Richard Nock,et al.  Advances and Open Problems in Federated Learning , 2021, Found. Trends Mach. Learn..

[19]  L. Shapley,et al.  The Shapley Value , 1994 .

[20]  Debiao He,et al.  A Blockchain-Based Proxy Re-Encryption With Equality Test for Vehicular Communication Systems , 2021, IEEE Transactions on Network Science and Engineering.

[21]  Donald C. Langevoort Fraud and Insider Trading in American Securities Regulation: Its Scope and Philosophy in a Global Marketplace , 1993 .

[22]  Amit Prakash,et al.  A Comparative Study of Air Quality Index Based on Factor Analysis and US-EPA Methods for an Urban Environment , 2009 .

[23]  L. Shapley,et al.  Values of Non-Atomic Games , 1974 .

[24]  Bhaskar Krishnamachari,et al.  Streaming Data Payment Protocol (SDPP) for the Internet of Things , 2018, 2018 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData).

[25]  Weichao Mao,et al.  Pricing for Revenue Maximization in IoT Data Markets: An Information Design Perspective , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[26]  Chunyan Miao,et al.  Federated Learning in the Sky: Aerial-Ground Air Quality Sensing Framework With UAV Swarms , 2021, IEEE Internet of Things Journal.

[27]  Judd Randolph Heckman,et al.  A Pricing Model for Data Markets , 2015 .

[28]  Seong-Lyun Kim,et al.  Blockchained On-Device Federated Learning , 2018, IEEE Communications Letters.

[29]  Costas J. Spanos,et al.  Towards Efficient Data Valuation Based on the Shapley Value , 2019, AISTATS.

[30]  JunHo Jo,et al.  Development of an IoT-Based Indoor Air Quality Monitoring Platform , 2020, J. Sensors.

[31]  Paolo Missier,et al.  Mind my value: a decentralized infrastructure for fair and trusted IoT data trading , 2017, IOT.

[32]  Wei Xiong,et al.  Smart Contract Based Data Trading Mode Using Blockchain and Machine Learning , 2019, IEEE Access.

[33]  Daniel Davis Wood,et al.  ETHEREUM: A SECURE DECENTRALISED GENERALISED TRANSACTION LEDGER , 2014 .

[34]  Petar Popovski,et al.  Modeling and Analysis of Data Trading on Blockchain-Based Market in IoT Networks , 2021, IEEE Internet of Things Journal.

[35]  Fan Wu,et al.  Achieving Data Truthfulness and Privacy Preservation in Data Markets , 2018, IEEE Transactions on Knowledge and Data Engineering.

[36]  Eytan Ruppin,et al.  Feature Selection via Coalitional Game Theory , 2007, Neural Computation.

[37]  Baik Hoh,et al.  Sell your experiences: a market mechanism based incentive for participatory sensing , 2010, 2010 IEEE International Conference on Pervasive Computing and Communications (PerCom).

[38]  Pooja Gupta,et al.  A Decentralized IoT Data Marketplace , 2019, ArXiv.

[39]  Hongming Cai,et al.  Ubiquitous Data Accessing Method in IoT-Based Information System for Emergency Medical Services , 2014, IEEE Transactions on Industrial Informatics.

[40]  Charith Perera Sensing as a Service (S2aaS): Buying and Selling IoT Data , 2017, ArXiv.

[41]  Dawn Song,et al.  A Principled Approach to Data Valuation for Federated Learning , 2020, Federated Learning.

[42]  W. Hoeffding A Combinatorial Central Limit Theorem , 1951 .

[43]  U. Lerner,et al.  On the feasibility of measuring urban air pollution by wireless distributed sensor networks. , 2015, The Science of the total environment.

[44]  Daniel L. Rubin,et al.  Data valuation for medical imaging using Shapley value and application to a large-scale chest X-ray dataset , 2020, Scientific Reports.

[45]  Erik Strumbelj,et al.  An Efficient Explanation of Individual Classifications using Game Theory , 2010, J. Mach. Learn. Res..