PDS2: A user-centered decentralized marketplace for privacy preserving data processing

We envision PDS2, a decentralized data marketplace in which consumers submit their tasks to be run within the platform, on the data of willing providers. The goal of PDS2 is to ensure that users maintain full control on their data and do not compromise their privacy, while being rewarded for the value that their data generates. In order to achieve this, our marketplace architecture employs blockchain technology, privacy-preserving computation and decentralized machine learning.We then compare different potential solutions and identify the Ethereum blockchain, trusted execution environments and gossip learning as the most suitable for the implementation of PDS2. We also discuss the main open challenges that are left to tackle and possible directions for future work.

[1]  István Hegedüs,et al.  Decentralized learning works: An empirical comparison of gossip learning and federated learning , 2021, J. Parallel Distributed Comput..

[2]  D. Rueckert,et al.  A Systematic Comparison of Encrypted Machine Learning Solutions for Image Classification , 2020, PPMLP@CCS.

[3]  Faiez Gargouri,et al.  Semantic Web Technologies for the Internet of Things: Systematic Literature Review , 2020, Internet Things.

[4]  Reza M. Parizi,et al.  Federated Learning: A Survey on Enabling Technologies, Protocols, and Applications , 2020, IEEE Access.

[5]  Meng Li,et al.  A Blockchain-Based Trading Platform for Big Data , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[6]  T. Rabin,et al.  Falcon: Honest-Majority Maliciously Secure Framework for Private Deep Learning , 2020, Proc. Priv. Enhancing Technol..

[7]  Zibin Zheng,et al.  A Blockchain-Based Decentralized Federated Learning Framework with Committee Consensus , 2020, IEEE Network.

[8]  Shaohuai Shi,et al.  Communication-Efficient Distributed Deep Learning: A Comprehensive Survey , 2020, ArXiv.

[9]  Raul Castro Fernandez,et al.  Data market platforms , 2020, Proc. VLDB Endow..

[10]  Lodovico Giaretta,et al.  Gossip Learning: Off the Beaten Path , 2019, 2019 IEEE International Conference on Big Data (Big Data).

[11]  Choong Seon Hong,et al.  FLchain: Federated Learning via MEC-enabled Blockchain Network , 2019, 2019 20th Asia-Pacific Network Operations and Management Symposium (APNOMS).

[12]  Nikolaos Laoutaris,et al.  Valuating User Data in a Human-Centric Data Economy , 2019, ArXiv.

[13]  Philip S. Yu,et al.  Applying Differential Privacy Mechanism in Artificial Intelligence , 2019, 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS).

[14]  James Y. Zou,et al.  Data Shapley: Equitable Valuation of Data for Machine Learning , 2019, ICML.

[15]  Kevin R. B. Butler,et al.  Secure Multiparty Computation and Trusted Hardware: Examining Adoption Challenges and Opportunities , 2019, Secur. Commun. Networks.

[16]  Costas J. Spanos,et al.  Towards Efficient Data Valuation Based on the Shapley Value , 2019, AISTATS.

[17]  Christof Fetzer,et al.  TensorSCONE: A Secure TensorFlow Framework using Intel SGX , 2019, ArXiv.

[18]  Amir Houmansadr,et al.  Comprehensive Privacy Analysis of Deep Learning: Passive and Active White-box Inference Attacks against Centralized and Federated Learning , 2018, 2019 IEEE Symposium on Security and Privacy (SP).

[19]  Yang Song,et al.  Beyond Inferring Class Representatives: User-Level Privacy Leakage From Federated Learning , 2018, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[20]  Ravikiran Vatrapu,et al.  Blockchain-based Personal Health Data Sharing System Using Cloud Storage , 2018, 2018 IEEE 20th International Conference on e-Health Networking, Applications and Services (Healthcom).

[21]  Dawn Xiaodong Song,et al.  A Demonstration of Sterling: A Privacy-Preserving Data Marketplace , 2018, Proc. VLDB Endow..

[22]  Dawn Xiaodong Song,et al.  Efficient Deep Learning on Multi-Source Private Data , 2018, ArXiv.

[23]  Dan Boneh,et al.  Slalom: Fast, Verifiable and Private Execution of Neural Networks in Trusted Hardware , 2018, ICLR.

[24]  Arda Yurdakul,et al.  IDMoB: IoT Data Marketplace on Blockchain , 2018, 2018 Crypto Valley Conference on Blockchain Technology (CVCBT).

[25]  Arun Kumar,et al.  Towards Model-based Pricing for Machine Learning in a Data Marketplace , 2018, SIGMOD Conference.

[26]  Marko Vukolic,et al.  Hyperledger fabric: a distributed operating system for permissioned blockchains , 2018, EuroSys.

[27]  Yao Lu,et al.  Oblivious Neural Network Predictions via MiniONN Transformations , 2017, IACR Cryptol. ePrint Arch..

[28]  Márk Jelasity,et al.  Decentralized Management of Random Walks over a Mobile Phone Network , 2017, 2017 25th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP).

[29]  Xin Li,et al.  An IoT Data Communication Framework for Authenticity and Integrity , 2017, 2017 IEEE/ACM Second International Conference on Internet-of-Things Design and Implementation (IoTDI).

[30]  Sebastian Nowozin,et al.  Oblivious Multi-Party Machine Learning on Trusted Processors , 2016, USENIX Security Symposium.

[31]  András A. Benczúr,et al.  Robust Decentralized Low-Rank Matrix Decomposition , 2016, ACM Trans. Intell. Syst. Technol..

[32]  Blaise Agüera y Arcas,et al.  Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[33]  Antonio F. Gómez-Skarmeta,et al.  Semantic Web of Things: an analysis of the application semantics for the IoT moving towards the IoT convergence , 2014, Int. J. Web Grid Serv..

[34]  István Hegedüs,et al.  Gossip learning with linear models on fully distributed data , 2011, Concurr. Comput. Pract. Exp..

[35]  Pablo Rodriguez,et al.  On economic heavy hitters: shapley value analysis of 95th-percentile pricing , 2010, IMC '10.

[36]  Craig Gentry,et al.  Computing arbitrary functions of encrypted data , 2010, CACM.

[37]  L. Shapley,et al.  The Shapley Value , 1994 .

[38]  Vitalik Buterin A NEXT GENERATION SMART CONTRACT & DECENTRALIZED APPLICATION PLATFORM , 2015 .

[39]  S. Popov The Tangle , 2015 .