论文信息 - A Learning-Based Incentive Mechanism for Federated Learning

A Learning-Based Incentive Mechanism for Federated Learning

Internet of Things (IoT) generates large amounts of data at the network edge. Machine learning models are often built on these data, to enable the detection, classification, and prediction of the future events. Due to network bandwidth, storage, and especially privacy concerns, it is often impossible to send all the IoT data to the data center for centralized model training. To address these issues, federated learning has been proposed to let nodes use the local data to train models, which are then aggregated to synthesize a global model. Most of the existing work has focused on designing learning algorithms with provable convergence time, but other issues, such as incentive mechanism, are unexplored. Although incentive mechanisms have been extensively studied in network and computation resource allocation, yet they cannot be applied to federated learning directly due to the unique challenges of information unsharing and difficulties of contribution evaluation. In this article, we study the incentive mechanism for federated learning to motivate edge nodes to contribute model training. Specifically, a deep reinforcement learning-based (DRL) incentive mechanism has been designed to determine the optimal pricing strategy for the parameter server and the optimal training strategies for edge nodes. Finally, numerical experiments have been implemented to evaluate the efficiency of the proposed DRL-based incentive mechanism.

[1] Tingting Fu,et al. Optimal ThrowBoxes assignment for big data multicast in VDTNs , 2019, Wirel. Networks.

[2] Song Guo,et al. Incentive mechanisms for device-to-device communications , 2015, IEEE Network.

[3] Chi Harold Liu,et al. Free Market of Multi-Leader Multi-Follower Mobile Crowdsensing: An Incentive Mechanism Design by Deep Reinforcement Learning , 2020, IEEE Transactions on Mobile Computing.

[4] Song Guo,et al. Vehicle-Assist Resilient Information and Network System for Disaster Management , 2017, IEEE Transactions on Emerging Topics in Computing.

[5] Zhiyuan Xu,et al. Experience-Driven Congestion Control: When Multi-Path TCP Meets Deep Reinforcement Learning , 2019, IEEE Journal on Selected Areas in Communications.

[6] Song Guo,et al. Experience-Driven Computational Resource Allocation of Federated Learning by Deep Reinforcement Learning , 2020, 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[7] Yaochu Jin,et al. Multi-Objective Evolutionary Federated Learning , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[8] Eric Horvitz,et al. A Deep Hybrid Model for Weather Forecasting , 2015, KDD.

[9] Nando de Freitas,et al. Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.

[10] Shaohan Hu,et al. Deep Learning for the Internet of Things , 2018, Computer.

[11] Athanasios V. Vasilakos,et al. TRAC: Truthful auction for location-aware collaborative sensing in mobile crowdsourcing , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[12] Albert Y. Zomaya,et al. Federated Learning over Wireless Networks: Optimization Model Design and Analysis , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[13] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.

[14] Xiang-Yang Li,et al. How to crowdsource tasks truthfully without sacrificing utility: Online incentive mechanisms with budget constraint , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[15] Kaoru Ota,et al. Deep Learning for Mobile Multimedia , 2017, ACM Trans. Multim. Comput. Commun. Appl..

[16] Blaise Agüera y Arcas,et al. Communication-Efficient Learning of Deep Networks from Decentralized Data , 2016, AISTATS.

[17] Lei Chen,et al. Free Market of Crowdsourcing: Incentive Mechanism Design for Mobile Sensing , 2014, IEEE Transactions on Parallel and Distributed Systems.

[18] Chuan Wu,et al. Optimus: an efficient dynamic resource scheduler for deep learning clusters , 2018, EuroSys.

[19] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[20] Sue Ellen Haupt,et al. Big Data and Machine Learning for Applied Weather Forecasts: Forecasting Solar Power for Utility Operations , 2015, 2015 IEEE Symposium Series on Computational Intelligence.

[21] Sanjit Krishnan Kaul,et al. Minimizing age of information in vehicular networks , 2011, 2011 8th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[22] M. Dufwenberg. Game theory. , 2011, Wiley interdisciplinary reviews. Cognitive science.

[23] Wei Wang,et al. Role of Gifts in Decision Making: An Endowment Effect Incentive Mechanism for Offloading in the IoV , 2019, IEEE Internet of Things Journal.

[24] Jia Xu,et al. Incentive Mechanisms for Time Window Dependent Tasks in Mobile Crowdsensing , 2015, IEEE Transactions on Wireless Communications.

[25] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.

[26] Yiwei Thomas Hou,et al. A General Model for Minimizing Age of Information at Network Edge , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[27] Tom Schaul,et al. Reinforcement Learning with Unsupervised Auxiliary Tasks , 2016, ICLR.

[28] Roy D. Yates,et al. Real-time status: How often should one update? , 2012, 2012 Proceedings IEEE INFOCOM.

[29] Song Guo,et al. Resource Management at the Network Edge: A Deep Reinforcement Learning Approach , 2019, IEEE Network.

[30] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[31] Xi Fang,et al. Crowdsourcing to smartphones: incentive mechanism design for mobile phone sensing , 2012, Mobicom '12.

[32] Mehdi Bennis,et al. Optimized Computation Offloading Performance in Virtual Edge Computing Systems Via Deep Reinforcement Learning , 2018, IEEE Internet of Things Journal.

[33] Jian Tang,et al. Truthful incentive mechanisms for crowdsourcing , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[34] Geyong Min,et al. Communication-Efficient Federated Learning for Wireless Edge Intelligence in IoT , 2020, IEEE Internet of Things Journal.

[35] Mianxiong Dong,et al. Learning IoT in Edge: Deep Learning for the Internet of Things with Edge Computing , 2018, IEEE Network.

[36] Albert Y. Zomaya,et al. Intelligent VNF Orchestration and Flow Scheduling via Model-Assisted Deep Reinforcement Learning , 2020, IEEE Journal on Selected Areas in Communications.

[37] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.

[38] Hongke Zhang,et al. Incentive mechanism for computation offloading using edge computing: A Stackelberg game approach , 2017, Comput. Networks.

[39] Chi Harold Liu,et al. Energy-Efficient Distributed Mobile Crowd Sensing: A Deep Learning Approach , 2019, IEEE Journal on Selected Areas in Communications.

[40] Tao Wang,et al. A Fair and Budget-Balanced Incentive Mechanism for Energy Management in Buildings , 2018, IEEE Transactions on Smart Grid.

[41] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[42] Yuanqing Xia,et al. Big Data Analytics by CrowdLearning: Architecture and Mechanism Design , 2020, IEEE Network.

[43] Yuanqing Xia,et al. Incentive mechanism in platform-centric mobile crowdsensing: A one-to-many bargaining approach , 2018, Comput. Networks.

[44] Wei Wang,et al. Continuum: A Platform for Cost-Aware, Low-Latency Continual Learning , 2018, SoCC.

[45] Anit Kumar Sahu,et al. On the Convergence of Federated Optimization in Heterogeneous Networks , 2018, ArXiv.