Attention-Weighted Federated Deep Reinforcement Learning for Device-to-Device Assisted Heterogeneous Collaborative Edge Caching

To meet the growing demand for multimedia services and relieve pressure on the core network, edge caching and device-to-device (D2D) communication are regarded as two promising techniques for next-generation mobile networks and beyond. However, most existing studies give little consideration to effective cooperation or to adaptability in dynamic network environments. In this article, building on flexible trilateral cooperation among user equipment, edge base stations, and a cloud server, we propose a D2D-assisted heterogeneous collaborative edge caching framework that jointly optimizes node selection and cache replacement in mobile networks. We formulate the joint optimization problem as a Markov decision process and use a deep Q-learning network to solve the long-term mixed integer linear programming problem. We further design an attention-weighted federated deep reinforcement learning (AWFDRL) model. It uses federated learning to improve the training efficiency of the Q-learning network while accounting for limited computing and storage capacity, and it incorporates an attention mechanism that optimizes the aggregation weights to counter imbalance in the quality of local models. We prove the convergence of the corresponding algorithm and present simulation results showing that the proposed AWFDRL framework reduces the average delay of content access, improves the hit rate, and offloads traffic.
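To make the idea of attention-optimized aggregation weights concrete, the following is a minimal sketch, not the AWFDRL algorithm itself. The abstract does not specify the scoring function, so this sketch assumes a scaled dot-product score between the current global parameters (as the query) and each local model's parameters (as the keys), followed by a softmax. The names `softmax`, `attention_weights`, and `aggregate` are hypothetical, and models are flattened to plain lists of floats for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(global_params, local_params_list):
    """Assumed scoring rule: scaled dot product between the global model
    (query) and each local model (key), normalized with a softmax."""
    scale = math.sqrt(len(global_params))
    scores = [
        sum(g * w for g, w in zip(global_params, local)) / scale
        for local in local_params_list
    ]
    return softmax(scores)


def aggregate(local_params_list, weights):
    """Weighted average of local parameters. With uniform weights this
    reduces to plain federated averaging over equally sized clients."""
    dim = len(local_params_list[0])
    return [
        sum(w * local[i] for w, local in zip(weights, local_params_list))
        for i in range(dim)
    ]
```

Calling `attention_weights(global_params, locals_)` and passing the result to `aggregate(locals_, weights)` yields the next global model. Under this assumed score, local models that align more closely with the current global model receive larger weights; a real system might instead score local models on validation reward or loss.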
