Deep Reinforcement Learning-Based Collaborative Video Caching and Transcoding in Clustered and Intelligent Edge B5G Networks

In the next-generation wireless communications system of Beyond 5G networks, video streaming services have held a surprising proportion of the whole network traffic. Furthermore, the user preference and demand towards a specific video might be different because of the heterogeneity of users’ processing capabilities and the variation of network condition. Thus, it is a complicated decision problem with high-dimensional state spaces to choose appropriate quality videos according to users’ actual network condition. To address this issue, in this paper, a Content Distribution Network and Cluster-based Mobile Edge Computing framework has been proposed to enhance the ability of caching and computing and promote the collaboration among edge severs. Then, we develop a novel deep reinforcement learning-based framework to automatically obtain the intracluster collaborative caching and transcoding decisions, which are executed based on video popularity, user requirement prediction, and abilities of edge servers. Simulation results demonstrate that the quality of video streaming service can be significantly improved by using the designed deep reinforcement learning-based algorithm with less backhaul consumption and processing costs.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Yongming Huang,et al.  Cache-Enabled Adaptive Bit Rate Streaming via Deep Self-Transfer Reinforcement Learning , 2018, 2018 10th International Conference on Wireless Communications and Signal Processing (WCSP).

[3]  Xin Zhang,et al.  Deep-Q-Network-Based Multimedia Multi-Service QoS Optimization for Mobile Edge Computing Systems , 2019, IEEE Access.

[4]  Xiaofei Wang,et al.  Convergence of Edge Computing and Deep Learning: A Comprehensive Survey , 2019, IEEE Communications Surveys & Tutorials.

[5]  Dong Li,et al.  Ultra-reliable MU-MIMO detector based on deep learning for 5G/B5G-enabled IoT , 2020, Phys. Commun..

[6]  Nei Kato,et al.  Ten Challenges in Advancing Machine Learning Technologies toward 6G , 2020, IEEE Wireless Communications.

[7]  Cheng Zhang,et al.  Task migration for mobile edge computing using deep reinforcement learning , 2019, Future Gener. Comput. Syst..

[8]  Michael Schukat,et al.  Deep Reinforcement Learning: An Overview , 2016, IntelliSys.

[9]  Hongbo Jiang,et al.  Adaptive Wireless Video Streaming Based on Edge Computing: Opportunities and Approaches , 2019, IEEE Transactions on Services Computing.

[10]  Wei Huang,et al.  Generic Deep Learning-Based Linear Detectors for MIMO Systems Over Correlated Noise Environments , 2020, IEEE Access.

[11]  Khaled Ben Letaief,et al.  Delay-optimal computation task scheduling for mobile-edge computing systems , 2016, 2016 IEEE International Symposium on Information Theory (ISIT).

[12]  MyungJae Shin,et al.  Joint Offloading and Streaming in Mobile Edges: A Deep Reinforcement Learning Approach , 2019, 2019 IEEE VTS Asia Pacific Wireless Communications Symposium (APWCS).

[13]  Xiong Xiong,et al.  Collaborative Edge Caching through Service Function Chaining: Architecture and Challenges , 2018, IEEE Wireless Communications.

[14]  Dario Pompili,et al.  Adaptive Bitrate Video Caching and Processing in Mobile-Edge Computing Networks , 2019, IEEE Transactions on Mobile Computing.

[15]  Dario Pompili,et al.  Collaborative Mobile Edge Computing in 5G Networks: New Paradigms, Scenarios, and Challenges , 2016, IEEE Communications Magazine.

[16]  Danda B. Rawat,et al.  Recent advances in mobile edge computing and content caching , 2020, Digit. Commun. Networks.

[17]  Dongning Guo,et al.  Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks , 2018, IEEE Journal on Selected Areas in Communications.

[18]  Ying-Chang Liang,et al.  Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning , 2020, IEEE Transactions on Vehicular Technology.

[19]  Khaled Ben Letaief,et al.  Dynamic Computation Offloading for Mobile-Edge Computing With Energy Harvesting Devices , 2016, IEEE Journal on Selected Areas in Communications.

[20]  Min Chen,et al.  Online Cloud Transcoding and Distribution for Crowdsourced Live Game Video Streaming , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Ke Zhang,et al.  Edge Intelligence and Blockchain Empowered 5G Beyond for the Industrial Internet of Things , 2019, IEEE Network.

[22]  Ejaz Ahmed,et al.  A survey on mobile edge computing , 2016, 2016 10th International Conference on Intelligent Systems and Control (ISCO).

[23]  Minghua Chen,et al.  Utility Maximization in Peer-to-Peer Systems With Applications to Video Conferencing , 2012, IEEE/ACM Transactions on Networking.

[24]  Chin-Feng Lai,et al.  Q-learning based collaborative cache allocation in mobile edge computing , 2020, Future Gener. Comput. Syst..

[25]  M. Siekkinen,et al.  Edge Computing Assisted Adaptive Mobile Video Streaming , 2019, IEEE Transactions on Mobile Computing.

[26]  Victor C. M. Leung,et al.  Adaptive Bitrate Streaming in Wireless Networks With Transcoding at Network Edge Using Deep Reinforcement Learning , 2020, IEEE Transactions on Vehicular Technology.

[27]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[28]  Haitian Pang,et al.  Intelligent Edge-Assisted Crowdcast with Deep Reinforcement Learning for Personalized QoE , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[29]  Meng Li,et al.  Deep Reinforcement Learning-Based Offloading Decision Optimization in Mobile Edge Computing , 2019, 2019 IEEE Wireless Communications and Networking Conference (WCNC).

[30]  George K. Karagiannidis,et al.  Secure Cache-Aided Multi-Relay Networks in the Presence of Multiple Eavesdroppers , 2019, IEEE Transactions on Communications.

[31]  Victor C. M. Leung,et al.  A Deep Reinforcement Learning-Based Transcoder Selection Framework for Blockchain-Enabled Wireless D2D Transcoding , 2020, IEEE Transactions on Communications.

[32]  Yonggang Wen,et al.  Content-Aware Personalised Rate Adaptation for Adaptive Streaming via Deep Video Analysis , 2018, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[33]  Osvaldo Simeone,et al.  Online Edge Caching and Wireless Delivery in Fog-Aided Networks With Dynamic Content Popularity , 2017, IEEE Journal on Selected Areas in Communications.

[34]  Ke Zhang,et al.  Energy-Efficient Offloading for Mobile Edge Computing in 5G Heterogeneous Networks , 2016, IEEE Access.

[35]  Jianfei Cai,et al.  vCache: Supporting Cost-Efficient Adaptive Bitrate Streaming , 2017, IEEE MultiMedia.

[36]  Yan Zhang,et al.  Artificial Intelligence Empowered Edge Computing and Caching for Internet of Vehicles , 2019, IEEE Wireless Communications.

[37]  Senem Velipasalar,et al.  Deep Reinforcement Learning-Based Edge Caching in Wireless Networks , 2020, IEEE Transactions on Cognitive Communications and Networking.

[38]  Xing Zhang,et al.  A Survey on Mobile Edge Networks: Convergence of Computing, Caching and Communications , 2017, IEEE Access.

[39]  Basem Shihada,et al.  Energy-Efficient Power Allocation in Multitier 5G Networks Using Enhanced Online Learning , 2017, IEEE Transactions on Vehicular Technology.