Intelligent Video Caching at Network Edge: A Multi-Agent Deep Reinforcement Learning Approach

Today’s explosively growing Internet video traffics and viewers’ ever-increasing quality of experience (QoE) demands for video streaming bring tremendous pressures to the backbone network. As a new network paradigm, mobile edge caching provides a promising alternative by pushing video content closer at the network edge rather than the remote CDN servers so as to reduce both content access latency and redundant network traffic. However, our large-scale trace analysis shows that different from CDN based caching, edge caching environment is much more complicated with massively dynamic and diverse request patterns, which renders that existing rule-based and model-based caching solutions may not well fit such complicated edge environments. Moreover, although cooperative caching has been proposed to better afford limited storage on each individual edge server, our trace analysis also shows that the request similarity among neighboring edges can be highly dynamic and diverse, which is drastically different from CDN based caching environment, and can easily compromise the benefits from traditional cooperative caching mostly designed based on CDN environment. In this paper, we propose MacoCache, an intelligent edge caching framework that is carefully designed to afford the massively diversified and distributed caching environment to minimize both content access latency and traffic cost. Specifically, MacoCache leverages a multi-agent deep reinforcement learning (MADRL) based solution, where each edge is able to adaptively learn its own best policy in conjunction with other edges for intelligent caching. The real trace-driven evaluation further demonstrates that MacoCache is able to reduce an average of 21% latency and 26% cost compared with the state-of-the-art caching solution.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Feng Wang,et al.  When RFID Meets Deep Learning: Exploring Cognitive Intelligence for Activity Identification , 2019, IEEE Wireless Communications.

[3]  Tao Jiang,et al.  Deep Reinforcement Learning for Mobile Edge Caching: Review, New Features, and Open Issues , 2018, IEEE Network.

[4]  Gang Feng,et al.  Multi-Agent Reinforcement Learning for Efficient Content Caching in Mobile D2D Networks , 2019, IEEE Transactions on Wireless Communications.

[5]  Xiaofei Wang,et al.  Cache in the air: exploiting content caching and delivery techniques for 5G systems , 2014, IEEE Communications Magazine.

[6]  Ke Xu,et al.  On popularity prediction of videos shared in online social networks , 2013, CIKM.

[7]  László Böszörményi,et al.  A survey of Web cache replacement strategies , 2003, CSUR.

[8]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[9]  Suhas N. Diggavi,et al.  Content caching and delivery over heterogeneous wireless networks , 2014, 2015 IEEE Conference on Computer Communications (INFOCOM).

[10]  Baochun Li,et al.  Joint request mapping and response routing for geo-distributed cloud services , 2013, 2013 Proceedings IEEE INFOCOM.

[11]  Gerhard Fettweis,et al.  Fronthaul and backhaul requirements of flexibly centralized radio access networks , 2015, IEEE Wireless Communications.

[12]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[13]  Haitian Pang,et al.  Towards Low Latency Multi-viewpoint 360° Interactive Video: A Multimodal Deep Reinforcement Learning Approach , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[14]  Abdallah Khreishah,et al.  Collaborative caching for multicell-coordinated systems , 2015, 2015 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[15]  Baochun Li,et al.  Demand forecast and performance prediction in peer-assisted on-demand streaming systems , 2011, 2011 Proceedings IEEE INFOCOM.

[16]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[17]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[18]  Shimon Whiteson,et al.  Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning , 2017, ICML.

[19]  Jiangchuan Liu,et al.  Demystifying the Crowd Intelligence in Last Mile Parcel Delivery for Smart Cities , 2019, IEEE Network.

[20]  Lifeng Sun,et al.  Joint Request Balancing and Content Aggregation in Crowdsourced CDN , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[21]  Geoffrey Ye Li,et al.  Spectrum Sharing in Vehicular Networks Based on Multi-Agent Reinforcement Learning , 2019, IEEE Journal on Selected Areas in Communications.

[22]  Haitian Pang,et al.  Intelligent Edge-Assisted Crowdcast with Deep Reinforcement Learning for Personalized QoE , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[23]  Mustafa Cenk Gursoy,et al.  A deep reinforcement learning-based framework for content caching , 2017, 2018 52nd Annual Conference on Information Sciences and Systems (CISS).

[24]  Dario Pompili,et al.  Octopus: A Cooperative Hierarchical Caching Strategy for Cloud Radio Access Networks , 2016, 2016 IEEE 13th International Conference on Mobile Ad Hoc and Sensor Systems (MASS).

[25]  Mihaela van der Schaar,et al.  Forecasting Popularity of Videos Using Social Media , 2014, IEEE Journal of Selected Topics in Signal Processing.

[26]  Robbert van Renesse,et al.  An analysis of Facebook photo caching , 2013, SOSP.

[27]  Baher Abdulhai,et al.  Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto , 2013, IEEE Transactions on Intelligent Transportation Systems.

[28]  Abdallah Khreishah,et al.  A Provably Efficient Online Collaborative Caching Algorithm for Multicell-Coordinated Systems , 2015, IEEE Transactions on Mobile Computing.

[29]  Tao Jiang,et al.  Caching Transient Data for Internet of Things: A Deep Reinforcement Learning Approach , 2019, IEEE Internet of Things Journal.

[30]  Alexandros G. Dimakis,et al.  FemtoCaching: Wireless Content Delivery Through Distributed Caching Helpers , 2013, IEEE Transactions on Information Theory.

[31]  Ondrej Krajsa,et al.  RTT measurement and its dependence on the real geographical distance , 2011, 2011 34th International Conference on Telecommunications and Signal Processing (TSP).

[32]  Dario Pompili,et al.  Adaptive Bitrate Video Caching and Processing in Mobile-Edge Computing Networks , 2019, IEEE Transactions on Mobile Computing.

[33]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[34]  Weinan Zhang,et al.  MAgent: A Many-Agent Reinforcement Learning Platform for Artificial Collective Intelligence , 2017, AAAI.