Dynamic Coded Caching in Wireless Networks Using Multi-Agent Reinforcement Learning

We consider distributed caching of content across several small base stations (SBSs) in a wireless network, where the content is encoded using a maximum distance separable code. Specifically, we apply soft time-to-live (STTL) cache management policies, where coded packets may be evicted from the caches at periodic times. We propose a reinforcement learning (RL) approach to find coded STTL policies minimizing the overall network load. We demonstrate that such caching policies achieve almost the same network load as policies obtained through optimization, where the latter assumes perfect knowledge of the distribution of times between file requests as well the distribution of the number of SBSs within communication range of a user placing a request. We also suggest a multi-agent RL (MARL) framework for the scenario of non-uniformly distributed requests in space. For such a scenario, we show that MARL caching policies achieve lower network load as compared to optimized caching policies assuming a uniform request placement. We also provide convincing evidence that synchronous updates offer a lower network load than asynchronous updates for spatially homogeneous renewal request processes due to the memory of the renewal processes.

[1]  Gang Feng,et al.  Multi-Agent Reinforcement Learning for Efficient Content Caching in Mobile D2D Networks , 2019, IEEE Transactions on Wireless Communications.

[2]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[3]  Robert W. Heath,et al.  Five disruptive technology directions for 5G , 2013, IEEE Communications Magazine.

[4]  Deniz Gündüz,et al.  Learning-based optimization of cache content in a small cell base station , 2014, 2014 IEEE International Conference on Communications (ICC).

[5]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[6]  Urs Niesen,et al.  Fundamental limits of caching , 2012, 2013 IEEE International Symposium on Information Theory.

[7]  Deniz Gündüz,et al.  A Reinforcement-Learning Approach to Proactive Caching in Wireless Networks , 2017, IEEE Journal on Selected Areas in Communications.

[8]  Giuseppe Caire,et al.  Fundamental Limits of Caching in Wireless D2D Networks , 2014, IEEE Transactions on Information Theory.

[9]  Ítalo S. Cunha,et al.  Analyzing client interactivity in streaming media , 2004, WWW '04.

[10]  Geoffrey Ye Li,et al.  Reinforcement Learning Based Cooperative Coded Caching Under Dynamic Popularities in Ultra-Dense Networks , 2020, IEEE Transactions on Vehicular Technology.

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[13]  Min Sheng,et al.  Learning-Based Content Caching and Sharing for Wireless Networks , 2017, IEEE Transactions on Communications.

[14]  Alexandre Graell i Amat,et al.  Distributed Storage in Mobile Wireless Networks With Device-to-Device Communication , 2016, IEEE Transactions on Communications.

[15]  Alexandros G. Dimakis,et al.  FemtoCaching: Wireless Content Delivery Through Distributed Caching Helpers , 2013, IEEE Transactions on Information Theory.

[16]  Alireza Sadeghi,et al.  Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities , 2017, IEEE Journal of Selected Topics in Signal Processing.

[17]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[18]  R. Michael Buehrer,et al.  Learning distributed caching strategies in small cell networks , 2014, 2014 11th International Symposium on Wireless Communications Systems (ISWCS).

[19]  O. Antoine,et al.  Theory of Error-correcting Codes , 2022 .

[20]  Osvaldo Simeone,et al.  Soft-TTL: Time-Varying Fractional Caching , 2018, IEEE Networking Letters.

[21]  H. Vincent Poor,et al.  Multi-Agent Reinforcement Learning for Cooperative Coded Caching via Homotopy Optimization , 2020, IEEE Transactions on Wireless Communications.

[22]  Alexandre Graell i Amat,et al.  Dynamic Coded Caching in Wireless Networks , 2020, ArXiv.

[23]  Iryna Andriyanova,et al.  Optimizing MDS Coded Caching in Wireless Networks With Device-to-Device Communication , 2017, IEEE Transactions on Wireless Communications.

[24]  Valerio Bioglio,et al.  Optimizing MDS Codes for Caching at the Edge , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[25]  Meixia Tao,et al.  Deep Learning for Wireless Coded Caching With Unknown and Time-Variant Content Popularity , 2021, IEEE Transactions on Wireless Communications.