Online Reinforcement Learning of X-Haul Content Delivery Mode in Fog Radio Access Networks

We consider a Fog Radio Access Network (F-RAN) with a Base Band Unit (BBU) in the cloud and multiple cache-enabled enhanced Remote Radio Heads (eRRHs). The system aims at delivering contents on demand with minimal average latency from a time-varying library of popular contents. Uncached requested files can be transferred from the cloud to the eRRHs by following either backhaul or fronthaul modes. The backhaul mode transfers fractions of the requested files, while the fronthaul mode transmits quantized baseband samples as in Cloud-RAN (C-RAN). The backhaul mode allows the caches of the eRRHs to be updated, which may lower future delivery latencies. In contrast, the fronthaul mode enables cooperative C-RAN transmissions that may reduce the current delivery latency. Taking into account the trade-off between current and future delivery performance, this letter proposes an adaptive selection method between the two delivery modes to minimize the long-term delivery latency. Assuming an unknown and time-varying popularity model, the method is based on model-free Reinforcement Learning (RL). Numerical results confirm the effectiveness of the proposed RL.

[1]  David M. W. Powers,et al.  Applications and Explanations of Zipf’s Law , 1998, CoNLL.

[2]  Massimo Gallo,et al.  Modeling data transfer in content-centric networking , 2011, 2011 23rd International Teletraffic Congress (ITC).

[3]  Volker Jungnickel,et al.  Fronthaul evolution: From CPRI to Ethernet , 2015 .

[4]  Paolo Giaccone,et al.  Temporal locality in today's content caching: why it matters and how to model it , 2013, CCRV.

[5]  Inkyu Lee,et al.  Minimum Rate Maximization for Wireless Powered Cloud Radio Access Networks , 2019, IEEE Transactions on Vehicular Technology.

[6]  Inkyu Lee,et al.  Joint Design of Fronthauling and Hybrid Beamforming for Downlink C-RAN Systems , 2019, IEEE Transactions on Communications.

[7]  L. B. Milstein,et al.  On the accuracy of a first-order Markov model for data transmission on fading channels , 1995, Proceedings of ICUPC '95 - 4th IEEE International Conference on Universal Personal Communications.

[8]  Gang Wang,et al.  Efficient traffic engineering for 5g core and backhaul networks , 2017, Journal of Communications and Networks.

[9]  Hongsheng Xi,et al.  Finding optimal memoryless policies of POMDPs under the expected average reward criterion , 2011, Eur. J. Oper. Res..

[10]  Urs Niesen,et al.  Online coded caching , 2014, ICC.

[11]  Michael L. Littman,et al.  Memoryless policies: theoretical limitations and practical results , 1994 .

[12]  Tharmalingam Ratnarajah,et al.  Content Placement Learning for Success Probability Maximization in Wireless Edge Caching Networks , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Deniz Gündüz,et al.  A Reinforcement-Learning Approach to Proactive Caching in Wireless Networks , 2017, IEEE Journal on Selected Areas in Communications.

[14]  Osvaldo Simeone,et al.  Fundamental Limits of Cloud and Cache-Aided Interference Management With Multi-Antenna Edge Nodes , 2019, IEEE Transactions on Information Theory.

[15]  Dennis Shasha,et al.  2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm , 1994, VLDB.

[16]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[17]  Osvaldo Simeone,et al.  Fog-Aided Wireless Networks for Content Delivery: Fundamental Latency Tradeoffs , 2016, IEEE Transactions on Information Theory.

[18]  Alireza Sadeghi,et al.  Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities , 2017, IEEE Journal of Selected Topics in Signal Processing.

[19]  Inkyu Lee,et al.  Joint Designs of Fronthaul Compression and Precoding for Full-Duplex Cloud Radio Access Networks , 2016, IEEE Wireless Communications Letters.

[20]  Shlomo Shamai,et al.  Joint cloud and edge processing for latency minimization in Fog Radio Access Networks , 2016, 2016 IEEE 17th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[21]  Gang Wang,et al.  Adaptive Caching via Deep Reinforcement Learning , 2019, ArXiv.

[22]  Osvaldo Simeone,et al.  Online Edge Caching and Wireless Delivery in Fog-Aided Networks With Dynamic Content Popularity , 2017, IEEE Journal on Selected Areas in Communications.

[23]  Thomas Pfeiffer,et al.  Next generation mobile fronthaul and midhaul architectures [Invited] , 2015, IEEE/OSA Journal of Optical Communications and Networking.

[24]  Shlomo Shamai,et al.  Joint optimization of cloud and edge processing for fog radio access networks , 2016, 2016 IEEE International Symposium on Information Theory (ISIT).

[25]  Hung-Yu Wei,et al.  5G Radio Access Network Design with the Fog Paradigm: Confluence of Communications and Computing , 2017, IEEE Communications Magazine.

[26]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[27]  Wei-Ho Chung,et al.  Enabling Low-Latency Applications in Fog-Radio Access Networks , 2017, IEEE Network.

[28]  Xiaohu You,et al.  Low-Latency C-RAN: An Next-Generation Wireless Approach , 2018, IEEE Vehicular Technology Magazine.

[29]  Arturo Azcorra,et al.  Xhaul: toward an integrated fronthaul/backhaul architecture in 5G networks , 2015, IEEE Wireless Communications.