NA-Caching: An Adaptive Content Management Approach Based on Deep Reinforcement Learning

Video streaming is a dominant application over today’s Internet. The current mainstream video streaming solution is to utilize the services of a Content Delivery Network (CDN) provider. By replicating video content closer to the network edge, caching provides an effective mechanism for alleviating the demand for massive bandwidth for the Internet backbone. It reduces the network traffic and capital expense for streaming the video content, and in the meantime, enhance Internet’s Quality of Service (QoS). In this paper, we propose a neural adaptive caching approach, named NA-Caching, for helping cache learn to make caching decisions from its own experiences rather than a specific mathematical model, in a way similar to how a human being learns a new skill (e.g. cycling, swimming). NA-Caching leverages the benefits of the Recurrent Neural Network (RNN) as well as the Deep Reinforcement Learning (DRL) to maximize the cache efficiency by jointly learning request features, caching space dynamics and making decisions. Specifically, we utilize Gated Recurrent Unit (GRU) to characterize the evolving features of the dynamic requests and caching space. Moreover, the above GRU-based representation network is integrated into a Deep Q-Network (DQN) framework for making adaptive caching decisions online. To evaluate the performance of the proposed approach, we conduct extensive experiments on anonymized real-world traces from a video provider. The results demonstrate that our algorithm significantly outperform several candidate methods.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[3]  Dario Rossi,et al.  Caching performance of content centric networks under multi-path routing (and more) , 2011 .

[4]  Paolo Giaccone,et al.  Temporal locality in today's content caching: why it matters and how to model it , 2013, CCRV.

[5]  Michele Garetto,et al.  A unified approach to the performance analysis of caching systems , 2014, INFOCOM.

[6]  Paolo Giaccone,et al.  Analyzing the Performance of LRU Caches under Non-Stationary Traffic Patterns , 2013, ArXiv.

[7]  Jiaru Lin,et al.  Iterative Gaussian-Approximated Message Passing Receiver for MIMO-SCMA System , 2019, IEEE Journal of Selected Topics in Signal Processing.

[8]  Hao Che,et al.  Hierarchical Web caching systems: modeling, design and experimental results , 2002, IEEE J. Sel. Areas Commun..

[9]  Seungjoon Lee,et al.  Optimal Content Placement for a Large-Scale VoD System , 2016, TNET.

[10]  Zhidu Li,et al.  Socially Aware Caching in D2D Enabled Fog Radio Access Networks , 2019, IEEE Access.

[11]  Yuzhe Liu,et al.  Overlapping Community Deep Exploring-Based Relay Selection Method Toward Multi-Hop D2D Communication , 2019, IEEE Wireless Communications Letters.

[12]  Zufan Zhang,et al.  Social tie-driven content priority scheme for D2D communications , 2019, Inf. Sci..

[13]  Nimrod Megiddo,et al.  ARC: A Self-Tuning, Low Overhead Replacement Cache , 2003, FAST.

[14]  Peter Scheuermann,et al.  A Case for Delay-Conscious Caching of Web Documents , 1997, Comput. Networks.

[15]  Przemysław Rokita,et al.  Predicting Popularity of Online Videos Using Support Vector Regression , 2017, IEEE Transactions on Multimedia.

[16]  Peter J. Denning,et al.  Operating Systems Theory , 1973 .

[17]  László Böszörményi,et al.  A survey of Web cache replacement strategies , 2003, CSUR.

[18]  Xuyuan Kang,et al.  High-Accuracy Entity State Prediction Method Based on Deep Belief Network Toward IoT Search , 2019, IEEE Wireless Communications Letters.

[19]  Robbert van Renesse,et al.  An analysis of Facebook photo caching , 2013, SOSP.

[20]  Chen Zhang,et al.  On the “familiar stranger” phenomenon in a large-scale VoD system , 2017, 2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[21]  Kianoosh Mokhtarian,et al.  Flexible Caching Algorithms for Video Content Distribution Networks , 2017, IEEE/ACM Transactions on Networking.

[22]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[23]  Philippe Robert,et al.  A versatile and accurate approximation for LRU cache performance , 2012, 2012 24th International Teletraffic Congress (ITC 24).

[24]  Yongdong Zhang,et al.  Unfolding Temporal Dynamics: Predicting Social Media Popularity Using Multi-scale Temporal Decomposition , 2016, AAAI.

[25]  Deniz Gündüz,et al.  Learning-based optimization of cache content in a small cell base station , 2014, 2014 IEEE International Conference on Communications (ICC).

[26]  Yuehong Gao,et al.  On Buffer-Constrained Throughput of a Wireless-Powered Communication System , 2019, IEEE Journal on Selected Areas in Communications.

[27]  Sandy Irani,et al.  Cost-Aware WWW Proxy Caching Algorithms , 1997, USENIX Symposium on Internet Technologies and Systems.

[28]  Shaoen Wu,et al.  Biologically Inspired Resource Allocation for Network Slices in 5G-Enabled Internet of Things , 2019, IEEE Internet of Things Journal.

[29]  Benny Van Houdt,et al.  Asymptotically Exact TTL-Approximations of the Cache Replacement Algorithms LRU(m) and h-LRU , 2016, 2016 28th International Teletraffic Congress (ITC 28).

[30]  Azer Bestavros,et al.  Popularity-aware greedy dual-size Web proxy caching algorithms , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[31]  Honggang Wang,et al.  Socially Aware Energy-Efficient Mobile Edge Collaboration for Video Distribution , 2017, IEEE Transactions on Multimedia.

[32]  Qing Yang,et al.  Cache Less for More: Exploiting Cooperative Video Caching and Delivery in D2D Communications , 2019, IEEE Transactions on Multimedia.

[33]  Gerhard Weikum,et al.  The LRU-K page replacement algorithm for database disk buffering , 1993, SIGMOD Conference.

[34]  Laszlo A. Belady,et al.  A Study of Replacement Algorithms for Virtual-Storage Computer , 1966, IBM Syst. J..

[35]  Mihaela van der Schaar,et al.  Popularity-driven content caching , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[36]  David Tse,et al.  Probabilistic methods for web caching , 2001, Perform. Evaluation.

[37]  Hang Shi,et al.  A Feature-Based Learning System for Internet of Things Applications , 2019, IEEE Internet of Things Journal.

[38]  Chunfeng Yang,et al.  Video Popularity Dynamics and Its Implication for Replication , 2015, IEEE Transactions on Multimedia.