Caching in Base Station with Recommendation via Q-Learning

Proactive caching in base station (BS) is a promising way to leverage the human-behavior related information to boost the system throughput and improve user experience with low cost. Yet existing caching policies are "blind" to users, i.e., the users are unaware about whether the files to be requested are locally cached at the BS around them, whose performance will degrade in mobile systems due to the randomness of user behavior. For example, when a user stays in a cell where the BS caches the file to be requested later, the user may not initiate the request. When the user sends the request, it may have entered another cell where the BS does not cache the requested file. In this paper, we introduce a simple idea of informing the users about what the BS has cached to make local caching efficient, which can be regarded as a form of recommendation. Since the probabilities that the users request the files are unknown, we resort to Q-learning to perceive the request probability and the statistics of random arrival and departure of mobile users. Then, a cache replacement policy is optimized. Simulation results demonstrate that the proposed policy with recommendation can provide higher long-term system reward than existing policies without recommendation.

[1]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[2]  Chenyang Yang,et al.  Cooperative Device-to-Device Communications with Caching , 2016, 2016 IEEE 83rd Vehicular Technology Conference (VTC Spring).

[3]  Ido Guy,et al.  Increasing engagement through early recommender intervention , 2009, RecSys '09.

[4]  Guy Shani,et al.  An MDP-Based Recommender System , 2002, J. Mach. Learn. Res..

[5]  Mehdi Bennis,et al.  Living on the edge: The role of proactive caching in 5G wireless networks , 2014, IEEE Communications Magazine.

[6]  Abdallah Khreishah,et al.  A Provably Efficient Online Collaborative Caching Algorithm for Multicell-Coordinated Systems , 2015, IEEE Transactions on Mobile Computing.

[7]  Sean M. McNee,et al.  Being accurate is not enough: how accuracy metrics have hurt recommender systems , 2006, CHI Extended Abstracts.

[8]  Bin Xia,et al.  Analysis on Cache-Enabled Wireless Heterogeneous Networks , 2015, IEEE Transactions on Wireless Communications.

[9]  Kai Lai Chung,et al.  A Course in Probability Theory , 1949 .

[10]  Mehrbakhsh Nilashi,et al.  Collaborative filtering recommender systems , 2013 .

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  Mihaela van der Schaar,et al.  Smart caching in wireless small cell networks via contextual multi-armed bandits , 2016, 2016 IEEE International Conference on Communications (ICC).

[13]  John Riedl,et al.  Explaining collaborative filtering recommendations , 2000, CSCW '00.

[14]  Dong Liu,et al.  Caching at the wireless edge: design aspects, challenges, and future directions , 2016, IEEE Communications Magazine.

[15]  Bhaskar D. Rao,et al.  Caching strategies in heterogeneous networks with D2D, small BS and macro BS communications , 2016, 2016 IEEE International Conference on Communications (ICC).

[16]  S. Sénécal,et al.  The influence of online product recommendations on consumers' online choices , 2004 .

[17]  Dong Liu,et al.  Energy Efficiency of Downlink Networks With Caching at Base Stations , 2015, IEEE Journal on Selected Areas in Communications.

[18]  Deniz Gündüz,et al.  Learning-based optimization of cache content in a small cell base station , 2014, 2014 IEEE International Conference on Communications (ICC).