Multi-Armed Bandit Learning for Cache Content Placement in Vehicular Social Networks

This letter analyzes the efficient dissemination of content in a socially aware, cache-enabled hybrid network using multi-armed bandit learning theory. Specifically, we consider a cellular network overlaid on a vehicular social network, where commuters request multimedia content from the stationary road-side units (RSUs), the base station, or, if accessible, the single mobile cache unit (MCU). First, we propose an algorithm to optimally distribute popular content among the locally deployed RSU caches. To further maximize the cache hits experienced by vehicles, we then present an algorithm to find the best traversal path for the MCU based on the commuters' social degree distribution. To evaluate performance, we also derive asymptotic upper bounds on the regret of the two algorithms. Simulations show that the proposed algorithms outperform existing content placement methods in terms of overall network throughput.
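To make the idea of bandit-based cache placement concrete, the following is a minimal illustrative sketch, not the algorithm proposed in the letter. It assumes a single RSU cache, treats each file as an arm, and uses a generic combinatorial UCB rule with semi-bandit feedback: in every round the cache holds the files with the highest upper-confidence indices and observes a hit or miss for each of them. The function name `cucb_cache_placement`, the Bernoulli hit model, the exploration constant 1.5, and the popularity values in the usage note are all hypothetical choices made for illustration.

```python
import math
import random


def cucb_cache_placement(popularity, cache_size, rounds, seed=0):
    """Generic combinatorial-UCB sketch for picking which files to cache.

    popularity: hypothetical per-file hit probabilities (unknown to the learner).
    Returns (counts, means): how often each file was cached and its
    estimated hit rate.
    """
    rng = random.Random(seed)
    n = len(popularity)
    counts = [0] * n    # number of rounds each file was cached
    means = [0.0] * n   # empirical hit rate of each file

    for t in range(1, rounds + 1):
        # UCB index: empirical mean plus an exploration bonus.
        # Files never cached so far get an infinite index so they are tried first.
        index = [
            float("inf") if counts[i] == 0
            else means[i] + math.sqrt(1.5 * math.log(t) / counts[i])
            for i in range(n)
        ]
        # Cache the files with the largest indices.
        cached = sorted(range(n), key=lambda i: index[i], reverse=True)[:cache_size]

        # Semi-bandit feedback (assumed): one Bernoulli hit/miss per cached file.
        for i in cached:
            reward = 1 if rng.random() < popularity[i] else 0
            counts[i] += 1
            means[i] += (reward - means[i]) / counts[i]  # incremental mean

    return counts, means
```

As a usage example, calling `cucb_cache_placement([0.9, 0.8, 0.1, 0.05, 0.2, 0.15], cache_size=2, rounds=5000)` should concentrate almost all caching decisions on the first two files, since the exploration bonus of the less popular files shrinks as they are sampled. The regret in this toy setting is the gap between the hits an oracle placement would collect and the hits the learner collects, which is the kind of quantity the letter bounds for its own algorithms.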
