Towards Wi-Fi AP-Assisted Content Prefetching for On-Demand TV Series: A Reinforcement Learning Approach

The emergence of smart Wi-Fi access points (APs) equipped with large storage space opens a new research area: how to use these resources at the network edge to improve users' quality of experience (QoE), e.g., a short startup delay and smooth playback. One important research topic in this area is content prefetching, which predicts the content users will request and fetches it ahead of time, shifting traffic away from peak periods. In practice, however, differing video watching patterns across users and varying network connection status lead to time-varying server load, which makes the content prefetching problem challenging. To understand this challenge, this paper first performs a large-scale measurement study of users' AP connection and TV series watching patterns using real traces. Then, based on the obtained insights, we formulate the content prefetching problem as a Markov Decision Process (MDP). The objective is to strike a balance between the increased prefetching and storage cost incurred by incorrect predictions and the reduced content download delay achieved by successful predictions. We propose a learning-based approach to solve this problem and adopt three other algorithms as baselines. Specifically, we first investigate the performance lower bound using a random algorithm and the upper bound using an ideal offline approach. We then present a heuristic algorithm as another baseline. Finally, we design a reinforcement learning algorithm that is more practical because it operates online. Through extensive trace-based experiments, we demonstrate the performance gain of our design. Remarkably, our learning-based algorithm achieves better precision and hit ratio (e.g., 80%) with about 70% (resp. 50%) cost saving compared to the random (resp. heuristic) algorithm.
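To make the cost-versus-delay trade-off in the MDP formulation concrete, the following is a minimal sketch of tabular Q-learning on a toy prefetching problem. It is not the paper's algorithm: the state (index of the episode just watched), the binary prefetch action, the `cost` and `delay_saving` values, the `continue_prob` viewing model, and the sample-average step size are all assumptions chosen for illustration.

```python
import random
from collections import defaultdict

# Hypothetical toy model: state = index of the episode just watched,
# action = whether the AP prefetches the next episode.
ACTIONS = (0, 1)  # 0 = do not prefetch, 1 = prefetch the next episode


def reward(action, watched_next, cost=2.0, delay_saving=4.0):
    """Assumed reward: each prefetch pays `cost`; a hit earns `delay_saving`."""
    if action == 0:
        return 0.0
    return (delay_saving if watched_next else 0.0) - cost


def q_learning(continue_prob, episodes=10, sessions=5000,
               gamma=0.8, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration.

    `continue_prob` is the assumed probability that a user goes on to
    watch the next episode; the step size is 1 / (visit count).
    """
    rng = random.Random(seed)
    q = defaultdict(float)    # (state, action) -> estimated value
    visits = defaultdict(int)  # (state, action) -> number of updates
    for _ in range(sessions):
        state = 0
        while state < episodes - 1:
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            watched_next = rng.random() < continue_prob
            r = reward(action, watched_next)
            # The session ends (future value 0) if the user stops watching.
            if watched_next:
                best_next = max(q[(state + 1, a)] for a in ACTIONS)
            else:
                best_next = 0.0
            visits[(state, action)] += 1
            alpha = 1.0 / visits[(state, action)]
            q[(state, action)] += alpha * (r + gamma * best_next - q[(state, action)])
            if not watched_next:
                break
            state += 1
    return q


def greedy_policy(q, episodes=10):
    """Return the greedy action for every non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[(s, a)]) for s in range(episodes - 1)]
```

Under these assumed rewards, prefetching pays off on average only when the chance that the user continues exceeds `cost / delay_saving`, so the learned policy should favor prefetching for viewers who tend to binge a series and avoid it for viewers who rarely continue.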
