Intelligent Edge-Assisted Crowdcast with Deep Reinforcement Learning for Personalized QoE

Recent years have seen rapid growth and great success in interactive crowdsourced livecast (i.e., crowdcast). Unlike traditional livecast services, crowdcast features an enormous volume of video content on the broadcaster side, highly diverse viewing environments and preferences on the viewer side, and personalized quality of experience (QoE) demands (e.g., individual preferences regarding streaming delay, channel switching latency, and bitrate). This imposes unprecedented challenges on how to flexibly and cost-effectively accommodate the heterogeneous, personalized QoE demands of the mass of viewers. In this paper, we propose DeepCast, an edge-assisted crowdcast framework that makes intelligent decisions at the edge, based on massive amounts of real-time information from the network and viewers, to accommodate personalized QoE at minimal system cost. Given the excessive computational complexity of this problem, we propose a data-driven deep reinforcement learning (DRL) based solution that automatically learns the most suitable strategies for viewer scheduling and transcoding selection. To the best of our knowledge, DeepCast is the first edge-assisted framework that applies advances in DRL to explicitly optimize personalized QoE for crowdcast services. We collect multiple real-world datasets and evaluate the performance of DeepCast through trace-driven experiments. The results demonstrate the superiority of the DeepCast framework and its DRL-based solution.
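To make the decision loop concrete, the sketch below shows a toy REINFORCE-style agent that, for each arriving viewer, selects an (edge server, transcoded bitrate) pair. This is not the authors' implementation: the state features, the linear softmax policy, and the QoE/cost terms in the reward are hypothetical placeholders, included only to illustrate how viewer scheduling and transcoding selection can be cast as a DRL problem with a personalized-QoE-minus-cost reward.

```python
# Illustrative sketch only (not DeepCast itself): a minimal policy-gradient agent
# that assigns each viewer to an (edge server, bitrate) action. All features,
# weights, and cost terms below are assumed for illustration.
import numpy as np

N_EDGES, N_BITRATES = 4, 3            # assumed small action space: edge x bitrate
N_ACTIONS = N_EDGES * N_BITRATES
STATE_DIM = 6                         # e.g., viewer bandwidth, per-edge RTT, preference weights

rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(STATE_DIM, N_ACTIONS))   # linear policy parameters

def policy(state):
    """Softmax distribution over (edge, bitrate) actions given the viewer/network state."""
    logits = state @ W
    p = np.exp(logits - logits.max())
    return p / p.sum()

def reward(state, action):
    """Hypothetical reward: personalized QoE minus system cost."""
    edge, bitrate = divmod(action, N_BITRATES)
    qoe = bitrate - 0.5 * state[1 + edge]   # favor higher bitrate, penalize RTT to the chosen edge
    cost = 0.2 * bitrate                    # transcoding/bandwidth cost grows with bitrate
    return qoe - cost

alpha = 0.01
for step in range(2000):                    # one REINFORCE update per simulated viewer arrival
    state = rng.uniform(size=STATE_DIM)
    probs = policy(state)
    a = rng.choice(N_ACTIONS, p=probs)
    r = reward(state, a)
    grad_log = -np.outer(state, probs)      # gradient of log softmax w.r.t. W
    grad_log[:, a] += state
    W += alpha * r * grad_log               # ascend the expected reward
```

In a full system the tabular-style linear policy above would be replaced by a neural network trained from traces, and the reward would encode each viewer's own weighting of streaming delay, switching latency, bitrate, and the operator's delivery and transcoding cost.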
