Towards QoS-Aware Cloud Live Transcoding: A Deep Reinforcement Learning Approach

Video transcoding is widely adopted in live streaming services to bridge the format and resolution gap between content producers and consumers (i.e., broadcasters and viewers). Meanwhile, the cloud has been recognized as one of the most reliable and cost-effective ways for video transcoding. However, due to the dynamic and uncertainty of the transcoding workloads in live streaming, it is very challenging for cloud service providers to provision computing resources and schedule transcoding tasks while guaranteeing the Service Level Agreement (SLA). To this end, we propose a joint resource provisioning and task scheduling approach for transcoding live streams in the cloud. We adopt Deep Reinforcement Learning (DRL) to train a neural network model for resource provisioning under dynamic workloads. Moreover, we design a QoS-aware task scheduling algorithm that maps transcoding tasks to Virtual Machines (VMs) by considering the real-time QoS requirement. We evaluate our approach with trace-driven experiments and the results demonstrate that our approach outperforms heuristic baselines by up to 89% improvements on average QoS with 4% extra resource overhead at most.

[1]  Magdy A. Bayoumi,et al.  VLSC: Video Live Streaming Using Cloud Services , 2016, 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom).

[2]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[3]  Yonggang Wen,et al.  Dynamic Resource Provisioning with QoS Guarantee for Video Transcoding in Online Video Sharing Service , 2016, ACM Multimedia.

[4]  Rajkumar Buyya,et al.  Cost-Efficient and Robust On-Demand Video Transcoding Using Heterogeneous Cloud Services , 2018, IEEE Transactions on Parallel and Distributed Systems.

[5]  Zhengfang Duanmu,et al.  A Quality-of-Experience Database for Adaptive Video Streaming , 2018, IEEE Transactions on Broadcasting.

[6]  Adrien Lèbre,et al.  Virtual Machine Boot Time Model , 2017, 2017 25th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP).

[7]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[8]  Feng Liu,et al.  AuTO: scaling deep reinforcement learning for datacenter-scale automatic traffic optimization , 2018, SIGCOMM.

[9]  Yonggang Wen,et al.  Resource Provisioning and Profit Maximization for Transcoding in Clouds: A Two-Timescale Approach , 2017, IEEE Transactions on Multimedia.

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.