QoE-Aware 3D Video Streaming via Deep Reinforcement Learning in Software Defined Networking Enabled Mobile Edge Computing

With the advancements of wireless network transmission technology, 2D video is hard to satisfy people's requirement for multimedia. Therefore, the high-definition 3D video that can bring a whole new viewing experience is starting to enter people's vision. However, when a tremendously large number of users play 3D video, it puts enormous computational pressure on the cloud server, which incurs high transmission latency. To release the tension, in this articl we consider a promising computing and networking architecture by incorporating Mobile Edge Computing (MEC) and Software-defined Networking (SDN) and propose a novel resource allocation model (RAM) to allocate resources and reduce delay. At the same time, we introduce the Quality of Experience (QoE) Model (QoEM), which uses information collected during 3D video playback to adaptively allocate the rate of future tiles. The model addresses the problem of assigning the best transmission speed to the block in the case of time-varying characteristicfactors during transmission. We propose an Actor-Critic-based deep reinforcement learning algorithm for viewport prediction and QoE optimization, called QoE-AC. For the differential transmission in the playback phase, we use the LSTM network for bandwidth and viewport prediction, while combining the historical information of the blocks into the Actor-Critic network as observations. The network can be adaptively assigned the best transmission speed for future tiles based on observations to maximize QoE. Finally, the experimental results show that the actual performance of the model is much better than other existing 3D video network models. Under different QoE targets, our proposed system can be adapted to all situations and has a 10%-15% performance improvement.

[1]  F. Richard Yu,et al.  Adaptive Video Streaming With Edge Caching and Video Transcoding Over Software-Defined Mobile Networks: A Deep Reinforcement Learning Approach , 2020, IEEE Transactions on Wireless Communications.

[2]  Dario Pompili,et al.  On-Demand Video-Streaming Quality of Experience Maximization in Mobile Edge Computing , 2019, 2019 IEEE 20th International Symposium on "A World of Wireless, Mobile and Multimedia Networks" (WoWMoM).

[3]  Jenq-Neng Hwang,et al.  Utility-oriented resource allocation for 360-degree video transmission over heterogeneous networks , 2019, Digit. Signal Process..

[4]  De-Nian Yang,et al.  Efficient Multi-View 3D Video Multicast with Mobile Edge Computing , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[5]  Zongming Guo,et al.  CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive Streaming , 2018, ACM Multimedia.

[6]  Matti Siekkinen,et al.  CloudVR: Cloud Accelerated Interactive Mobile Virtual Reality , 2018, ACM Multimedia.

[7]  Antonios Argyriou,et al.  3DQoE-Oriented and Energy-Efficient 2D plus Depth Based 3D Video Streaming Over Centrally Controlled Networks , 2018, IEEE Transactions on Multimedia.

[8]  Zhetao Li,et al.  Dante: Enabling FOV-Aware Adaptive FEC Coding for 360-Degree Video Streaming , 2018, APNet '18.

[9]  Jinsul Kim,et al.  NFV-Based Mobile Edge Computing for Lowering Latency of 4K Video Streaming , 2018, 2018 Tenth International Conference on Ubiquitous and Future Networks (ICUFN).

[10]  Feng Li,et al.  Rubiks: Practical 360-Degree Streaming for Smartphones , 2018, MobiSys.

[11]  Yao Liu,et al.  ClusTile: Toward Minimizing Bandwidth in 360-degree Video Streaming , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[12]  Jeongho Kwak,et al.  Hybrid Content Caching in 5G Wireless Networks: Cloud Versus Edge Caching , 2018, IEEE Transactions on Wireless Communications.

[13]  Abhishek Nandy,et al.  Reinforcement Learning with Keras, TensorFlow, and ChainerRL , 2018 .

[14]  Marc Peter Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[15]  Songqing Chen,et al.  OpTile: Toward Optimal Tiling in 360-degree Video Streaming , 2017, ACM Multimedia.

[16]  Federico Chiariotti,et al.  D-DASH: A Deep Q-Learning Framework for DASH Video Streaming , 2017, IEEE Transactions on Cognitive Communications and Networking.

[17]  Hongzi Mao,et al.  Neural Adaptive Video Streaming with Pensieve , 2017, SIGCOMM.

[18]  Heiko Ludwig,et al.  Zenith: Utility-Aware Resource Allocation for Edge Computing , 2017, 2017 IEEE International Conference on Edge Computing (EDGE).

[19]  Shiqiang Yang,et al.  A Dataset for Exploring User Behaviors in VR Spherical Video Streaming , 2017, MMSys.

[20]  Cheng-Hsin Hsu,et al.  Fixation Prediction for 360° Video Streaming in Head-Mounted Virtual Reality , 2017, NOSSDAV.

[21]  Aiman Erbad,et al.  Edge computing for interactive media and video streaming , 2017, 2017 Second International Conference on Fog and Mobile Edge Computing (FMEC).

[22]  Dario Pompili,et al.  Collaborative Mobile Edge Computing in 5G Networks: New Paradigms, Scenarios, and Challenges , 2016, IEEE Communications Magazine.

[23]  Xiaoyong Li,et al.  Dynamic Cloudlet-Assisted Energy-Saving Routing Mechanism for Mobile Ad Hoc Networks , 2017, IEEE Access.

[24]  Srikanth Kandula,et al.  Resource Management with Deep Reinforcement Learning , 2016, HotNets.

[25]  Chia-Wen Lin,et al.  mDASH: A Markov Decision-Based Rate Adaptation Approach for Dynamic HTTP Streaming , 2016, IEEE Transactions on Multimedia.

[26]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[27]  Cai Jun,et al.  Efficient Caching Mechanism Based on Soft Defined Information-centric Networks , 2016 .

[28]  Miska M. Hannuksela,et al.  HEVC-compliant viewport-adaptive streaming of stereoscopic panoramic video , 2016, 2016 Picture Coding Symposium (PCS).

[29]  Bruno Sinopoli,et al.  A Control-Theoretic Approach for Dynamic Adaptive Video Streaming over HTTP , 2015, Comput. Commun. Rev..

[30]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[31]  Nan Zhang,et al.  Software Defined Mobile Networks (SDMN): Beyond LTE Network Architecture , 2015 .

[32]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[33]  Carsten Griwodz,et al.  Commute path bandwidth traces from 3G networks: analysis and applications , 2013, MMSys.

[34]  Thomas Stockhammer,et al.  Dynamic adaptive streaming over HTTP --: standards and design principles , 2011, MMSys.

[35]  Yashar Ganjali,et al.  HyperFlow: A Distributed Control Plane for OpenFlow , 2010, INM/WREN.

[36]  Mihaela van der Schaar,et al.  Cross-Layer Optimized Video Streaming Over Wireless Multihop Mesh Networks , 2006, IEEE Journal on Selected Areas in Communications.

[37]  Bernd Girod,et al.  Congestion-distortion optimized video transmission over ad hoc networks , 2005, Signal Process. Image Commun..

[38]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[39]  Vijay R. Konda,et al.  Actor-Critic Algorithms , 1999, NIPS.

[40]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.