Implementation and evaluation of adaptive video streaming based on Markov decision process

In HTTP-based adaptive streaming systems, media server simply stores video content segmented into a series of small chunks coded in different qualities and sizes. The decision for next chunk's quality level to achieve a high quality viewing experience is left to the client which is a challenging task, especially in mobile environment due to unexpected changes in network bandwidth. Using computer simulations, previous work has demonstrated that Markov decision process (MDP) is very effective for such decision making and that it can reduce video freezing or re-buffering events drastically compared to other methods of adaptation. However, to date there has been no practical implementation and evaluation of MDP-based DASH players. In this work, we extend a publicly available DASH player recently released by DASH industry forum to realise a real DASH player that implements MDP-based video adaptation. We implement two alternative MDP optimisation algorithms, value iteration and Q learning and evaluate their performances in real driving conditions under 300 minutes of video streaming. Our results show that value iteration and Q learning reduce video freezing by a factor of 8 and 11, respectively, compared to the default decision making algorithm implemented in the public DASH player.

[1]  Mahbub Hassan,et al.  HTTP-Based Adaptive Streaming for Mobile Clients using Markov Decision Process , 2013, 2013 20th International Packet Video Workshop.

[2]  Ayub Bokani Location-Based Adaptation for DASH in Vehicular Environment , 2014, CoNEXT Student Workshop '14.

[3]  Rolf Stadler,et al.  In-Network Monitoring , 2010, Algorithms for Next Generation Networks.

[4]  Mahbub Hassan,et al.  Optimizing HTTP-Based Adaptive Streaming in Vehicular Environment Using Markov Decision Process , 2015, IEEE Transactions on Multimedia.

[5]  Thomas Stockhammer,et al.  Dynamic adaptive streaming over HTTP --: standards and design principles , 2011, MMSys.

[6]  Ran Li,et al.  Markov decision process toolbox , 2009, 2009 IEEE International Workshop on Open-source Software for Scientific Computation (OSSC).

[7]  Ayub Bokani,et al.  A Geo-Adaptive JavaScript DASH Player , 2014, VideoNext '14.

[8]  Chris Watkins,et al.  Learning from delayed rewards , 1989 .