Spatio-Temporal Activity based Tiling for Panorama Streaming

In panorama streaming an arbitrary Region-of-Interest (RoI) of a high-resolution video is transmitted, allowing users to navigate interactively within the videos. Transmitting the whole video becomes unfeasible due to the required high bitrates and sending a single video per user, which is encoded for that specific user (i.e. its RoI), has scalability issues. Tile based panoramic streaming overcomes the mentioned drawbacks by allowing users to receive a set of tiles that match their RoI instead of the whole set of tiles. However, optimal tiling -- so that the transmitted bitrate of the RoI content is minimized - is content dependent. In this paper, we propose a model based on a spatio-temporal activity metric so that optimization of the tiling process can be performed in a low complexity manner.

[1]  Bernd Girod,et al.  Spatial-Random-Access-Enabled Video Coding for Interactive Virtual Pan/Tilt/Zoom Functionality , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Aljoscha Smolic,et al.  Efficient representation and interactive streaming of high-resolution panoramic views , 2002, Proceedings. International Conference on Image Processing.

[3]  Hans Stokking,et al.  Spatial segmentation for immersive media delivery , 2011, 2011 15th International Conference on Intelligence in Next Generation Networks.

[4]  Peter Eisert,et al.  The Ultimate Immersive Experience: Panoramic 3D Video Acquisition , 2012, MMM.

[5]  Eckehard G. Steinbach,et al.  Bit rate estimation for H.264/AVC video encoding based on temporal and spatial activities , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[6]  Cyril Concolato,et al.  Tiled-based adaptive streaming using MPEG-DASH , 2016, MMSys.

[7]  Zhengguo Li,et al.  A Novel Rate Control Scheme for Low Delay Video Communication of H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.