Multiview Video Coding Using Video Game Context Information

Remote rendering of video games for 3DTV becomes a hot topic with the emergence of 3D-enabled mobile devices and cloud-based services. It is however a very challenging task that requires live encoding at very low latency for user interactivity as well as optimal encoding decisions for an acceptable QoE. One key-aspect is that most video games make use of a 3D engine, which is typically accelerated on a GPU, containing information on the composition of the 3D scene and its objects as well as their motion. In this paper, we explore how to extract this information from the GPU and how to exploit it in order to successfully offload the most time consuming tasks of a multiview video encoder. We show that near-optimal encoding decisions can be taken while minimizing the encoder computational complexity as well as the total delay.

[1]  Marius Preda,et al.  MPEG-4-based adaptive remote rendering for video games , 2011, Web3D '11.

[2]  Peter Eisert,et al.  Accelerated video encoding using render context information , 2010, 2010 IEEE International Conference on Image Processing.

[3]  Enrico Masala,et al.  Rate-distortion optimized low-delay 3D video communications , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[4]  Ying Chen,et al.  The Emerging MVC Standard for 3D Video Services , 2008, EURASIP J. Adv. Signal Process..

[5]  Peter Eisert,et al.  Depth map enhanced macroblock partitioning for H.264 video coding of computer graphics content , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[6]  Thomas Wiegand,et al.  Mixed resolution coding of stereoscopic video for Mobile devices , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[7]  Peter Eisert,et al.  Enhanced Video Streaming for Remote 3 D Gaming , 2010 .