Fast H.264 Mode Selection Using Depth Information for Distributed Game Viewing

While H.264 has been known for its superior coding performance over its predecessors, its encoding complexity can be high, especially if an exhaustive search is performed for the rate-distortion optimal mode among many available coding modes for each macroblock. In this paper, we show that in the context of coding 3D graphically rendered frames, depth information-the distance between rendered pixels of objects and the virtual camera popularly used to determine object occlusion during graphics rendering-can be exploited to expedite the mode selection process at a negligible cost of decrease in coding performance. Our proposed techniques take into consideration that regions of similar depth are likely to correspond to regions of uniform motion, an observation that can be exploited to speed-up the selection of block sizes for motion compensation. In our experiments, we show that complexity of the full motion vector search can be reduced by 40% at the expense of small increase of less than 0.7% in bitrate and small decrease of less than 0.07dB in PSNR. Moreover, because our proposed schemes operate in the depth domain, we show that they can be easily combined with an existing pixel-domain mode selection algorithm in the literature leading to similar complexity reductions.

[1]  Oscar C. Au,et al.  A novel approach to fast multi-block motion estimation for H.264 video coding , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[2]  Chad Zhu RTP Payload Format for H.263 Video Streams , 1997, RFC.

[3]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[4]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[5]  Jill M. Boyce,et al.  Fast mode decision and motion estimation for JVT/H.264 , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[6]  Tien-Ying Kuo,et al.  Fast Variable Block Size Motion Estimation for H.264 Using Likelihood and Correlation of Motion Field , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Dipl.-Ing,et al.  Real-time Rendering , 2022 .

[8]  Susanto Rahardja,et al.  Fast intermode decision in H.264/AVC video coding , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Gene Cheung,et al.  Graphics-to-video encoding for 3G mobile game viewer multicast using depth values , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[10]  Zhi Zhou,et al.  Fast macroblock inter mode decision and motion estimation for H.264/MPEG-4 AVC , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[11]  Antonio Ortega,et al.  ECHO: a community video streaming system with interactive visual overlays , 2008, Electronic Imaging.

[12]  Sanjit K. Mitra,et al.  Rate-distortion optimized mode selection for very low bit rate video coding and the emerging H.263 standard , 1996, IEEE Trans. Circuits Syst. Video Technol..

[13]  Srinivasan Seshan,et al.  Supporting Spectators in Online Multiplayer Games , 2004, SIGCOMM 2004.

[14]  G. Cheung Directed acyclic graph based mode optimization for H.263 video encoding , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[15]  Sanjit K. Mitra,et al.  Combined mode selection and macroblock quantization step adaptation for the H.263 video encoder , 1997, Proceedings of International Conference on Image Processing.