Coding Order Decision of B Frames for Rate-Distortion Performance Improvement in Single-View Video and Multiview Video Coding

The coding gain that can be achieved by improving the coding order of B frames in the H.264/AVC standard is investigated in this work. We first represent the coding order of B frames and their reference frames with a binary tree. We then formulate a recursive equation to find out the binary tree that provides a suboptimal, but very efficient, coding order. The recursive equation is efficiently solved using a dynamic programming method. Furthermore, we extend the coding order improvement technique to the case of multiview video sequences, in which the quadtree representation is used instead of the binary tree representation. Simulation results demonstrate that the proposed algorithm provides significantly better R-D performance than conventional prediction structures.

[1]  Thomas Wiegand,et al.  Long-term memory motion-compensated prediction , 1999, IEEE Trans. Circuits Syst. Video Technol..

[2]  Clifford Stein,et al.  Introduction to Algorithms, 2nd edition. , 2001 .

[3]  Yo-Sung Ho,et al.  Efficient view-temporal prediction structures for multi-view video coding , 2008 .

[4]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[6]  Bernard Harris,et al.  Graph theory and its applications , 1970 .

[7]  Heiko Schwarz,et al.  Analysis of Hierarchical B Pictures and MCTF , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[8]  Barry G. Haskell,et al.  I/P/B frame type decision by collinearity of displacements , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[9]  Wenxian Yang,et al.  A multiview sequence CODEC with view scalability , 2004, Signal Process. Image Commun..

[10]  Jungwoo Lee,et al.  Rate-distortion optimized frame type selection for MPEG encoding , 1997, IEEE Trans. Circuits Syst. Video Technol..

[11]  Yutaka Yokoyama Adaptive GOP structure selection for real-time MPEG-2 video encoding , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[12]  Dongxiao Li,et al.  Optimising inter-view prediction structure for multiview video coding with minimum spanning tree , 2007 .

[13]  Markus Flierl,et al.  Low-latency video transmission over lossy packet networks using rate-distortion optimized reference picture selection , 2002, Proceedings. International Conference on Image Processing.

[14]  Itu-T Video coding for low bitrate communication , 1996 .

[15]  Sang Uk Lee,et al.  Graph Theoretical Optimization of Prediction Structure in Multiview Video Coding , 2007, 2007 IEEE International Conference on Image Processing.

[16]  Toshiaki Fujii,et al.  Multi-View Video Coding using View Interpolation and Reference Picture Selection , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[17]  Thomas Wiegand,et al.  Multi-frame motion compensated prediction for video transmission , 2001 .

[18]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[19]  Antonio Ortega,et al.  Rate-distortion methods for image and video compression , 1998, IEEE Signal Process. Mag..

[20]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Wenxian Yang,et al.  An MPEG-4-compatible stereoscopic/multiview video coding scheme , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Thomas Wiegand,et al.  Error-resilient video transmission using long-term memory motion-compensated prediction , 2000, IEEE Journal on Selected Areas in Communications.

[23]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .