Coding efficiency and complexity analysis of MVC prediction structures

Based on the idea of exploiting the statistical dependencies from both temporal and inter-view reference pictures for motion-compensated prediction, this paper presents a systematic evaluation of multi-view video coding (MVC) with optimized prediction structures. The compression method is based on the multiple reference picture technique of the H.264/AVC video coding standard. The advantages of hierarchical B pictures for temporal prediction are combined with inter-view prediction at different temporal hierarchy levels, ranging from simulcast coding with no inter-view prediction up to full-level inter-view prediction. With inter-view prediction at the key-picture temporal level only, average gains of 1.4 dB PSNR are reported; with inter-view prediction additionally applied at the non-key-picture temporal levels, average gains of 1.6 dB PSNR are reported. In some cases, gains of more than 3 dB are obtained, corresponding to bit rate savings of up to 50%.
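The range of prediction structures described above, from simulcast to full-level inter-view prediction, can be illustrated with a minimal sketch. It assumes a dyadic hierarchical-B GOP in which key pictures sit at temporal level 0, and it assumes that view 0 is a base view coded without inter-view references. The function names and the `max_interview_level` parameter are hypothetical and serve only this illustration; they are not taken from the paper or from any reference software.

```python
def temporal_level(poc: int, gop_size: int) -> int:
    """Temporal level of a picture in a dyadic hierarchical-B GOP.

    Assumption: key pictures (poc a multiple of gop_size) are level 0,
    and each further dyadic split adds one level.
    """
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    pos = poc % gop_size
    if pos == 0:
        return 0  # key picture
    level = gop_size.bit_length() - 1  # log2(gop_size) = deepest level
    while pos % 2 == 0:
        pos //= 2
        level -= 1
    return level


def uses_inter_view(poc: int, view: int, gop_size: int,
                    max_interview_level: int) -> bool:
    """Decide whether a picture may use inter-view reference pictures.

    max_interview_level is a hypothetical knob:
      -1 -> simulcast (no inter-view prediction),
       0 -> inter-view prediction at key pictures only,
       log2(gop_size) -> full-level inter-view prediction.
    Assumption: view 0 is the base view and is always coded independently.
    """
    if view == 0:
        return False
    return temporal_level(poc, gop_size) <= max_interview_level


if __name__ == "__main__":
    gop = 8
    for mode, limit in [("simulcast", -1), ("key pictures", 0), ("full", 3)]:
        flags = [uses_inter_view(p, 1, gop, limit) for p in range(gop + 1)]
        print(mode, flags)
```

Raising `max_interview_level` one step at a time walks through the intermediate structures between the two extremes, which is the axis along which the coding gains above are compared.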
