Overview of the Scalable Video Coding Extension of the H.264/AVC Standard

With the introduction of the H.264/AVC video coding standard, significant improvements have recently been demonstrated in video compression capability. The Joint Video Team of the ITU-T VCEG and the ISO/IEC MPEG has now also standardized a Scalable Video Coding (SVC) extension of the H.264/AVC standard. SVC enables the transmission and decoding of partial bit streams to provide video services with lower temporal or spatial resolutions or reduced fidelity while retaining a reconstruction quality that is high relative to the rate of the partial bit streams. Hence, SVC provides functionalities such as graceful degradation in lossy transmission environments as well as bit rate, format, and power adaptation. These functionalities provide enhancements to transmission and storage applications. SVC has achieved significant improvements in coding efficiency with an increased degree of supported scalability relative to the scalable profiles of prior video coding standards. This paper provides an overview of the basic concepts for extending H.264/AVC towards SVC. Moreover, the basic tools for providing temporal, spatial, and quality scalability are described in detail and experimentally analyzed regarding their efficiency and complexity.
