An improved 3D wavelet-based scalable video coding codec for MC-EZBC

With the rapid growth of modern multimedia applications, 3D wavelet-based scalable video coding (SVC) codecs have received considerable attention because of their high coding performance and flexible bitstream scalability. Such a codec combines motion-compensated temporal filtering (MCTF) with spatial decomposition to produce an embedded bitstream that offers various levels of video quality over heterogeneous networks. However, existing 3D wavelet-based SVC schemes have three limitations: the block types available to the block matching algorithm are limited, the weighting matrices for block-wise motion compensation are fixed, and variations in the activity of the temporal subbands are not considered when selecting the Lagrange multiplier for mode decision. In this paper, our major contribution is to extend the well-known scalable subband/wavelet video codec Motion-Compensated Embedded Zero Block Coding (MC-EZBC) with three novel, content-adaptive algorithms. First, an enhanced hierarchical variable size block matching (Enhanced HVSBM) algorithm is proposed for variable block size motion estimation. Second, a rate-distortion optimization (RDO) based adaptive Lagrange multiplier selection model for mode decision is presented. Third, we introduce an adaptive weighting matrix design for overlapped block motion compensation (OBMC). Experimental results show that all three proposed algorithms significantly improve the overall coding performance of MC-EZBC. Comparisons with other popular wavelet-based SVC codecs demonstrate the effectiveness of our improved codec in terms of both video quality and computational complexity.
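To make the role of the Lagrange multiplier in mode decision concrete, the following is a minimal sketch of the generic Lagrangian cost J = D + λR that rate-distortion optimized mode decision commonly minimizes. It is not the adaptive selection model proposed in the paper; the class name `ModeCandidate`, the mode labels, and all numeric values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class ModeCandidate:
    name: str          # hypothetical mode label, e.g. a block partition
    distortion: float  # e.g. sum of squared differences after motion compensation
    rate_bits: float   # bits spent on motion vectors and mode signalling


def lagrangian_cost(candidate: ModeCandidate, lam: float) -> float:
    """Generic rate-distortion cost J = D + lambda * R."""
    return candidate.distortion + lam * candidate.rate_bits


def choose_mode(candidates: list[ModeCandidate], lam: float) -> ModeCandidate:
    """Pick the candidate with the smallest Lagrangian cost."""
    return min(candidates, key=lambda c: lagrangian_cost(c, lam))


# Illustrative (made-up) candidates: a coarse partition that is cheap to code
# and a finer partition that predicts better but costs more bits.
candidates = [
    ModeCandidate("16x16", distortion=1200.0, rate_bits=12.0),
    ModeCandidate("8x8 split", distortion=700.0, rate_bits=40.0),
]

low = choose_mode(candidates, lam=5.0)    # small lambda favours low distortion
high = choose_mode(candidates, lam=50.0)  # large lambda favours low rate
print(low.name, "|", high.name)
```

With these made-up numbers, the small multiplier selects the finer partition and the large multiplier selects the coarser one, which illustrates why a multiplier that ignores the activity of the temporal subbands can bias the partitioning.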
