Video Compression Using Spatiotemporal Regularity Flow

We propose a new framework in wavelet video coding to improve the compression rate by exploiting the spatiotemporal regularity of the data. A sequence of images creates a spatiotemporal volume. This volume is said to be regular along the directions in which the pixels vary the least, hence the entropy is the lowest. The wavelet decomposition of regularized data results in a fewer number of significant coefficients, thus yielding a higher compression rate. The directions of regularity of an image sequence depend on both its motion content and spatial structure. We propose the representation of these directions by a 3-D vector field, which we refer to as the spatiotemporal regularity flow (SPREF). SPREF uses splines to approximate the directions of regularity. The compactness of the spline representation results in a low storage overhead for SPREF, which is a desired property in compression applications. Once SPREF directions are known, they can be converted into actual paths along which the data is regular. Directional decomposition of the data along these paths can be further improved by using a special class of wavelet basis called the 3-D orthonormal bandelet basis. SPREF -based video compression not only removes the temporal redundancy, but it also compensates for the spatial redundancy. Our experiments on several standard video sequences demonstrate that the proposed method results in higher compression rates as compared to the standard wavelet based compression

[1]  Detlev Marpe,et al.  Very low bit-rate video coding using wavelet-based techniques , 1999, IEEE Trans. Circuits Syst. Video Technol..

[2]  Yonghui Wang,et al.  3D video coding using redundant-wavelet multihypothesis and motion-compensated temporal filtering , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[3]  Avideh Zakhor,et al.  Multirate 3-D subband coding of video , 1994, IEEE Trans. Image Process..

[4]  Stéphane Mallat,et al.  Sparse geometric image representations with bandelets , 2005, IEEE Transactions on Image Processing.

[5]  I. Daubechies,et al.  Factoring wavelet transforms into lifting steps , 1998 .

[6]  Gunnar Karlsson,et al.  Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[7]  Christine Podilchuk,et al.  Video compression with dense motion fields , 2001, IEEE Trans. Image Process..

[8]  Nariman Farvardin,et al.  Three-dimensional subband coding of video , 1995, IEEE Trans. Image Process..

[9]  Yonghui Wang,et al.  Multihypothesis motion compensation in the redundant wavelet domain , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[10]  David S. Taubman,et al.  Highly scalable video compression with scalable motion coding , 2003, IEEE Transactions on Image Processing.

[11]  Eric Dubois,et al.  Bayesian Estimation of Motion Vector Fields , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ping-Sing Tsai,et al.  JPEG2000 Standard for Image Compression: Concepts, Algorithms and VLSI Architectures , 2004 .

[13]  Tinku Acharya,et al.  JPEG2000 standard for image compression , 2004 .

[14]  Jens-Rainer Ohm Temporal domain sub-band video coding with motion compensation , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  M. Vetterli,et al.  Wavelets, subband coding, and best bases , 1996, Proc. IEEE.

[16]  Zixiang Xiong,et al.  Low bit-rate scalable video coding with 3-D set partitioning in hierarchical trees (3-D SPIHT) , 2000, IEEE Trans. Circuits Syst. Video Technol..

[17]  S. Mallat A wavelet tour of signal processing , 1998 .