Video compression using adaptive wavelet packet and DPCM

This paper presents an efficient temporal, spatial and frequency decomposition method for improving the performance of the existing methods for 3D subband coding of video signals. In this method, a given video sequence is first partitioned into constant-sized 'groups' of frames. Within each group, every two consecutive frames are decomposed into low and high temporal subbands by a two-tap temporal filter. The redundancy in the low temporal subbands is removed by a closed loop DPCM unit. The low and high subbands of the first pair of frames, the difference of the low subbands from the DPCM loop and the high subbands of the subsequent pairs of frames in the 'group' are then divided into image blocks of equal size, such that each block can be efficiently decomposed by an adaptive wavelet packet based on a rate-distortion criterion. The subbands of wavelet packets for all blocks are quantized by a hybrid scalar/pyramidal lattice vector quantizer. This scheme achieves minimum distortion under a rate constraint specified for the 'groups' of frames and under the structural constraints of the algorithm. In this scheme, the motion compensation is not used explicitly, but the effects of motion are accounted for through the low-high temporal subbanding and the DPCM procedure. The results compare favorably with those of traditional video coding techniques. Although the computational complexity of this scheme is higher than that of the existing 3D subband method, it is suitable for parallel implementation.

[1]  Sohail Zafar,et al.  Motion-compensated wavelet transform coding for color video compression , 1992, IEEE Trans. Circuits Syst. Video Technol..

[2]  Wei-Lien Hsu,et al.  Spatial and frequency decomposition for image compression , 1995, Medical Imaging.

[3]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[4]  Ronald R. Coifman,et al.  Signal processing and compression with wavelet packets , 1994 .

[5]  Thomas R. Fischer,et al.  A pyramid vector quantizer , 1986, IEEE Trans. Inf. Theory.

[6]  M. Wickerhauser Acoustic signal compression with wavelet packets , 1993 .

[7]  Nuggehally Sampath Jayant,et al.  Sparse codebooks for the quantization of nondominant sub-bands in image coding , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[8]  Kannan Ramchandran,et al.  Image representation using time-varying wavelet packets, spatial segmentation and quantization , 1993 .

[9]  Nariman Farvardin,et al.  Three-dimensional subband coding of video , 1995, IEEE Trans. Image Process..

[10]  K Ramchandran,et al.  Best wavelet packet bases in a rate-distortion sense , 1993, IEEE Trans. Image Process..

[11]  Antonio Ortega,et al.  Bit allocation for dependent quantization with applications to MPEG video coders , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[13]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[14]  Alberto Leon-Garcia,et al.  Adaptive subband coding of full motion video , 1993, Other Conferences.

[15]  Wei-Lien Hsu,et al.  3D adaptive wavelet packet for video compression , 1995, Proceedings., International Conference on Image Processing.

[16]  Arturo A. Rodriguez,et al.  Evaluating Video Codecs , 1994, IEEE MultiMedia.

[17]  Michel Barlaud,et al.  Pyramidal lattice vector quantization for multiscale image coding , 1994, IEEE Trans. Image Process..

[18]  D. Legall,et al.  MPEG : A video compression standard for multimedia applications , 1991 .

[19]  King Ngi Ngan,et al.  Very low bit rate video coding using 3D subband approach , 1994, IEEE Trans. Circuits Syst. Video Technol..