A scene adaptive and signal adaptive quantization for subband image and video compression using wavelets

The discrete wavelet transform (DWT) provides an advantageous framework of multiresolution space-frequency representation with promising applications in image processing. The challenge as well as the opportunity in wavelet-based compression is to exploit the characteristics of the subband coefficients with respect to both spectral and spatial localities. A common problem with many existing quantization methods is that the inherent image structures are severely distorted with coarse quantization. Observation shows that subband coefficients with the same magnitude generally do not have the same perceptual importance. We propose in this paper a scene adaptive and signal adaptive quantization scheme capable of exploiting the spectral and spatial localization properties resulting from the wavelet transform. The quantization is implemented as maximum a posteriori probability estimation-based clustering in which subband coefficients are quantized to their cluster means, subject to local spatial constraints. The intensity distribution of each cluster within a subband is modeled by an optimal Laplacian source to achieve signal adaptivity, while spatial constraints are enforced by appropriate Gibbs random fields (GRF) to achieve scene adaptivity. With spatially isolated coefficients removed and clustered coefficients retained at the same time, the available bits are allocated to visually important scene structures so that the information loss is least perceptible. Furthermore, the reconstruction noise in the decompressed image can be suppressed using another GRF-based enhancement algorithm.

[1]  William Grimson,et al.  Object recognition by computer - the role of geometric constraints , 1991 .

[2]  J. Woods,et al.  Sub-band coding of images , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[4]  Sei-ichiro Kamata,et al.  Hilbert scanning arithmetic coding for multispectral image compression , 1991, Optics & Photonics.

[5]  James L. Flanagan,et al.  Digital coding of speech in sub-bands , 1976, The Bell System Technical Journal.

[6]  H. Gharavi,et al.  Sub-band coding of monochrome and color images , 1988 .

[7]  Arnaud E. Jacquin,et al.  Subband video coding with a dynamic bit allocation and geometric vector quantization , 1992, Electronic Imaging.

[8]  Nariman Farvardin,et al.  Three-dimensional subband coding of video , 1995, IEEE Trans. Image Process..

[9]  Jiebo Luo,et al.  A new method for block effect removal in low bit-rate image compression , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[11]  N. M. Nasrabadi,et al.  Subband coding of video using an edge-based vector quantization technique for compression of the upper bands , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Robert L. Stevenson,et al.  Reduction of coding artifacts in transform image coding , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[14]  Shimon Ullman,et al.  Structural Saliency: The Detection Of Globally Salient Structures using A Locally Connected Network , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[15]  James L. Flanagan,et al.  Digital coding of speech in sub-bands , 1976, Bell Syst. Tech. J..

[16]  J. Hartung,et al.  Architecture for the real-time implementation of three-dimensional subband video coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Peter H. Westerink,et al.  Subband coding of images using vector quantization , 1988, IEEE Trans. Commun..

[18]  Hamid Gharavi,et al.  Subband Coding of Video Signals , 1991 .

[19]  A. Witkin,et al.  On the Role of Structure in Vision , 1983 .

[20]  Jiebo Luo,et al.  Applications of Gibbs random field in image processing: from segmentation to enhancement , 1994, Other Conferences.

[21]  M. Vetterli Multi-dimensional sub-band coding: Some theory and algorithms , 1984 .

[22]  Adrian S. Lewis,et al.  Image compression using the 2-D wavelet transform , 1992, IEEE Trans. Image Process..

[23]  Michel Barlaud,et al.  Image coding using vector quantization in the wavelet transform domain , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[24]  Haluk Derin,et al.  Modeling and Segmentation of Noisy and Textured Images Using Gibbs Random Fields , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Sanjit K. Mitra,et al.  A technique for the efficient coding of the upper bands in subband coding of images , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[26]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[27]  R. J. Safranek,et al.  A perceptually tuned sub-band image coder with image dependent quantization and post-quantization data compression , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[28]  Jiebo Luo,et al.  Low bit rate wavelet-based image and video compression with adaptive quantization, coding and postprocessing , 1996 .

[29]  Sridhar Lakshmanan,et al.  Simultaneous Parameter Estimation and Segmentation of Gibbs Random Fields Using Simulated Annealing , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  John W. Woods,et al.  Subband finite state scalar quantization , 1996, IEEE Trans. Image Process..

[31]  Michel Barlaud,et al.  Image coding using wavelet transform , 1992, IEEE Trans. Image Process..

[32]  William A. Pearlman,et al.  Three-dimensional subband coding of video using the zero-tree method , 1996, Other Conferences.

[33]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Nuggehally Sampath Jayant,et al.  Sparse codebooks for the quantization of nondominant sub-bands in image coding , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[35]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[36]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[37]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[38]  R. H. Bamberger,et al.  New subband decompositions and coders for image and video compression , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[39]  Thrasyvoulos N. Pappas,et al.  An Adaptive Clustering Algorithm For Image Segmentation , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[40]  D. Legall,et al.  MPEG : A video compression standard for multimedia applications , 1991 .

[41]  Ming Lei Liou,et al.  Overview of the p×64 kbit/s video coding standard , 1991, CACM.

[42]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[43]  Jiebo Luo,et al.  On the application of Gibbs random field in image processing: from segmentation to enhancement , 1995, J. Electronic Imaging.

[44]  John W. Woods,et al.  Subband coding of images , 1986, IEEE Trans. Acoust. Speech Signal Process..

[45]  Gunnar Karlsson,et al.  Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[46]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..