Image coding with mixed representations and visual masking

We propose a novel approach for low bit rate perceptually transparent image compression. It exploits both frequency and spatial visual masking effects and uses a combination of Fourier and wavelet transforms to encode different bands. Frequency domain masking is computed by using a fine to coarse analysis step. Spatial domain masking is computed either by using Girod's (1989) model or a coarse to fine analysis step that accurately computes local contrast. A discrete cosine transform is used in conjunction with frequency domain masking to encode the low frequency bands. The medium and high frequency bands are encoded using spatial domain masking and a wavelet transform. The encoding of these bands is based on a recursive selection of the important edges in each band. It uses cross-band prediction to minimize the bit rate. Experiments show the approach can achieve a very high quality to nearly transparent compression at bit rates of 0.2 to 0.4 bits/pixel.

[1]  R. J. Safranek,et al.  A perceptually tuned sub-band image coder with image dependent quantization and post-quantization data compression , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[2]  Dennis M. Healy,et al.  Wavelet transform domain filters: a spatially selective noise filtration technique , 1994, IEEE Trans. Image Process..

[3]  E. Peli Contrast in complex images. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[4]  M. Kunt,et al.  Second-generation image-coding techniques , 1985, Proceedings of the IEEE.

[5]  A. SAADANE,et al.  Masking and quantization laws in a visual subband coding scheme , 1994, Proceedings of 1st International Conference on Image Processing.

[6]  Deepen Sinha,et al.  Low bit rate transparent audio compression using adapted wavelets , 1993, IEEE Trans. Signal Process..

[7]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[8]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[9]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[10]  Bernd Girod,et al.  The Information Theoretical Significance of Spatial and Temporal Masking in Video Signals , 1989, Photonics West - Lasers and Applications in Science and Engineering.