A locally adaptive perceptual masking threshold model for image coding

This paper involves designing, implementing, and testing of a locally adaptive perceptual masking threshold model for image compression. This model computes, based on the contents of the original images, the maximum amount of noise energy that can be injected at each transform coefficient that results in perceptually distortion-free still images or sequences of images. The adaptive perceptual masking threshold model can be used as a pre-processor to a JPEG compression standard image coder. DCT coefficients less than their corresponding perceptual thresholds can be set to zero before the normal JPEG quantization and Huffman coding steps. The result is an image-dependent gain in the bit rate needed for transparent coding. In an informal subjective test involving 318 still images in the AT&T Bell Laboratory image database, this model provided a gain in bit-rate saving on the order of 10 to 30%.

[1]  H. Wilson,et al.  Orientation bandwidths of spatial mechanisms measured by masking. , 1984, Journal of the Optical Society of America. A, Optics and image science.

[2]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[3]  B. Julesz,et al.  Spatial-frequency masking in vision: critical bands and spread of masking. , 1972, Journal of the Optical Society of America.

[4]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[5]  R. Haber,et al.  Visual Perception , 2018, Encyclopedia of Database Systems.

[6]  C Blakemore,et al.  On the existence of neurones in the human visual system selectively sensitive to the orientation and size of retinal images , 1969, The Journal of physiology.

[7]  Albert J. Ahumada,et al.  Improved detection model for DCT coefficient quantization , 1993, Electronic Imaging.

[8]  Heidi A. Peterson,et al.  Luminance-model-based DCT quantization for color image compression , 1992, Electronic Imaging.

[9]  R. R. Clarke Transform coding of images , 1985 .

[10]  K WallaceGregory The JPEG still picture compression standard , 1991 .

[11]  R. J. Safranek,et al.  A perceptually tuned sub-band image coder with image dependent quantization and post-quantization data compression , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[12]  Robert J. Safranek,et al.  Signal compression based on models of human perception , 1993, Proc. IEEE.

[13]  Aaron D. Wyner,et al.  Coding Theorems for a Discrete Source With a Fidelity CriterionInstitute of Radio Engineers, International Convention Record, vol. 7, 1959. , 1993 .

[14]  Jae S. Lim,et al.  Two-Dimensional Signal and Image Processing , 1989 .

[15]  Scott J. Daly,et al.  Visible differences predictor: an algorithm for the assessment of image fidelity , 1992, Electronic Imaging.

[16]  Andrew B. Watson,et al.  Visually optimal DCT quantization matrices for individual images , 1993, [Proceedings] DCC `93: Data Compression Conference.

[17]  D. G. Albrecht,et al.  Spatial frequency selectivity of cells in macaque visual cortex , 1982, Vision Research.

[18]  Andrew B. Watson,et al.  The cortex transform: rapid computation of simulated neural images , 1987 .

[19]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.