Perceptually based quantization technique for MPEG encoding

We present a technique for controlling the adaptive quantization process in an MPEG encoder, which improves upon the commonly used TM5 rate controller. The method combines both a spatial masking model and a technique for automatically determining the visually important areas in a scene. The spatial masking model has been designed with consideration of the structure of compressed natural images. It takes into account the different levels of distortion that are tolerable by viewers in different parts of a picture by segmenting the scene into flat, edge, and textured regions and quantizing these regions differently. The visually important scene areas are calculated using Importance Maps. These maps are generated by combining factors known to influence human visual attention and eye movements. Lower quantization is assigned to visually important regions, while areas classified as being of low visual importance are more harshly quantized. Results indicate a subjective improvement in picture quality, in comparison to the TM5 method. Less ringing occurs at edges, and the visually important areas of a picture are more accurately coded. This is particularly noticeable at low bit rates. The technique is computationally efficient and flexible, and can easily be extended to specific applications.

[1]  Jiying Zhao,et al.  An Outstandingness Oriented Image Segmentation and its Application , 1996, Fourth International Symposium on Signal Processing and Its Applications.

[2]  Anthony J. Maeder,et al.  Limiting human perception for image sequences , 1996, Electronic Imaging.

[3]  Neil W. Bergmann,et al.  An MPEG encoder incorporating perceptually based quantisation , 1997, TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162).

[4]  P. Cavanagh,et al.  Attentional resolution and the locus of visual awareness , 1996, Nature.

[5]  Christof Koch,et al.  Control of Selective Visual Attention: Modeling the Where Pathway , 1995, NIPS.

[6]  J. Findlay The Visual Stimulus for Saccadic Eye Movements in Human Observers , 1980, Perception.

[7]  Rajesh P. N. Rao,et al.  Eye Movements in Visual Cognition: A Computational Study , 1997 .

[8]  Anthony J. Maeder,et al.  Automatic identification of perceptually important regions in an image , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[9]  Michael P. Eckert,et al.  The significance of eye movements and image acceleration for coding television image sequences , 1993 .

[10]  Andrew B. Watson,et al.  Image quality and entropy masking , 1997, Electronic Imaging.

[11]  Hsueh-Ming Hang,et al.  Scene Analysis for DCT Image Coding , 1994 .

[12]  Wa James Tam,et al.  Static and dynamic spatial resolution in image coding: an investigation of eye movements , 1991, Electronic Imaging.

[13]  Christophe De Vleeschouwer,et al.  Automatic detection of interest areas of an image or of a sequence of images , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[14]  John K. Tsotsos An inhibitory beam for attentional selection , 1994 .

[15]  Reginald L. Lagendijk,et al.  Spatiotemporal model of human vision for digital video compression , 1997, Electronic Imaging.

[16]  A. L. Yarbus,et al.  Eye Movements and Vision , 1967, Springer US.

[17]  A van Meeteren,et al.  Effects of pictorial noise interfering with visual detection. , 1988, Journal of the Optical Society of America. A, Optics and image science.

[18]  Geoffrey M. Boynton,et al.  New model of human luminance pattern vision mechanisms: analysis of the effects of pattern orientation, spatial phase, and temporal frequency , 1994, Other Conferences.