Linear transform for simultaneous diagonalization of covariance and perceptual metric matrix in image coding

Two types ofredundancies are contained in images: statistical redundancy and psychovisual redundancy. Image representation techniques for image coding should remove both redundancies in order to obtain good results. In order to establish an appropriate representation, the standard approach to transform coding only considers the statistical redundancy, whereas the psychovisual factors are introduced after the selection ofthe representation as a simple scalar weighting in the transform domain. In this work, we take into account the psychovisual factors in the de8nition of the representation together with the statistical factors, by means of the perceptual metric and the covariance matrix, respectively. In general the ellipsoids described by these matrices are not aligned. Therefore, the optimal basis for image representation should simultaneously diagonalize both matrices. This approach to the basis selection problem has several advantages in the particular application ofimage coding. As the transform domain is Euclidean (by de8nition), the quantizer design is highly simpli8ed and at the same time, the use ofscalar quantizers is truly justi8ed. The proposed representation is compared to covariance-based representations such as the DCT and the KLT or PCA using standard JPEG-like and Max-Lloyd quantizers. ? 2003 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

[1]  J. Malo,et al.  Comparison of perceptually uniform quantisation with average error minimisation in image transform coding , 1999 .

[2]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[3]  B. Dubrovin,et al.  Modern geometry--methods and applications , 1984 .

[4]  M. Carandini,et al.  Summation and division by neurons in primate visual cortex. , 1994, Science.

[5]  Francesc J. Ferri,et al.  Non-linear Invertible Representation for Joint Statistical and Perceptual Feature Decorrelation , 2000, SSPR/SPR.

[6]  Eero P. Simoncelli,et al.  Modeling Surround Suppression in V1 Neurons with a Statistically Derived Normalization Model , 1998, NIPS.

[7]  James Hu,et al.  DVQ: A digital video quality metric based on human vision , 2001 .

[8]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[9]  Michael S. Landy,et al.  Computational models of visual processing , 1991 .

[10]  Pascual Capilla,et al.  Image quality metric based on multidimensional contrast perception models , 1999 .

[11]  D Kersten,et al.  Predictability and redundancy of natural images. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[12]  Wolfgang Niehsen,et al.  Covariance analysis of motion-compensated frame differences , 1999, IEEE Trans. Circuits Syst. Video Technol..

[13]  오승준 [서평]「Digital Video Processing」 , 1996 .

[14]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[15]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[16]  Andrew B. Watson,et al.  Visibility of DCT basis functions: effects of contrast masking , 1994, Proceedings of IEEE Data Compression Conference (DCC'94).

[17]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[18]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[19]  Michael S. Landy,et al.  Pattern Discrimination, Visual Filters, and Spatial Sampling Irregularity , 1991 .

[20]  J A Solomon,et al.  Model of visual contrast gain control and pattern masking. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[21]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[22]  Francesc J. Ferri,et al.  The role of perceptual contrast non-linearities in image transform quantization , 2000, Image Vis. Comput..

[23]  Jesús Malo,et al.  Subjective image fidelity metric based on bit allocation of the human visual system in the DCT domain , 1997, Image Vis. Comput..

[24]  R. Clarke,et al.  Relation between the Karhunen Loève and cosine transforms , 1981 .

[25]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[26]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[27]  Francesc J. Ferri,et al.  Perceptual feedback in multigrid motion estimation using an improved DCT quantization , 2001, IEEE Trans. Image Process..

[28]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1991, CACM.

[29]  Patrick C. Teo,et al.  Perceptual image distortion , 1994, Proceedings of 1st International Conference on Image Processing.

[30]  J. J. Sakurai,et al.  Modern Quantum Mechanics , 1986 .

[31]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.