Nonlinear image representation using divisive normalization

In this paper, we describe a nonlinear image representation based on divisive normalization that is designed to match the statistical properties of photographic images, as well as the perceptual sensitivity of biological visual systems. We decompose an image using a multi-scale oriented representation, and use studentpsilas t as a model of the dependencies within local clusters of coefficients. We then show that normalization of each coefficient by the square root of a linear combination of the amplitudes of the coefficients in the cluster reduces statistical dependencies. We further show that the resulting divisive normalization transform is invertible and provide an efficient iterative inversion algorithm. Finally, we probe the statistical and perceptual advantages of this image representation by examining its robustness to added noise, and using it to enhance image contrast.

[1]  L. Landweber An iteration formula for Fredholm integral equations of the first kind , 1951 .

[2]  P. J. Burt,et al.  Fast Filter Transforms for Image Processing , 1981 .

[3]  P. Burt Fast filter transform for image processing , 1981 .

[4]  M. West On scale mixtures of normal distributions , 1987 .

[5]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[6]  William H. Press,et al.  Numerical recipes , 1990 .

[7]  Bernhard Wegmann,et al.  Statistical dependence between orientation filter outputs used in a human-vision-based image code , 1990, Other Conferences.

[8]  D. G. Albrecht,et al.  Cortical neurons: Isolation of contrast gain control , 1992, Vision Research.

[9]  D. Heeger Normalization of cell responses in cat striate cortex , 1992, Visual Neuroscience.

[10]  Patrick C. Teo,et al.  Perceptual image distortion , 1994, Proceedings of 1st International Conference on Image Processing.

[11]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[12]  Eero P. Simoncelli Statistical models for images: compression, restoration and synthesis , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).

[13]  J A Solomon,et al.  Model of visual contrast gain control and pattern masking. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[14]  Eero P. Simoncelli,et al.  Image compression via joint statistical characterization in the wavelet domain , 1999, IEEE Trans. Image Process..

[15]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[16]  Dani Lischinski,et al.  Gradient Domain High Dynamic Range Compression , 2023 .

[17]  Fionn Murtagh,et al.  Gray and color image contrast enhancement by the curvelet transform , 2003, IEEE Trans. Image Process..

[18]  Eero P. Simoncelli,et al.  Directly Invertible Nonlinear Divisive Normalization Pyramid for Image Representation , 2003, VLBV.

[19]  Yee Whye Teh,et al.  Energy-Based Models for Sparse Overcomplete Representations , 2003, J. Mach. Learn. Res..

[20]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[21]  Michael J. Black,et al.  Fields of Experts: a framework for learning image priors , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Edward H. Adelson,et al.  Compressing and companding high dynamic range images with subband architectures , 2005, ACM Trans. Graph..

[23]  Robert M. Gray,et al.  Toeplitz and Circulant Matrices: A Review , 2005, Found. Trends Commun. Inf. Theory.

[24]  Eero P. Simoncelli,et al.  Locally adaptive multiscale contrast optimization , 2005, IEEE International Conference on Image Processing 2005.

[25]  Robert M. Gray,et al.  Toeplitz And Circulant Matrices: A Review (Foundations and Trends(R) in Communications and Information Theory) , 2006 .

[26]  Joshua Gluckman,et al.  Higher Order Image Pyramids , 2006, ECCV.

[27]  M. Bethge Factorial coding of natural images: how effective are linear models in removing higher-order dependencies? , 2006, Journal of the Optical Society of America. A, Optics, image science, and vision.

[28]  Eero P. Simoncelli,et al.  Nonlinear image representation for efficient perceptual coding , 2006, IEEE Transactions on Image Processing.

[29]  William T. Freeman,et al.  What makes a good model of natural images? , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.