Dependency Reduction with Divisive Normalization: Justification and Effectiveness

Efficient coding transforms that reduce or remove statistical dependencies in natural sensory signals are important for both biology and engineering. In recent years, divisive normalization (DN) has been advocated as a simple and effective nonlinear efficient coding transform. In this work, we first elaborate on the theoretical justification for DN as an efficient coding transform. Specifically, we use the multivariate t model to represent several important statistical properties of natural sensory signals and show that DN approximates the optimal transforms that eliminate statistical dependencies in the multivariate t model. Second, we show that several forms of DN used in the literature are equivalent in their effects as efficient coding transforms. Third, we provide a quantitative evaluation of the overall dependency reduction performance of DN for both the multivariate t models and natural sensory signals. Finally, we find that statistical dependencies in the multivariate t model and natural sensory signals are increased by the DN transform with low-input dimensions. This implies that for DN to be an effective efficient coding transform, it has to pool over a sufficiently large number of inputs.

[1]  Oldrich A Vasicek,et al.  A Test for Normality Based on Sample Entropy , 1976 .

[2]  Michael S. Lewicki,et al.  Efficient coding of natural sounds , 2002, Nature Neuroscience.

[3]  Joseph J. Atick,et al.  What Does the Retina Know about Natural Scenes? , 1992, Neural Computation.

[4]  Eero P. Simoncelli,et al.  Embedded wavelet image compression based on a joint probability model , 1997, Proceedings of International Conference on Image Processing.

[5]  William T. Freeman,et al.  Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[6]  Eero P. Simoncelli,et al.  A model of neuronal responses in visual area MT , 1998, Vision Research.

[7]  Bernhard Wegmann,et al.  Statistical dependence between orientation filter outputs used in a human-vision-based image code , 1990, Other Conferences.

[8]  A. McNeil Multivariate t Distributions and Their Applications , 2006 .

[9]  Gerhard Krieger,et al.  The atoms of vision: Cartesian or polar? , 1999 .

[10]  J. R. Blum,et al.  On a characterization of the normal distribution , 1956 .

[11]  Christof Koch,et al.  Shunting Inhibition Does Not Have a Divisive Effect on Firing Rates , 1997, Neural Computation.

[12]  Terrence J. Sejnowski,et al.  Soft Mixer Assignment in a Hierarchical Generative Model of Natural Scene Statistics , 2006, Neural Computation.

[13]  Martin J. Wainwright,et al.  Scale Mixtures of Gaussians and the Statistics of Natural Images , 1999, NIPS.

[14]  D. Ringach Population coding under normalization , 2010, Vision Research.

[15]  Eero P. Simoncelli,et al.  Nonlinear image representation using divisive normalization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  H. B. Barlow,et al.  Possible Principles Underlying the Transformations of Sensory Messages , 2012 .

[17]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[18]  Nikolas P. Galatsanos,et al.  Variational Bayesian Image Restoration Based on a Product of $t$-Distributions Image Prior , 2008, IEEE Transactions on Image Processing.

[19]  Valero Laparra,et al.  Divisive normalization image quality metric revisited. , 2010, Journal of the Optical Society of America. A, Optics, image science, and vision.

[20]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[21]  Matthias Bethge,et al.  Natural Image Coding in V1: How Much Use Is Orientation Selectivity? , 2008, PLoS Comput. Biol..

[22]  Eero P. Simoncelli,et al.  Spatiotemporal Elements of Macaque V1 Receptive Fields , 2005, Neuron.

[23]  Richard G. F. Visser,et al.  Meiosis Drives Extraordinary Genome Plasticity in the Haploid Fungal Plant Pathogen Mycosphaerella graminicola , 2009, PloS one.

[24]  D. F. Andrews,et al.  Scale Mixtures of Normal Distributions , 1974 .

[25]  M. Bethge Factorial coding of natural images: how effective are linear models in removing higher-order dependencies? , 2006, Journal of the Optical Society of America. A, Optics, image science, and vision.

[26]  R. Navarro,et al.  Input–output statistical independence in divisive normalization models of V1 neurons , 2003, Network.

[27]  Joseph J Atick,et al.  Could information theory provide an ecological theory of sensory processing? , 2011, Network.

[28]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[29]  M. Carandini,et al.  A Synaptic Explanation of Suppression in Visual Cortex , 2002, The Journal of Neuroscience.

[30]  K. Zografos,et al.  On maximum entropy characterization of Pearson's type II and VII multivariate distributions , 1999 .

[31]  Matthias Bethge,et al.  The Conjoint Effect of Divisive Normalization and Orientation Selectivity on Redundancy Reduction , 2008, NIPS.

[32]  D. Heeger Normalization of cell responses in cat striate cortex , 1992, Visual Neuroscience.

[33]  Eero P. Simoncelli,et al.  Modeling Multiscale Subbands of Photographic Images with Fields of Gaussian Scale Mixtures , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Barry B. Lee,et al.  Suppressive Surrounds and Contrast Gain in Magnocellular-Pathway Retinal Ganglion Cells of Macaque , 2006, The Journal of Neuroscience.

[35]  R. Navarro,et al.  Optimal coding through divisive normalization models of V1 neurons , 2003, Network.

[36]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[37]  J A Solomon,et al.  Model of visual contrast gain control and pattern masking. , 1997, Journal of the Optical Society of America. A, Optics, image science, and vision.

[38]  Eero P. Simoncelli,et al.  Dimensionality reduction in neural models: an information-theoretic generalization of spike-triggered average and covariance analysis. , 2006, Journal of vision.

[39]  Yann LeCun,et al.  What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[40]  Michael J. Black,et al.  Fields of Experts: a framework for learning image priors , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[41]  Zhou Wang,et al.  General-purpose reduced-reference image quality assessment based on perceptually and statistically motivated image representation , 2008, 2008 15th IEEE International Conference on Image Processing.

[42]  H Barlow,et al.  Redundancy reduction revisited , 2001, Network.

[43]  Eero P. Simoncelli,et al.  Nonlinear image representation for efficient perceptual coding , 2006, IEEE Transactions on Image Processing.

[44]  J. H. van Hateren,et al.  Modelling the Power Spectra of Natural Images: Statistics and Information , 1996, Vision Research.

[45]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1967 .

[46]  Liam Paninski,et al.  Estimation of Entropy and Mutual Information , 2003, Neural Computation.

[47]  James J DiCarlo,et al.  Multiple Object Response Normalization in Monkey Inferotemporal Cortex , 2005, The Journal of Neuroscience.

[48]  Peter E. Latham,et al.  Divisive Normalization, Line Attractor Networks and Ideal Observers , 1998, NIPS.

[49]  E J Chichilnisky,et al.  A simple white noise analysis of neuronal light responses , 2001, Network.

[50]  Rajesh P. N. Rao,et al.  Probabilistic Models of the Brain: Perception and Neural Function , 2002 .

[51]  Terrence J. Sejnowski,et al.  Assignment of Multiplicative Mixtures in Natural Images , 2004, NIPS.

[52]  José-Luis Guerrero-Cusumano A Measure of Total Variability for the Multivariate t Distribution with Applications to Finance , 1996, Inf. Sci..

[53]  Geoffrey E. Hinton,et al.  Learning Sparse Topographic Representations with Products of Student-t Distributions , 2002, NIPS.

[54]  D. Ruderman,et al.  Statistics of cone responses to natural images: implications for visual coding , 1998 .

[55]  Eero P. Simoncelli,et al.  Natural signal statistics and sensory gain control , 2001, Nature Neuroscience.

[56]  A. Zellner An Introduction to Bayesian Inference in Econometrics , 1971 .

[57]  W. R. Garner Uncertainty and structure as psychological concepts , 1975 .

[58]  J. M. Foley,et al.  Human luminance pattern-vision mechanisms: masking experiments require a new model. , 1994, Journal of the Optical Society of America. A, Optics, image science, and vision.

[59]  Walter W. Focke,et al.  Mixture Models Based on Neural Network Averaging , 2006, Neural Computation.

[60]  C. Zetzsche,et al.  Fundamental limits of linear filters in the visual processing of two-dimensional signals , 1990, Vision Research.

[61]  M. Carandini,et al.  Summation and division by neurons in primate visual cortex. , 1994, Science.

[62]  Siwei Lyu An implicit Markov random field model for the multi-scale oriented representations of natural images , 2009, CVPR.

[63]  Jose A. Costa,et al.  On Solutions to Multivariate Maximum α-Entropy Problems , 2003 .

[64]  Valero Laparra,et al.  Psychophysically Tuned Divisive Normalization Approximately Factorizes the PDF of Natural Images , 2010, Neural Computation.

[65]  Eero P. Simoncelli,et al.  Natural image statistics and divisive normalization: Modeling nonlinearity and adaptation in cortical neurons , 2002 .

[66]  C. Enroth-Cugell,et al.  Chapter 9 Visual adaptation and retinal gain controls , 1984 .

[67]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[68]  R. Navarro,et al.  Optimal coding through divisive normalization models of V1 neurons. , 2003 .

[69]  M. Klamkin,et al.  A spherical characterization of the normal distribution , 1976 .

[70]  John J. McCann,et al.  Rendering High-Dynamic Range Images: Algorithms that Mimic Human Vision , 2006 .

[71]  Eero P. Simoncelli,et al.  Nonlinear Extraction of Independent Components of Natural Images Using Radial Gaussianization , 2009, Neural Computation.

[72]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[73]  Zhaoping Li,et al.  Understanding Retinal Color Coding from First Principles , 1992, Neural Computation.

[74]  R. Baddeley,et al.  Searching for filters with 'interesting' output distributions: an uninteresting direction to explore? , 1996, Network.

[75]  D. Heeger,et al.  The Normalization Model of Attention , 2009, Neuron.

[76]  Joonyeol Lee,et al.  A Normalization Model of Attentional Modulation of Single Unit Responses , 2009, PloS one.

[77]  D J Field,et al.  Local Contrast in Natural Images: Normalisation and Coding Efficiency , 2000, Perception.

[78]  M. Carandini,et al.  Functional Mechanisms Shaping Lateral Geniculate Responses to Artificial and Natural Stimuli , 2008, Neuron.

[79]  M. Studený,et al.  The Multiinformation Function as a Tool for Measuring Stochastic Dependence , 1998, Learning in Graphical Models.

[80]  Siwei Lyu Divisive Normalization: Justification and Effectiveness as Efficient Coding Transform , 2010, NIPS.

[81]  Michael Satosi Watanabe,et al.  Information Theoretical Analysis of Multivariate Correlation , 1960, IBM J. Res. Dev..

[82]  Nuno Vasconcelos,et al.  Decision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics , 2009, Neural Computation.

[83]  Eero P. Simoncelli,et al.  Natural Sound Statistics and Divisive Normalization in the Auditory System , 2000, NIPS.

[84]  Shawn R. Olsen,et al.  Divisive Normalization in Olfactory Population Codes , 2010, Neuron.

[85]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[86]  William Palin Elderton Frequency curves and correlation , 1928 .