Wavelets, vision and the statistics of natural scenes

The processing of spatial information by the visual system shows a number of similarities to the wavelet transforms that have become popular in applied mathematics. Over the last decade, a range of studies have focused on the question of ‘why’ the visual system would evolve this strategy of coding spatial information. One such approach has focused on the relationship between the visual code and the statistics of natural scenes under the assumption that the visual system has evolved this strategy as a means of optimizing the representation of its visual environment. This paper reviews some of this literature and looks at some of the statistical properties of natural scenes that allow this code to be efficient. It is argued that such wavelet codes are efficient because they increase the independence of the vectors' outputs (i.e. they increase the independence of the responses of the visual neurons) by finding the sparse structure available in the input. Studies with neural networks that attempt to maximize the ‘sparsity’ of the representation have been shown to produce vectors (neural receptive fields) that have many of the properties of a wavelet representation. It is argued that the visual environment has the appropriate sparse structure to make this sparse output possible. It is argued that these sparse/independent representations make it computationally easier to detect and represent the higher–order structure present in complex environmental data.

[1]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[2]  O. L. Zangwill,et al.  Current Problems in Animal Behavior , 1964 .

[3]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[4]  C Blakemore,et al.  On the existence of neurones in the human visual system selectively sensitive to the orientation and size of retinal images , 1969, The Journal of physiology.

[5]  S Marcelja,et al.  Mathematical description of the responses of simple cortical cells. , 1980, Journal of the Optical Society of America.

[6]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[7]  D. Tolhurst,et al.  On the variety of spatial frequency selectivities shown by neurons in area 17 of the cat , 1981, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[8]  E. Bienenstock,et al.  Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex , 1982, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[9]  S. Laughlin,et al.  Predictive coding: a fresh view of inhibition in the retina , 1982, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[10]  D. G. Albrecht,et al.  Spatial frequency selectivity of cells in macaque visual cortex , 1982, Vision Research.

[11]  Andrew C. Sleigh,et al.  Physical and Biological Processing of Images , 1983 .

[12]  Andrew B. Watson,et al.  Detection and Recognition of Simple Spatial Forms , 1983 .

[13]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[14]  R. L. de Valois,et al.  Relationship between spatial-frequency and orientation tuning of striate-cortex cells. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[15]  Terry Bossomaier,et al.  Why spatial frequency processing in the visual cortex? , 1986, Vision Research.

[16]  D. Field,et al.  The structure and symmetry of simple-cell receptive-field profiles in the cat’s visual cortex , 1986, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[17]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[18]  G. J. Burton,et al.  Color and spatial structure in natural scenes. , 1987, Applied optics.

[19]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[20]  A. Parker,et al.  Spatial properties of neurons in the monkey striate cortex , 1987, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[21]  Edward H. Adelson,et al.  Orthogonal Pyramid Transforms For Image Coding. , 1987, Other Conferences.

[22]  Ralph Linsker,et al.  Self-organization in a perceptual network , 1988, Computer.

[23]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[24]  Terrence J. Sejnowski,et al.  Network model of shape-from-shading: neural function arises from both receptive and projective fields , 1988, Nature.

[25]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[26]  David J. Field,et al.  What The Statistics Of Natural Images Tell Us About Visual Coding , 1989, Photonics West - Lasers and Applications in Science and Engineering.

[27]  Joseph J. Atick,et al.  Towards a Theory of Early Visual Processing , 1990, Neural Computation.

[28]  D G Stork,et al.  Do Gabor functions provide appropriate descriptions of visual cortical receptive fields? , 1990, Journal of the Optical Society of America. A, Optics and image science.

[29]  G. Hartmann,et al.  Parallel Processing in Neural Systems and Computers , 1990 .

[30]  D. Tolhurst,et al.  Amplitude spectra of natural images. , 1992, Ophthalmic & physiological optics : the journal of the British College of Ophthalmic Opticians.

[31]  Nathan Intrator,et al.  Feature Extraction Using an Unsupervised Neural Network , 1992, Neural Computation.

[32]  Leslie S. Smith,et al.  The principal components of natural images , 1992 .

[33]  Joseph J. Atick,et al.  What Does the Retina Know about Natural Scenes? , 1992, Neural Computation.

[34]  D. Tolhurst,et al.  Amplitude spectra of natural images , 1992 .

[35]  D. Donoho Unconditional Bases Are Optimal Bases for Data Compression and for Statistical Estimation , 1993 .

[36]  D. Ruderman The statistics of natural images , 1994 .

[37]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[38]  D. Field,et al.  What's constant in contrast constancy? The effects of scaling on the perceived contrast of bandpass patterns , 1995, Vision Research.

[39]  L. Croner,et al.  Receptive fields of P and M ganglion cells across the primate retina , 1995, Vision Research.

[40]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[41]  D. Field,et al.  Visual sensitivity, blur and the sources of variability in the amplitude spectra of natural scenes , 1997, Vision Research.

[42]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[43]  Leon N. Cooper,et al.  BCM network develops orientation selectivity and ocular dominance in natural scene environment , 1997, Vision Research.

[44]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[45]  RussLL L. Ds Vnlos,et al.  SPATIAL FREQUENCY SELECTIVITY OF CELLS IN MACAQUE VISUAL CORTEX , 2022 .