The atoms of vision: Cartesian or polar?

The inherent structure of the encoding in early stages of the visual system is investigated from a combined information-theoretical, psychophysical, and neurophysiological perspective. We argue that the classical modeling in terms of linear spatial filters is equivalent to the assumption of a Cartesian organization of the feature space of early vision. We show that such a linear Cartesian feature space would be suboptimal for the exploitation of the statistical redundancies of natural images since these have a radially separable probability-density function. Therefore a more efficient representation can be obtained by a nonlinear encoding that yields a feature space with polar organization. This prediction of the information-theoretical approach regarding the organization of the feature space of early vision is confirmed by our psychophysical measurements of basic discrimination capabilities for elementary Gabor patches, and the necessary nonlinear operations are shown to be closely related to cortical gain control and to the phase invariance of complex cells. Finally, we point out some striking similarities between the polar representation in visual cortex and basic image-coding strategies pursued in shape-gain vector quantization schemes.

[1]  D. L. Macadam,et al.  Visual sensitivities to combined chromaticity and luminance differences. , 1949, Journal of the Optical Society of America.

[2]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.

[3]  D. Hubel,et al.  Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[4]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[5]  D. Burr Sensitivity to spatial phase , 1980, Vision Research.

[6]  J. M. Foley,et al.  Contrast masking in human vision. , 1980, Journal of the Optical Society of America.

[7]  B. Julesz,et al.  Early visual perception. , 1981, Annual review of psychology.

[8]  D. G. Albrecht,et al.  Striate cortex of monkey and cat: contrast response function. , 1982, Journal of neurophysiology.

[9]  S. Laughlin,et al.  Predictive coding: a fresh view of inhibition in the retina , 1982, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[10]  D. G. Albrecht,et al.  Spatial frequency selectivity of cells in macaque visual cortex , 1982, Vision Research.

[11]  Jerry D. Gibson,et al.  Distributions of the Two-Dimensional DCT Coefficients for Images , 1983, IEEE Trans. Commun..

[12]  R. Gray,et al.  Product code vector quantizers for waveform and voice coding , 1984 .

[13]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[14]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[15]  Ingo Rentschler,et al.  Loss of spatial phase relationships in extrafoveal vision , 1985, Nature.

[16]  D. Field,et al.  The structure and symmetry of simple-cell receptive-field profiles in the cat’s visual cortex , 1986, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[17]  A. Gorea,et al.  Different encoding mechanisms for phase and contrast , 1986, Vision Research.

[18]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[19]  A B Watson,et al.  Efficiency of a model human image code. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[20]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[21]  D. Burr,et al.  Feature detection in human vision: a phase-dependent energy model , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[22]  J. P. Thomas,et al.  Why are some spatial discriminations independent of contrast? , 1989, Journal of the Optical Society of America. A, Optics and image science.

[23]  H. B. Barlow,et al.  Unsupervised Learning , 1989, Neural Computation.

[24]  S. F. Bowne Contrast discrimination cannot explain spatial frequency, orientation or temporal frequency discrimination , 1990, Vision Research.

[25]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.

[26]  Bernhard Wegmann,et al.  Visual-system-based polar quantization of local amplitude and local phase of orientation filter outputs , 1990, Other Conferences.

[27]  Bernhard Wegmann,et al.  Statistical dependence between orientation filter outputs used in a human-vision-based image code , 1990, Other Conferences.

[28]  D. G. Albrecht,et al.  Motion selectivity and the contrast-response function of simple cells in the visual cortex , 1991, Visual Neuroscience.

[29]  M. Banks,et al.  The effects of contrast, spatial scale, and orientation on foveal and peripheral phase discrimination , 1991, Vision Research.

[30]  D. Heeger Nonlinear model of neural responses in cat visual cortex. , 1991 .

[31]  Minami Ito,et al.  Columns for visual features of objects in monkey inferotemporal cortex , 1992, Nature.

[32]  M. Stryker Elements of visual perception , 1992, Nature.

[33]  J. H. van Hateren,et al.  Real and optimal neural images in early vision , 1992, Nature.

[34]  Bernhard Wegmann,et al.  The importance of intrinsically two-dimensional image features in biological vision and picture coding , 1993 .

[35]  Horace Barlow,et al.  What is the computational goal of the neocortex , 1994 .

[36]  M. Carandini,et al.  Summation and division by neurons in primate visual cortex. , 1994, Science.

[37]  B. Treutwein Adaptive psychophysical procedures , 1995, Vision Research.

[38]  D. G. Albrecht,et al.  Bayesian analysis of identification performance in monkey visual cortex: Nonlinear mechanisms and stimulus certainty , 1995, Vision Research.

[39]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[40]  Bernhard Wegmann,et al.  Feature-specific vector quantization of images , 1996, IEEE Trans. Image Process..

[41]  Jochen Braun,et al.  A Model of Early Visual Processing , 1997, NIPS.

[42]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[43]  G. Krieger,et al.  Neural network models and the visual cortex: the missing link between orientation selectivity and the natural environment , 1997, Neuroscience Letters.

[44]  J. H. Hateren,et al.  Independent component filters of natural images compared with simple cells in primary visual cortex , 1998 .

[45]  Eero P. Simoncelli,et al.  Modeling Surround Suppression in V1 Neurons with a Statistically Derived Normalization Model , 1998, NIPS.

[46]  Eero P. Simoncelli,et al.  Image compression via joint statistical characterization in the wavelet domain , 1999, IEEE Trans. Image Process..