Demodulation, predictive coding, and spatial vision.

We argue that some aspects of human spatial vision, particularly for textured patterns and scenes, can be described in terms of demodulation and predictive coding. Such nonlinear processes encode a pattern into local phasors that represent it completely as a modulation, in phase and amplitude, of a prediction associated with the image structure in some region by its predominant undulation(s). The demodulation representation of a pattern is an anisotropic, second-order form of predictive coding, and it offers a particularly efficient way to analyze and encode textures, as it identifies and exploits their underlying redundancies. In addition, self-consistent domains of redundancy in image structure provide a basis for image segmentation. We first provide an algorithm for computing the three elements of a complete demodulation transform of any image, and we illustrate such decompositions for both natural and synthetic images. We then present psychophysical evidence from spatial masking experiments, as well as illustrations of perceptual organization, that suggest a possible role for such underlying representations in human vision. In psychophysical experiments employing masks with more than two oriented Fourier components, we find that peaks of threshold elevation occur at locations in the Fourier plane remote from the orientations and frequencies of the actual mask components. Rather, as would occur from demodulation, these peaks in the frequency plane are related to the vector difference frequencies between the actual masking components and their spectral centers of mass. We offer a neural interpretation of demodulation coding, and finally we demonstrate a practical application of this process in a system for automatic visual recognition of personal identity by demodulation of a facial feature.

[1]  C. Harrison Experiments with linear prediction in television , 1952 .

[2]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[3]  J G Daugman,et al.  Pattern and motion vision without Laplacian zero crossings. , 1988, Journal of the Optical Society of America. A, Optics and image science.

[4]  D Kersten,et al.  Predictability and redundancy of natural images. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[5]  D. Burr,et al.  Functional implications of cross-orientation inhibition of cortical visual cells. I. Neurophysiological evidence , 1982, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[6]  S Marcelja,et al.  Mathematical description of the responses of simple cortical cells. , 1980, Journal of the Optical Society of America.

[7]  D. G. Albrecht,et al.  Striate cortex responses to periodic patterns with and without the fundamental harmonics , 1981, The Journal of physiology.

[8]  J.G. Daugman,et al.  Entropy reduction and decorrelation in visual coding by oriented neural receptive fields , 1989, IEEE Transactions on Biomedical Engineering.

[9]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[10]  David R. Badcock,et al.  Detection of spatial beats: Non-linearity or contrast increment detection? , 1986, Vision Research.

[11]  Wilson S. Geisler,et al.  Multichannel Texture Analysis Using Localized Spatial Filters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Andrew M. Derrington Distortion products in geniculate X-cells: A physiological basis for masking by spatially modulated gratings? , 1987, Vision Research.

[13]  D. Broadbent,et al.  Some experiments bearing on the hypothesis that the visual system analyses spatial patterns in independent bands of spatial frequency , 1975, Vision Research.

[14]  J. Daugman Spatial visual channels in the fourier plane , 1984, Vision Research.

[15]  A. Seebeck,et al.  Beobachtungen über einige Bedingungen der Entstehung von Tönen , 1841 .

[16]  Alan C. Bovik,et al.  Analysis of multichannel narrow-band filters for image texture segmentation , 1991, IEEE Trans. Signal Process..

[17]  Louis A. Romero,et al.  Robust two-dimensional weighted and unweighted phase unwrapping that uses fast transforms and iterative methods , 1994 .

[18]  J. Nachmias,et al.  Masking by spatially-modulated gratings , 1983, Vision Research.

[19]  J. Daugman Two-dimensional spectral analysis of cortical receptive field profiles , 1980, Vision Research.

[20]  T Caelli Energy processing and coding factors in texture discrimination and image processing , 1983, Perception & psychophysics.

[21]  L O Harvey,et al.  Visual masking at different polar angles in the two-dimensional Fourier plane. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[22]  G. B. Wetherill,et al.  SEQUENTIAL ESTIMATION OF POINTS ON A PSYCHOMETRIC FUNCTION. , 1965, The British journal of mathematical and statistical psychology.

[23]  R. Hartley Transmission of information , 1928 .

[24]  J. Bergen,et al.  Texture segregation and orientation gradient , 1991, Vision Research.

[25]  David R. Badcock,et al.  Two-stage analysis of the motion of 2-dimensional patterns, what is the first stage? , 1992, Vision Research.

[26]  H R Wilson,et al.  Curvature and separation discrimination at texture boundaries. , 1992, Journal of the Optical Society of America. A, Optics and image science.

[27]  John Daugman,et al.  Quadrature-phase simple-cell pairs are appropriately described in complex analytic form , 1993 .

[28]  John Daugman,et al.  High Confidence Visual Recognition of Persons by a Test of Statistical Independence , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  S. Laughlin,et al.  Predictive coding: a fresh view of inhibition in the retina , 1982, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[30]  Keith Langley,et al.  Computational analysis of non-Fourier motion , 1994, Vision Research.

[31]  D. Pollen,et al.  Phase relationships between adjacent simple cells in the visual cortex. , 1981, Science.

[32]  A. Derrington,et al.  Some observations on the masking effects of two-dimensional stimuli , 1989, Vision Research.

[33]  G. J. Burton,et al.  Evidence for non-linear response processes in the human visual system from measurements on the thresholds of spatial beat frequencies. , 1973, Vision research.

[34]  Jacob Nachmias Contrast modulated maskers: Test of a late nonlinearity hypothesis , 1989, Vision Research.

[35]  F. Attneave Some informational aspects of visual perception. , 1954, Psychological review.