Spatiotemporal inseparability in early vision: centre‐surround models and velocity selectivity

Several computational theories of early visual processing, such as Marr's zero‐crossing theory, are biologically motivated and based largely on the well‐known difference of Gaussians (DOG) receptive‐field model of retinal processing. We examine the physiological relevance of the DOG, particularly in the light of evidence indicating significant spatiotemporal inseparability in the behaviour of retinal cell types. From the form of the inseparability we find that commonly accepted functional interpretations of retinal processing based on the DOG, such as the Laplacian of a Gaussian and zero crossings, are not valid for time‐varying images. Whereas current machine‐vision approaches attempt to separate form and motion information at an early stage, biological systems do not appear to do so. We further show that the qualitative form of this inseparability provides a convenient precursor to the extraction of both form and motion information. We construct efficient mechanisms for extracting orientation and two‐dimensional normal velocity within a hierarchical computational framework. The resulting mechanisms are well localized in space‐time and can be easily tuned to various degrees of orientation and speed specificity.
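
As an illustration of what spatiotemporal inseparability means for a centre‐surround model, the following minimal sketch builds a one‐dimensional DOG kernel in space and time and measures how far it is from a product of one spatial and one temporal function. Everything specific here is an assumption made for illustration and is not taken from the paper: the Gaussian widths, the surround weight of 0.8, the low‐pass temporal response, the 5 ms surround delay, and the function names. Inseparability is modelled in the simplest way, by delaying the surround relative to the centre, and it is quantified by the fraction of kernel energy captured by the first singular value.

```python
import numpy as np

def gaussian(x, sigma):
    """Unit-area Gaussian spatial profile."""
    return np.exp(-x**2 / (2 * sigma**2)) / (np.sqrt(2 * np.pi) * sigma)

def temporal_response(t, tau, delay=0.0):
    """Causal low-pass impulse response, optionally delayed (assumed form)."""
    s = np.clip(t - delay, 0.0, None)
    return (s / tau**2) * np.exp(-s / tau)

def dog_kernel(x, t, surround_delay):
    """Space-time DOG: centre minus a weighted, possibly delayed, surround.

    Rows index time, columns index space. All parameters are hypothetical.
    """
    centre = np.outer(temporal_response(t, 10.0), gaussian(x, 1.0))
    surround = np.outer(temporal_response(t, 10.0, surround_delay),
                        gaussian(x, 3.0))
    return centre - 0.8 * surround

def separability_index(kernel):
    """Fraction of energy in the best rank-1 (separable) approximation."""
    s = np.linalg.svd(kernel, compute_uv=False)
    return s[0]**2 / np.sum(s**2)

x = np.linspace(-10.0, 10.0, 81)   # space, arbitrary units
t = np.linspace(0.0, 100.0, 101)   # time, ms

sep = separability_index(dog_kernel(x, t, surround_delay=0.0))
insep = separability_index(dog_kernel(x, t, surround_delay=5.0))
print(f"no surround delay:   index = {sep:.4f}")
print(f"5 ms surround delay: index = {insep:.4f}")
```

With no delay the kernel factors exactly into a temporal response times a spatial DOG, so the index is 1 to within rounding. With a delayed surround the index falls below 1, which means no single spatial profile describes the kernel at all times; this is the sense in which a purely spatial reading of the DOG stops applying to time‐varying input in this toy model.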
