Analogue VLSI primitives for perceptual tasks in machine vision

A variety of computational tasks in early vision can be formulated through lattice networks. The cooperative action of these networks depends upon the topology of interconnections, both feedforward and recurrent ones. The Gabor-like impulse response of a 2nd-order lattice network (i.e. with nearest and next-to-nearest interconnections) is analysed in detail, pointing out how a near-optimal filtering behaviour in space and frequency domains can be achieved through excitatory/inhibitory interactions without impairing the stability of the system. These architectures can be mapped, very efficiently at transistor level, on VLSI structures operating as analogue perceptual engines. The hardware implementation of early vision tasks can, indeed, be tackled by combining these perceptual agents through suitable weighted sums. Various implementation strategies have been pursued with reference to: (i) the algorithm-circuit mapping (current-mode and transconductor approaches); (ii) the degree of programmability (fixed, selectable and tunable); and (iii) the implementation technology (2μ and 0.8μ gate lengths). Applications of the perceptual engine to machine vision algorithms are discussed.

[1]  Takashi Matsumoto,et al.  Spatial versus temporal stability issues in image processing neuro chips , 1992, IEEE Trans. Neural Networks.

[2]  Terrence J. Sejnowski,et al.  Filter selection model for motion segmentation and velocity integration , 1994 .

[3]  Yehoshua Y. Zeevi,et al.  The Generalized Gabor Scheme of Image Representation in Biological and Machine Vision , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[5]  T. Sanger,et al.  Stereo disparity computation using Gabor filters , 1988, Biological Cybernetics.

[6]  H. Wechsler Multiscale and Distributed Visual Representations and Mappings for Invariant Low-Level Perception , 1992 .

[7]  N. Qian Binocular Disparity and the Perception of Depth , 1997, Neuron.

[8]  Luigi Raffo Analysis and synthesis of resistive networks for distributed visual elaborations , 1996 .

[9]  A. Parker,et al.  Two-dimensional spatial structure of receptive fields in monkey striate cortex. , 1988, Journal of the Optical Society of America. A, Optics and image science.

[10]  Carver Mead,et al.  Analog VLSI and neural systems , 1989 .

[11]  Z. Czarnul,et al.  Erratum: MOS transconductors and integrators with high linearity , 1986 .

[12]  Christof Koch,et al.  Neuromorphic vision chips , 1996 .

[13]  P. O. Bishop,et al.  Theory of spatial position and spatial frequency relations in the receptive fields of simple cells in the visual cortex , 1982, Biological Cybernetics.

[14]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[15]  J. J. Koenderink,et al.  Scale-time , 1988, Biological Cybernetics.

[16]  Ian P. Howard,et al.  Binocular Vision and Stereopsis , 1996 .

[17]  P Perona,et al.  Preattentive texture discrimination with early vision mechanisms. , 1990, Journal of the Optical Society of America. A, Optics and image science.

[18]  Eric A. Vittoz,et al.  Analog VLSI signal processing: Why, where, and how? , 1994 .

[19]  A J Ahumada,et al.  Model of human visual-motion sensing. , 1985, Journal of the Optical Society of America. A, Optics and image science.