Preattentive grouping and attentive selection for early visual computation

The segmentation of objects in a real world scene is a prerequisite for any higher level recognition or interpretation process. Biological visual systems exploit efficient mechanisms for object extraction which seem to be mostly data driven. We propose a network for perceptual grouping inspired from neurophysiological and psychophysical findings, incorporating a phase diffusion process which labels the whole image into its constituent objects and the background, followed by a selective attention stage which sequentially extracts objects in the scene. The image is processed by four successive stages, copying the design of visual cortical mechanisms. Direction specific edge responses are used as starting points for a competitive and cooperative phase process. The resulting phase image is processed by an attention mechanism, extracting homogeneous regions using both spatial and phase information, followed by the generation of a saccadic signal.

[1]  O. Reiser,et al.  Principles Of Gestalt Psychology , 1936 .

[2]  A. L. I︠A︡rbus Eye Movements and Vision , 1967 .

[3]  A. L. Yarbus,et al.  Eye Movements and Vision , 1967, Springer US.

[4]  J. Pokorny Foundations of Cyclopean Perception , 1972 .

[5]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[6]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[7]  Steven W. Zucker,et al.  On the Foundations of Relaxation Labeling Processes , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[9]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  S Grossberg,et al.  Neural dynamics of brightness perception: Features, boundaries, diffusion, and resonance , 1984, Perception & Psychophysics.

[11]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[12]  Demetri Terzopoulos,et al.  Regularization of Inverse Visual Problems Involving Discontinuities , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  C Koch,et al.  Analog "neuronal" networks in early vision. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Geoffrey E. Hinton,et al.  Separating Figure from Ground with a Parallel Network , 1986, Perception.

[15]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Stephen Grossberg,et al.  Nonlinear neural networks: Principles, mechanisms, and architectures , 1988, Neural Networks.

[17]  D. Burr,et al.  Feature detection in human vision: a phase-dependent energy model , 1988, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[18]  T Poggio,et al.  Parallel integration of vision modules. , 1988, Science.

[19]  John Y. Aloimonos,et al.  Unification and integration of visual modules: an extension of the Marr Paradigm , 1989 .

[20]  W. Singer,et al.  Oscillatory responses in cat visual cortex exhibit inter-columnar synchronization which reflects global stimulus properties , 1989, Nature.

[21]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Jitendra Malik,et al.  Detecting and localizing edges composed of steps, peaks and roofs , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[23]  P. Perona,et al.  Detecting and localizing edges composed of steps , 1990 .

[24]  J T McIlwain,et al.  Distributed spatial coding in the superior colliculus: A review , 1991, Visual Neuroscience.

[25]  S. Yantis Multielement visual tracking: Attention and perceptual organization , 1992, Cognitive Psychology.

[26]  Winfried A. Fellenz,et al.  A Sequential Model for Attentive Object Selection , 1994 .

[27]  Winfried A. Fellenz,et al.  Image Segmentation by Phase Label Diffusion , 1995 .

[28]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[29]  Jean-Michel Morel,et al.  Variational methods in image segmentation , 1995 .