Scene analysis with saccadic eye movements: Top-down and bottom-up modeling

The perception of an image by a human observer is usually modeled as a parallel process in which all parts of the image are treated more or less equivalently, but in reality the analysis of scenes is a highly selective procedure, in which only a small subset of image locations is processed by the precise and efficient neural machinery of foveal vision. To understand the principles behind this selection of the ‘‘informative’’ regions of images, we have developed a hybrid system that consists of a combination of a knowledgebased reasoning system with a low-level preprocessing by linear and nonlinear neural operators. This hybrid system is intended as a first step towards a complete model of the sensorimotor system of saccadic scene analysis. In the analysis of a scene, the system calculates in each step which eye movement has to be made to reach a maximum of information about the scene. The possible information gain is calculated by means of a parallel strategy which is suitable for adaptive reasoning. The output of the system is a fixation sequence, and finally, a hypothesis about the scene.

[1]  Rajesh P. N. Rao,et al.  Eye Movements in Visual Cognition: A Computational Study , 1997 .

[2]  G. Hauske,et al.  Object and scene analysis by saccadic eye-movements: an investigation with higher-order statistics. , 2000, Spatial vision.

[3]  Gerhard Krieger,et al.  Nonlinear neurons and higher-order statistics: new approaches to human vision and digital image processing , 1999, Electronic Imaging.

[4]  M. Just,et al.  Eye fixations and cognitive processes , 1976, Cognitive Psychology.

[5]  Bernhard Wegmann,et al.  The importance of intrinsically two-dimensional image features in biological vision and picture coding , 1993 .

[6]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[7]  K. Schill Analysing uncertain data in decision support systems , 1995, Proceedings of 3rd International Symposium on Uncertainty Modeling and Analysis and Annual Conference of the North American Fuzzy Information Processing Society.

[8]  E. Pöppel,et al.  A hierarchical model of temporal perception , 1997, Trends in Cognitive Sciences.

[9]  Edward H. Shortliffe,et al.  A Method for Managing Evidential Reasoning in a Hierarchical Hypothesis Space , 1985, Artif. Intell..

[10]  G. Krieger,et al.  Higher-order statistics of natural images and their exploitation by operators selective to intrinsic dimensionality , 1997, Proceedings of the IEEE Signal Processing Workshop on Higher-Order Statistics.

[11]  L. Stark,et al.  Scanpaths in saccadic eye movements while viewing and recognizing patterns. , 1971, Vision research.

[12]  Kerstin Schill Distributed Reasoning With Uncertain Data , 1998 .

[13]  C. Zetzsche,et al.  Fundamental limits of linear filters in the visual processing of two-dimensional signals , 1990, Vision Research.

[14]  I. Rybak,et al.  A model of attention-guided visual perception and recognition , 1998, Vision Research.

[15]  H Strasburger,et al.  Cortical Magnification Theory Fails to Predict Visual Recognition , 1994, The European journal of neuroscience.

[16]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[17]  Heiner Deubel Visuelle Verarbeitung und kognitive Faktoren bei der Generierung sakkadischer Augenbewegungen , 1994 .

[18]  A. Treisman Features and Objects: The Fourteenth Bartlett Memorial Lecture , 1988, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[19]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[20]  Christof Koch,et al.  Feature combination strategies for saliency-based visual attention systems , 2001, J. Electronic Imaging.

[21]  Ernst Pöppel,et al.  Completing Knowledge by Competing Hierarchies , 1991, UAI.

[22]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[23]  Gerhard Krieger,et al.  Investigation of a sensorimotor system for saccadic scene analysis: an integrated approach , 1998 .

[24]  Kerstin Schill,et al.  General Perspective on the Formalization of Uncertain Knowledge , 1998 .

[25]  Philippe Smets,et al.  The Canonical Decomposition of a Weighted Belief , 1995, IJCAI.

[26]  G. Legge,et al.  The importance of eye movements in the analysis of simple patterns , 1986, Vision Research.