A Biologically-Inspired Visual Saliency Model to Test Different Strategies of Saccade Programming

Saliency models provide a saliency map that is a topographically arranged map to represent the saliency of the visual scene. Saliency map is used to sequentially select particular locations of the scene to predict a subject’s eye scanpath when viewing the corresponding scene. A saliency map is most of the time computed using the same point of view or foveated point. Few models were interested in saccade programming strategies. In visual search tasks, studies shown that people can plan from one foveated point the next two saccades (and so, the next two fixations): this is called concurrent saccade programming. In this paper, we tested if such strategy occurs during natural scene free viewing. We tested different saccade programming strategies depending on the number of programmed saccades. The results showed that the strategy of programming one saccade at a time from the foveated point best matches the experimental data from free viewing of natural images. Because saccade programming models depend on the foveated point, we took into account the spatially variant retinal resolution. We showed that the predicted eye fixations were more effective when this retinal resolution was combined with the saccade programming strategies.

[1]  Derrick J. Parkhurst,et al.  Modeling the role of salience in the allocation of overt visual attention , 2002, Vision Research.

[2]  Wilson S. Geisler,et al.  Real-time foveated multiresolution system for low-bandwidth video communication , 1998, Electronic Imaging.

[3]  S. Yantis,et al.  Visual attention: control, representation, and time course. , 1997, Annual review of psychology.

[4]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[5]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  J. Henderson Human gaze control during real-world scene perception , 2003, Trends in Cognitive Sciences.

[8]  Jeanny Hérault,et al.  Realistic Simulation Tool for Early Visual Processing Including Space, Time and Colour Data , 1993, IWANN.

[9]  Stefan Wermter,et al.  Emergent Neural Computational Architectures Based on Neuroscience , 2001, Lecture Notes in Computer Science.

[10]  D. Coppola,et al.  Idiosyncratic characteristics of saccadic eye movements when viewing different visual environments , 1999, Vision Research.

[11]  L. Itti Quantitative modelling of perceptual salience at human eye position , 2006 .

[12]  L. Stark,et al.  Most naturally occurring human saccades have magnitudes of 15 degrees or less. , 1975, Investigative ophthalmology.

[13]  Robert M. McPeek,et al.  Concurrent processing of saccades in visual search , 2000, Vision Research.

[14]  Antonio Torralba,et al.  Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. , 2006, Psychological review.

[15]  B. Wandell Foundations of vision , 1995 .

[16]  Heiko Neumann,et al.  Recurrent Long-Range Interactions in Early Vision , 2001, Emergent Neural Computational Architectures Based on Neuroscience.

[17]  D. Navon Forest before trees: The precedence of global features in visual perception , 1977, Cognitive Psychology.

[18]  Alberto Prieto,et al.  New Trends in Neural Computation , 1993 .

[19]  C. J. Erkelens,et al.  PII: S0042-6989(97)00287-3 , 2003 .