Simulating human saccadic scanpaths on natural images

Human saccade is a dynamic process of information pursuit. Based on the principle of information maximization, we propose a computational model to simulate human saccadic scanpaths on natural images. The model integrates three related factors as driven forces to guide eye movements sequentially — reference sensory responses, fovea-periphery resolution discrepancy, and visual working memory. For each eye movement, we compute three multi-band filter response maps as a coherent representation for the three factors. The three filter response maps are combined into multi-band residual filter response maps, on which we compute residual perceptual information (RPI) at each location. The RPI map is a dynamic saliency map varying along with eye movements. The next fixation is selected as the location with the maximal RPI value. On a natural image dataset, we compare the saccadic scanpaths generated by the proposed model and several other visual saliency-based models against human eye movement data. Experimental results demonstrate that the proposed model achieves the best prediction accuracy on both static fixation locations and dynamic scanpaths.

[1]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[2]  T. Foulsham,et al.  What can saliency models predict about eye movements? Spatial and sequential aspects of fixations during encoding and recognition. , 2008, Journal of vision.

[3]  Wen Gao,et al.  Measuring visual saliency by Site Entropy Rate , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[5]  L. Stark,et al.  Most naturally occurring human saccades have magnitudes of 15 degrees or less. , 1975, Investigative ophthalmology.

[6]  J. H. Hateren,et al.  Independent component filters of natural images compared with simple cells in primary visual cortex , 1998 .

[7]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[8]  Preeti Verghese,et al.  Where to look next? Eye movements reduce local uncertainty. , 2007, Journal of vision.

[9]  Frank Tong,et al.  Cognitive neuroscience: Primary visual cortex and visual awareness , 2003, Nature Reviews Neuroscience.

[10]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[11]  Deepu Rajan,et al.  Random walks on graphs to model saliency in images , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Wilson S. Geisler,et al.  Real-time simulation of arbitrary visual fields , 2002, ETRA.

[13]  Nuno Vasconcelos,et al.  The discriminant center-surround hypothesis for bottom-up saliency , 2007, NIPS.

[14]  Terrence J. Sejnowski,et al.  Unsupervised Learning , 2018, Encyclopedia of GIS.

[15]  Refractor Vision , 2000, The Lancet.

[16]  Marina Bloj,et al.  Real and predicted influence of image manipulations on eye movements during scene recognition. , 2010, Journal of vision.

[17]  J. V. van Hateren,et al.  Independent component filters of natural images compared with simple cells in primary visual cortex , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[18]  J. Rowe,et al.  What “Works” in Working Memory? Separate Systems for Selection and Updating of Critical Information , 2009, The Journal of Neuroscience.

[19]  M. Carrasco,et al.  Spatial covert attention increases contrast sensitivity across the CSF: support for signal enhancement , 2000, Vision Research.

[20]  Wilson S. Geisler,et al.  Real-time foveated multiresolution system for low-bandwidth video communication , 1998, Electronic Imaging.

[21]  Giedrius T Buracas,et al.  The Effect of Spatial Attention on Contrast Response Functions in Human Visual Cortex , 2007, The Journal of Neuroscience.

[22]  Tai Sing Lee,et al.  An Information-Theoretic Framework for Understanding Saccadic Eye Movements , 1999, NIPS.

[23]  Pierre Baldi,et al.  Bayesian surprise attracts human attention , 2005, Vision Research.

[24]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Sabine Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.