Optimizing visual attention models for predicting human fixations using Genetic Algorithms

Predicting where humans look in a scene is crucial in tasks like human-computer interaction, design, graphics, image and video compression, and gaze animation. This work proposes the use of a mixed-integer constraint Genetic Algorithm (GA) for searching the optimal parameters of a bio-inspired visual saliency model for accurate prediction of human eye fixations. Bioinspired visual saliency models are complex models, mimicking the primate visual system with a vast choice of design parameters that can be tuned to achieve optimal performance. The bottom-up visual attention model used in this study was trained on three challenging image datasets from the ImgSal database using a standard performance metric (area under Receiver Operating Characteristic curve) as the fitness. To compensate for any bias of the optimized model towards the standard metric, we use two other scoring metrics to assess performance. Performance comparisons with eight state-of-the-art models have been presented for all three scoring metrics. Results show that the proposed GA optimized visual attention model provides better prediction performance than several state-of-the-art models of visual attention.

[1]  Antonio Torralba,et al.  Modeling global scene factors in attention. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[2]  Jochen J. Steil,et al.  Integrating Inhomogeneous Processing and Proto-object Formation in a Computational Model of Visual Attention , 2009, Human Centered Robot Systems, Cognition, Interaction, Technology.

[3]  Martin D. Levine,et al.  Visual Saliency Based on Scale-Space Analysis in the Frequency Domain , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Bernhard Schölkopf,et al.  A Nonparametric Approach to Bottom-Up Visual Saliency , 2006, NIPS.

[5]  Antonio Torralba,et al.  Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. , 2006, Psychological review.

[6]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Kusum Deep,et al.  A real coded genetic algorithm for solving integer and mixed integer optimization problems , 2009, Appl. Math. Comput..

[8]  John K. Tsotsos,et al.  Saliency, attention, and visual search: an information theoretic approach. , 2009, Journal of vision.

[9]  Zhou Wang,et al.  Foveation scalable video coding with automatic fixation selection , 2003, IEEE Trans. Image Process..

[10]  Christof Koch,et al.  Learning a saliency map using fixated locations in natural scenes. , 2011, Journal of vision.

[11]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[12]  Tim K Marks,et al.  SUN: A Bayesian framework for saliency using natural statistics. , 2008, Journal of vision.

[13]  Kusum Deep,et al.  A new crossover operator for real coded genetic algorithms , 2007, Appl. Math. Comput..

[14]  Peyman Milanfar,et al.  Static and space-time visual saliency detection by self-resemblance. , 2009, Journal of vision.

[15]  Kusum Deep,et al.  A new mutation operator for real coded genetic algorithms , 2007, Appl. Math. Comput..

[16]  Sven Leyffer,et al.  Applications and algorithms for mixed integer nonlinear programming , 2009 .

[17]  Peyman Milanfar,et al.  Nonparametric bottom-up saliency detection by self-resemblance , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Christof Koch,et al.  Feature combination strategies for saliency-based visual attention systems , 2001, J. Electronic Imaging.

[19]  Ali Borji,et al.  State-of-the-Art in Visual Attention Modeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Frédo Durand,et al.  A Benchmark of Computational Models of Saliency to Predict Human Fixations , 2012 .

[21]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[22]  Christof Koch,et al.  Modeling attention to salient proto-objects , 2006, Neural Networks.

[23]  R. Neil Braithwaite,et al.  Hierarchical Gabor filters for object detection in infrared images , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[24]  David Salesin,et al.  Gaze-based interaction for semi-automatic photo cropping , 2006, CHI.

[25]  Nicolas Riche,et al.  Rare: A new bottom-up saliency model , 2012, 2012 19th IEEE International Conference on Image Processing.

[26]  Christof Koch,et al.  Comparison of feature combination strategies for saliency-based visual attention systems , 1999, Electronic Imaging.

[27]  Asha Iyer,et al.  Components of bottom-up gaze allocation in natural images , 2005, Vision Research.

[28]  Peng Bian,et al.  Visual saliency: a biologically plausible contourlet-like frequency domain approach , 2010, Cognitive Neurodynamics.