A Novel Biologically Inspired Visual Saliency Model

The paper focuses on the modeling of visual saliency. We present a novel model to simulate the two stages of visual processing that are involved in attention. Firstly, the proto-object features are extracted in the pre-attentive stage. On the one hand, the salient pixels and regions are extracted. On the other hand, the semantic proto-objects, which involve all possible states of the observer’s memories such as face, person, car, and text, are detected. Then, the support vector machines are utilized to simulate the learning process. As a consequence, the association between the proto-object features and the salient information is established. A visual attention model is built via the method of machine learning, and the saliency information of a new image can be obtained by the way of reasoning. To validate the model, the eye fixations prediction problem on the MIT dataset is studied. Experimental results indicate that the proposed model effectively improves the predictive accuracy rates compared with other approaches.

[1]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[2]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[3]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[4]  Michael L. Mack,et al.  VISUAL SALIENCY DOES NOT ACCOUNT FOR EYE MOVEMENTS DURING VISUAL SEARCH IN REAL-WORLD SCENES , 2007 .

[5]  Chih-Jen Lin,et al.  A dual coordinate descent method for large-scale linear SVM , 2008, ICML '08.

[6]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[7]  Nuno Vasconcelos,et al.  On the plausibility of the discriminant center-surround hypothesis for visual saliency. , 2008, Journal of vision.

[8]  Laurent Itti,et al.  Biologically-inspired robotics vision monte-carlo localization in the outdoor environment , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[10]  J. Jonides Towards a model of the mind's eye's movement. , 1980, Canadian journal of psychology.

[11]  J. Hupé,et al.  Bistability for audiovisual stimuli: Perceptual decision is modality specific. , 2008, Journal of vision.

[12]  Hubert Konik,et al.  A Spatiotemporal Saliency Model for Video Surveillance , 2011, Cognitive Computation.

[13]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[15]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[16]  Martin D. Levine,et al.  Visual Saliency Based on Scale-Space Analysis in the Frequency Domain , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Tien-Tsin Wong,et al.  Resizing by symmetry-summarization , 2010, ACM Trans. Graph..

[19]  Laurent Itti,et al.  A Bayesian model for efficient visual search and recognition , 2010, Vision Research.

[20]  Gaia Scerif,et al.  Using developmental cognitive neuroscience to study behavioral and attentional control. , 2009, Developmental psychobiology.

[21]  Amir Hussain,et al.  Cognitive Computation: An Introduction , 2009, Cognitive Computation.

[22]  Neil M. Robertson,et al.  Visual Saliency from Image Features with Application to Compression , 2012, Cognitive Computation.

[23]  Ronald A. Rensink Seeing, sensing, and scrutinizing , 2000, Vision Research.

[24]  Mubarak Shah,et al.  Visual attention detection in video sequences using spatiotemporal cues , 2006, MM '06.

[25]  K. Fujii,et al.  Visualization for the analysis of fluid motion , 2005, J. Vis..

[26]  Susan L. Franzel,et al.  Guided search: an alternative to the feature integration model for visual search. , 1989, Journal of experimental psychology. Human perception and performance.

[27]  I. THE ATTENTION SYSTEM OF THE HUMAN BRAIN , 2002 .

[28]  Frédo Durand,et al.  Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[29]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[30]  Chucai Yi,et al.  Text String Detection From Natural Scenes by Structure-Based Partition and Grouping , 2011, IEEE Transactions on Image Processing.

[31]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[32]  L. Itti,et al.  Modeling the influence of task on attention , 2005, Vision Research.

[33]  Anna Esposito,et al.  The Perceptual and Cognitive Role of Visual and Auditory Channels in Conveying Emotional Information , 2009, Cognitive Computation.

[34]  Lie Lu,et al.  A generic framework of user attention model and its application in video summarization , 2005, IEEE Trans. Multim..

[35]  M. Posner,et al.  Attention, self-regulation and consciousness. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.