Affective saliency map considering psychological distance

This paper proposes a new affective saliency map (SM) model considering psychological distance as well as the pop-out property based on relative spatial distribution of the primitive visual features such as intensity, edge, color, and orientation. By reflecting congruency between the spatial distance caused by spatial proximity and distal in a visual scene and psychological distance caused by the way people think about visual stimuli, the proposed SM model can produce more human-like visual selective attention than a conventional SM model based on primary visual perception. In the proposed model, a psychological distance caused by a social distance, in which a proximal entity such as friend becomes more attractive when it is located near but a distal entity such as enemy becomes more attractive when it is located far from an observer, is considered. In the experiments, two types of visual stimuli are considered, mono-stimuli and stereo-stimuli. In the case of mono-stimuli, the visual stimuli on a picture with psychological depth cues were considered. Instead, in the case of stereo-stimuli, depth perception is also considered for obtaining real spatial distance of visual target in a visual scene. In order to verify the proposed affective SM model, an eye tracking system was used to measure the visual scan path and fixation time on a specific local area while monitoring the visual scenes by human subjects. Experimental results show that the proposed model can generate plausible visual selective attention properly reflecting both psychological distance and primitive visual stimuli inducing pop-out bottom-up features.

[1]  Ken Nakayama,et al.  Serial and parallel processing of visual feature conjunctions , 1986, Nature.

[2]  Antonio Fernández-Caballero,et al.  Dynamic stereoscopic selective visual attention (DSSVA): Integrating motion and shape with depth in video segmentation , 2008, Expert Syst. Appl..

[3]  Anne Treisman,et al.  Features and objects in visual processing , 1986 .

[4]  Laurent Itti,et al.  An Integrated Model of Top-Down and Bottom-Up Attention for Optimizing Detection Speed , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[6]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[7]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[8]  Minho Lee,et al.  Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment , 2008, Neural Networks.

[9]  R Held,et al.  The development of eye alignment, convergence, and sensory binocularity in young infants. , 1994, Investigative ophthalmology & visual science.

[10]  Fiora Pirri,et al.  A biologically plausible robot attention model, based on space and time , 2006, Cognitive Processing.

[11]  Henrik I. Christensen,et al.  Visual Attention Using Game Theory , 2002, Biologically Motivated Computer Vision.

[12]  Minho Lee,et al.  Biologically motivated vergence control system using human-like selective attention model , 2006, Neurocomputing.

[13]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[14]  Minho Lee,et al.  Saliency map model with adaptive masking based on independent component analysis , 2002, Neurocomputing.

[15]  Kunihiko Fukushima,et al.  Use of non-uniform spatial blur for image comparison: symmetry axis extraction , 2005, Neural Networks.

[16]  Atsuto Maki,et al.  Attentional Scene Segmentation: Integrating Depth and Motion , 2000, Comput. Vis. Image Underst..

[17]  Robert Ward,et al.  Emotion recognition following human pulvinar damage , 2007, Neuropsychologia.

[18]  L. Itti,et al.  Visual causes versus correlates of attentional selection in dynamic scenes , 2006, Vision Research.

[19]  J. Gallant,et al.  Goal-Related Activity in V4 during Free Viewing Visual Search Evidence for a Ventral Stream Visual Salience Map , 2003, Neuron.

[20]  E. Mizutani,et al.  Neuro-Fuzzy and Soft Computing-A Computational Approach to Learning and Machine Intelligence [Book Review] , 1997, IEEE Transactions on Automatic Control.

[21]  Yaacov Trope,et al.  Automatic processing of psychological distance: evidence from a Stroop task. , 2007, Journal of experimental psychology. General.

[22]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[23]  Pietro Perona,et al.  Selective visual attention enables learning and recognition of multiple objects in cluttered scenes , 2005, Comput. Vis. Image Underst..

[24]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.