Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment

We propose new integrated saliency map and selective motion analysis models partly inspired by a biological visual attention mechanism. The proposed models consider not only binocular stereopsis to identify a final attention area so that the system focuses on the closer area as in human binocular vision, based on the single eye alignment hypothesis, but also both the static and dynamic features of an input scene. Moreover, the proposed saliency map model includes an affective computing process that skips an unwanted area and pays attention to a desired area, which reflects the human preference and refusal in subsequent visual search processes. In addition, we show the effectiveness of considering the symmetry feature determined by a neural network and an independent component analysis (ICA) filter which are helpful to construct an object preferable attention model. Also, we propose a selective motion analysis model by integrating the proposed saliency map with a neural network for motion analysis. The neural network for motion analysis responds selectively to rotation, expansion, contraction and planar motion of the optical flow in a selected area. Experiments show that the proposed model can generate plausible scan paths and selective motion analysis results for natural input scenes.

[1]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[2]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[3]  Robert Ward,et al.  Emotion recognition following human pulvinar damage , 2007, Neuropsychologia.

[4]  L. Itti,et al.  Visual causes versus correlates of attentional selection in dynamic scenes , 2006, Vision Research.

[5]  K. Fukushima Extraction of visual motion and optic flow , 2007, Neural Networks.

[6]  R Held,et al.  The development of eye alignment, convergence, and sensory binocularity in young infants. , 1994, Investigative ophthalmology & visual science.

[7]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[8]  Jun Saiki,et al.  Stochastic Guided Search Model for Search Asymmetries in Visual Search Tasks , 2002, Biologically Motivated Computer Vision.

[9]  Kunihiko Fukushima,et al.  Use of non-uniform spatial blur for image comparison: symmetry axis extraction , 2005, Neural Networks.

[10]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[11]  Mark W Pettet,et al.  Temporal dynamics of the human response to symmetry. , 2002, Journal of vision.

[12]  A. Guyton,et al.  Textbook of Medical Physiology , 1961 .

[13]  Minho Lee,et al.  Saliency map model with adaptive masking based on independent component analysis , 2002, Neurocomputing.

[14]  Fiora Pirri,et al.  A biologically plausible robot attention model, based on space and time , 2006, Cognitive Processing.

[15]  J. Duncan Selective attention and the organization of visual information. , 1984, Journal of experimental psychology. General.

[16]  Atsuto Maki,et al.  Attentional Scene Segmentation: Integrating Depth and Motion , 2000, Comput. Vis. Image Underst..

[17]  Henrik I. Christensen,et al.  Visual Attention Using Game Theory , 2002, Biologically Motivated Computer Vision.

[18]  Minho Lee,et al.  Biologically motivated vergence control system using human-like selective attention model , 2006, Neurocomputing.

[19]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[20]  J. Gallant,et al.  Goal-Related Activity in V4 during Free Viewing Visual Search Evidence for a Ventral Stream Visual Salience Map , 2003, Neuron.

[21]  Heinz Hügli,et al.  Computing visual attention from scene depth , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[22]  Ken Nakayama,et al.  Serial and parallel processing of visual feature conjunctions , 1986, Nature.

[23]  Antonio Fernández-Caballero,et al.  Dynamic stereoscopic selective visual attention (DSSVA): Integrating motion and shape with depth in video segmentation , 2008, Expert Syst. Appl..

[24]  Laurent Itti,et al.  An Integrated Model of Top-Down and Bottom-Up Attention for Optimizing Detection Speed , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[25]  Anne Treisman,et al.  Features and objects in visual processing , 1986 .

[26]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[27]  C. Connor,et al.  Population coding of shape in area V4 , 2002, Nature Neuroscience.

[28]  Simone Frintrop,et al.  A Bimodal Laser-Based Attention System , 2005, Comput. Vis. Image Underst..

[29]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[30]  Minho Lee,et al.  Dynamic visual selective attention model , 2008, Neurocomputing.

[31]  Yehezkel Yeshurun,et al.  Context-free attentional operators: The generalized symmetry transform , 1995, International Journal of Computer Vision.

[32]  Dennis M. Levi,et al.  Mechanisms of perceptual learning for vernier acuity , 2002 .

[33]  Pietro Perona,et al.  Selective visual attention enables learning and recognition of multiple objects in cluttered scenes , 2005, Comput. Vis. Image Underst..

[34]  M. Paradiso,et al.  Neuroscience: Exploring the Brain , 1996 .