Top-down visual attention control based on a particle filter for human-interactive robots

A novel visual attention model based on a particle filter is that also has a filter-type feature, (2) a compact model independent of the high-level processes, and (3) a unitary model that naturally integrates top-down modulation and bottom-up processes. These features allow the model to be applied simply to robots and to be easily understood by the developers. In this paper, we first briefly discuss human visual attention, computational models for bottom-up attention, and attentional metaphors. We then describe the proposed model and its top-down control interface. Finally, three experiments demonstrate the potential of the proposed model as an attentional metaphor and top-down attention control interface.

[1]  L. Itti,et al.  Modeling the influence of task on attention , 2005, Vision Research.

[2]  S. A. Hillyard,et al.  Sustained division of the attentional spotlight , 2003, Nature.

[3]  George K. I. Mann,et al.  An Object-Based Visual Attention Model for Robotic Applications , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[4]  Gordon Cheng,et al.  Biologically Based Top-Down Attention Modulation for Humanoid Interactions , 2008, Int. J. Humanoid Robotics.

[5]  Antonio Torralba,et al.  Top-down control of visual attention in object detection , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[6]  Tingting Xu,et al.  Attentional Object Detection with an Active Multi-Focal Vision System , 2010, Int. J. Humanoid Robotics.

[7]  Nong Sang,et al.  A Biologically-Inspired Top-Down Learning Model Based on Visual Attention , 2010, 2010 20th International Conference on Pattern Recognition.

[8]  George K. I. Mann,et al.  Task-driven moving object detection for robots using visual attention , 2007, 2007 7th IEEE-RAS International Conference on Humanoid Robots.

[9]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[10]  Nuno Vasconcelos,et al.  Integrated learning of saliency, complex features, and object detectors from cluttered scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Kunio Kashino,et al.  A stochastic model of human visual attention with a dynamic Bayesian network , 2010, ArXiv.

[12]  Brian Scassellati,et al.  A Context-Dependent Attention System for a Social Robot , 1999, IJCAI.