A Probabilistic Model of Overt Visual Attention for Cognitive Robots

Visual attention is one of the major requirements for a robot to serve as a cognitive companion for human. The robotic visual attention is mostly concerned with overt attention which accompanies head and eye movements of a robot. In this case, each movement of the camera head triggers a number of events, namely transformation of the camera and the image coordinate systems, change of content of the visual field, and partial appearance of the stimuli. All of these events contribute to the reduction in probability of meaningful identification of the next focus of attention. These events are specific to overt attention with head movement and, therefore, their effects are not addressed in the classical models of covert visual attention. This paper proposes a Bayesian model as a robot-centric solution for the overt visual attention problem. The proposed model, while taking inspiration from the primates visual attention mechanism, guides a robot to direct its camera toward behaviorally relevant and/or visually demanding stimuli. A particle filter implementation of this model addresses the challenges involved in overt attention with head movement. Experimental results demonstrate the performance of the proposed model.

[1]  Lawrence W. Stark,et al.  Top-down guided eye movements , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[2]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[3]  Robert B. Fisher,et al.  Object-based visual attention for computer vision , 2003, Artif. Intell..

[4]  Andrés Faiña,et al.  An adaptive detection/attention mechanism for real time robot operation , 2009, Neurocomputing.

[5]  Nicolas P. Rougier,et al.  A Distributed Model of Spatial Visual Attention , 2005, Biomimetic Neural Learning for Intelligent Robots.

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  Linda J. Lanyon,et al.  A model of active visual search with object-based attention guiding scan paths , 2004, Neural Networks.

[8]  M. Posner,et al.  Components of visual orienting , 1984 .

[9]  David E. Irwin,et al.  Visual Search has Memory , 2001, Psychological science.

[10]  R. Desimone,et al.  Competitive Mechanisms Subserve Attention in Macaque Areas V2 and V4 , 1999, The Journal of Neuroscience.

[11]  Stefan Schaal,et al.  Overt visual attention for a humanoid robot , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[12]  Pierre Baldi,et al.  A principled approach to detecting surprising events in video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Minoru Asada,et al.  Acquisition of human-robot joint attention through real-time natural interaction , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[14]  Erich Rome,et al.  GPU-accelerated affordance cueing based on visual attention , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Fiora Pirri,et al.  Bottom-Up Gaze Shifts and Fixations Learning by Imitation , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[16]  Leslie G. Ungerleider,et al.  The neural basis of biased competition in human visual cortex , 2001, Neuropsychologia.

[17]  James J. Clark,et al.  Modal Control Of An Attentive Vision System , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[18]  Laurent Itti,et al.  Top-down attention selection is fine grained. , 2006, Journal of vision.

[19]  Brian Scassellati,et al.  A Behavioral Analysis of Computational Models of Visual Attention , 2007, International Journal of Computer Vision.

[20]  Paolo Dario,et al.  Implementation of a neurophysiological model of saccadic eye movements on an anthropomorphic robotic head , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[21]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[22]  K. Cave The FeatureGate model of visual selection , 1999, Psychological research.

[23]  John Duncan,et al.  A neural basis for visual search in inferior temporal cortex , 1993, Nature.

[24]  José María Cañas,et al.  An Overt Visual Attention Mechanism Based on Saliency Dynamics , 2008 .

[25]  Helge J. Ritter,et al.  Integrating Context-Free and Context-Dependent Attentional Mechanisms for Gestural Object Reference , 2003, ICVS.

[26]  Joseph A. Driscoll,et al.  A visual attention network for a humanoid robot , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[27]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[28]  Peter Ford Dominey,et al.  A cortico-subcortical model for generation of spatially accurate sequential saccades. , 1992, Cerebral cortex.

[29]  Brian Scassellati,et al.  Active vision for sociable robots , 2001, IEEE Trans. Syst. Man Cybern. Part A.

[30]  Gordon Cheng,et al.  Distributed visual attention on a humanoid robot , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[31]  Masaki Ogino,et al.  Interaction rule learning with a human partner based on an imitation faculty with a simple visuo-motor mapping , 2006, Robotics Auton. Syst..

[32]  Philippe Gaussier,et al.  A context and task dependent visual attention system to control a mobile robot , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[34]  Gustavo Deco,et al.  Computational neuroscience of vision , 2002 .

[35]  Bärbel Mertsching,et al.  Playing Domino: A Case Study for an Active Vision System , 1999, ICVS.

[36]  Alexandre Bernardino,et al.  Multimodal saliency-based bottom-up attention a framework for the humanoid robot iCub , 2008, 2008 IEEE International Conference on Robotics and Automation.

[37]  Simone Frintrop,et al.  Active gaze control for attentional visual SLAM , 2008, 2008 IEEE International Conference on Robotics and Automation.

[38]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[39]  Alexander Zelinsky,et al.  A Reactive Vision System: Active-Dynamic Saliency , 2007 .

[40]  José Santos-Victor,et al.  A Developmental Roadmap for Learning by Imitation in Robots , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[41]  David W. Murray,et al.  Simultaneous Localization and Map-Building Using Active Vision , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  James J. Clark Spatial attention and saccadic camera motion , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[43]  James J. Little,et al.  Autonomous vision-based exploration and mapping using hybrid maps and Rao-Blackwellised particle filters , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[44]  David G. Lowe,et al.  Local feature view clustering for 3D object recognition , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[45]  Giulio Sandini,et al.  Object-based Visual Attention: a Model for a Behaving Robot , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[46]  Viii Supervisor Sonar-Based Real-World Mapping and Navigation , 2001 .

[47]  Sven J. Dickinson,et al.  Active Object Recognition Integrating Attention and Viewpoint Control , 1994, Comput. Vis. Image Underst..

[48]  T. Duckett,et al.  VOCUS : A Visual Attention System for Object Detection and Goal-directed Search , 2010 .

[49]  M. A. Tanner,et al.  Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, 3rd Edition , 1998 .

[50]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.