Data-and Model-driven Attention Mechanism for Autonomous Visual Landmark Acquisition

This paper presents a visual attention mechanism for the acquisition of landmarks in an arbitrary scene. The proposed mechanism consists of two consecutive selection stages. The first one employs classical preattentive saliency computa tions to select a reduced set of interest regions from the whole input image (data-driven stage). The second stage selects from the output of the first selection stage the region that can be considered as a potential landmark (model-driven stage). This potential landmark is the input of the attentive stage, that must characterize it and finally determine if this object is a real landmark. The used imaging sensor is a stereo vision system which is capable of providing depth data as well as color images. This stereo vision system is mounted on an autonomous mobile robot and serves map-building and localisation purposes. We present results achieved by applying the proposed visual attention scheme to on-line acquired stereo pairs of indoor and outdoor scenes.

[1]  E. Nebot,et al.  Simultaneous Localization and Map Building Using Natural features in Outdoor Environments , .

[2]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[3]  G. Backer,et al.  Two selection stages provide efficient object-based attentional control for dynamic vision , 2003 .

[4]  Bärbel Mertsching,et al.  Data- and Model-Driven Gaze Control for an Active-Vision System , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Francisco Sandoval Hernández,et al.  Bounded irregular pyramid: a new structure for color image segmentation , 2004, Pattern Recognit..

[6]  Atsuto Maki,et al.  Attentional Scene Segmentation: Integrating Depth and Motion , 2000, Comput. Vis. Image Underst..

[7]  Benjamin Kuipers,et al.  Local metrical and global topological maps in the hybrid spatial semantic hierarchy , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[8]  Carme Torras,et al.  Detection of natural landmarks through multiscale opponent features , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[9]  R.W. Ehrich,et al.  Computer image processing and recognition , 1981, Proceedings of the IEEE.

[10]  Simone Frintrop,et al.  An Attentive, Multi-modal Laser "Eye" , 2003, ICVS.

[11]  Robert B. Fisher,et al.  Object-based visual attention for computer vision , 2003, Artif. Intell..

[12]  Frédéric Lerasle,et al.  Environment modeling for topological navigation using visual landmarks and range data , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).