SURF features for efficient robot localization with omnidirectional images

Many robotic applications work with visual reference maps, which usually consist of sets of more or less organized images. In these applications, there is a compromise between the density of reference data stored and the capacity to identify later the robot localization, when it is not exactly in the same position as one of the reference views. Here we propose the use of a recently developed feature, SURF, to improve the performance of appearance-based localization methods that perform image retrieval in large data sets. This feature is integrated with a vision-based algorithm that allows both topological and metric localization using omnidirectional images in a hierarchical approach. It uses pyramidal kernels for the topological localization and three-view geometric constraints for the metric one. Experiments with several omnidirectional images sets are shown, including comparisons with other typically used features (radial lines and SIFT). The advantages of this approach are proved, showing the use of SURF as the best compromise between efficiency and accuracy in the results.

[1]  Yasushi Yagi,et al.  Map-based navigation for a mobile robot with omnidirectional image sensor COPIS , 1995, IEEE Trans. Robotics Autom..

[2]  José Santos-Victor,et al.  Vision-based navigation and environmental representations with an omnidirectional camera , 2000, IEEE Trans. Robotics Autom..

[3]  Olivier D. Faugeras,et al.  Self-Calibration of a 1D Projective Camera and Its Application to the Self-Calibration of a 2D Projective Camera , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[5]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[6]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[7]  Magnus Oskarsson,et al.  Solutions and Ambiguities of the Structure and Motion Problem for 1D Retinal Vision , 2000, Journal of Mathematical Imaging and Vision.

[8]  Jiun-Hung Chen,et al.  Panoramic appearance-based recognition of video contents using matching graphs , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Emanuele Menegatti,et al.  Image-based memory for robot navigation using properties of omnidirectional images , 2004, Robotics Auton. Syst..

[10]  Trevor Darrell,et al.  The pyramid match kernel: discriminative classification with sets of image features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[11]  Jana Kosecka,et al.  Global localization and relative positioning based on scale-invariant keypoints , 2005, Robotics Auton. Syst..

[12]  Ben J. A. Kröse,et al.  Hierarchical map building using visual landmarks and geometric constraints , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  L. Gool,et al.  Interactive museum guide : fast and robust recognition of museum objects , 2006 .

[14]  Luc Van Gool,et al.  Visual topological map building in self-similar environments , 2006, ICINCO-RA.

[15]  Luc Van Gool,et al.  Localization with omnidirectional images using the radial trifocal tensor , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[16]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17]  Udo Frese,et al.  Closing a Million-Landmarks Loop , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Luc Van Gool,et al.  From omnidirectional images to hierarchical localization , 2007, Robotics Auton. Syst..

[19]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .