A Learning Algorithm for Place Recognition

We present a place recognition algorithm for SLAM systems using stereo cameras that considers both appearance and geometric information. Both near and far scene points provide information for the recognition process. Hypotheses about loop closings are generated using a fast appearance technique based on the bag-of-words (BoW) method. Loop closing candidates are evaluated in the context of recent images in the sequence. In cases where similarity is not sufficiently clear, loop closing verification is carried out using a method based on Conditional Random Fields (CRFs). We compare our system with the state of the art using visual indoor and outdoor data from the RAWSEEDS project, and a multisession outdoor dataset obtained at the MIT campus. Our system achieves higher recall (less false negatives) for full precision (no false positives), as compared with the state of the art. It is also more robust to changes in appearance of places because of changes in illumination (different shadow configurations in different days or time of day). We discuss the promise of learning algorithms such as ours, where learning can be modified on-line to re-evaluate the knowledge that the system has about a changing environment.

[1]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[2]  Paul Newman,et al.  FAB-MAP 3D: Topological mapping with spatial and visual appearance , 2010, 2010 IEEE International Conference on Robotics and Automation.

[3]  Dieter Fox,et al.  Learning to Associate Image Features with CRF-Matching , 2008, ISER.

[4]  Jan Faigl,et al.  Simple yet stable bearing-only navigation , 2010 .

[5]  Paul Newman,et al.  Appearance-only SLAM at large scale with FAB-MAP 2.0 , 2011, Int. J. Robotics Res..

[6]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Achim J. Lilienthal,et al.  SIFT, SURF & seasons: Appearance-based long-term localization in outdoor environments , 2010, Robotics Auton. Syst..

[8]  Fabio Tozeto Ramos,et al.  Robust place recognition with stereo cameras , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Dorian Gálvez-López,et al.  CI-Graph simultaneous localization and mapping for three-dimensional reconstruction of large and complex environments using a multicamera system , 2010, J. Field Robotics.

[10]  Paul Newman,et al.  FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[11]  Jean-Arcady Meyer,et al.  Fast and Incremental Method for Loop-Closure Detection Using Bags of Visual Words , 2008, IEEE Transactions on Robotics.

[12]  Timothy D. Barfoot,et al.  Visual teach and repeat for long-range rover autonomy , 2010 .

[13]  Wolfram Burgard,et al.  CRF-Matching: Conditional Random Fields for Feature-Based Scan Matching , 2008 .

[14]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  John J. Leonard,et al.  Place Recognition using Near and Far Visual Information , 2011 .