Appearance-Based Loop Closure Detection for Online Large-Scale and Long-Term Operation

In appearance-based localization and mapping, loop-closure detection is the process used to determinate if the current observation comes from a previously visited location or a new one. As the size of the internal map increases, so does the time required to compare new observations with all stored locations, eventually limiting online processing. This paper presents an online loop-closure detection approach for large-scale and long-term operation. The approach is based on a memory management method, which limits the number of locations used for loop-closure detection so that the computation time remains under real-time constraints. The idea consists of keeping the most recent and frequently observed locations in a working memory (WM) that is used for loop-closure detection, and transferring the others into a long-term memory (LTM). When a match is found between the current location and one stored in WM, associated locations that are stored in LTM can be updated and remembered for additional loop-closure detections. Results demonstrate the approach's adaptability and scalability using ten standard datasets from other appearance-based loop-closure approaches, one custom dataset using real images taken over a 2-km loop of our university campus, and one custom dataset (7 h) using virtual images from the racing video game “Need for Speed: Most Wanted”.

[1]  Ronald Parr,et al.  DP-SLAM 2.0 , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[2]  Aram Kawewong,et al.  PIRF-Nav 2.0: Fast and online incremental appearance-based loop-closure detection in an indoor environment , 2011, Robotics Auton. Syst..

[3]  Juan D. Tardós,et al.  Large-Scale SLAM Building Conditionally Independent Local Maps: Application to Monocular Vision , 2008, IEEE Transactions on Robotics.

[4]  Paul Newman,et al.  FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[5]  Kurt Konolige,et al.  Towards lifelong visual maps , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Ben J. A. Kröse,et al.  Efficient data association for view based SLAM using connected dominating sets , 2009, Robotics Auton. Syst..

[7]  Juan I. Nieto,et al.  Tree of Words for Visual Loop Closure Detection in Urban SLAM , 2008, ICRA 2008.

[8]  Gordon Wyeth,et al.  FAB-MAP + RatSLAM: Appearance-based SLAM for multiple times of day , 2010, 2010 IEEE International Conference on Robotics and Automation.

[9]  A. Baddeley Human Memory: Theory and Practice, Revised Edition , 1990 .

[10]  Luis Miguel Bergasa,et al.  Real-time hierarchical stereo Visual SLAM in large-scale environments , 2010, Robotics Auton. Syst..

[11]  Sebastian Thrun,et al.  FastSLAM 2.0: An Improved Particle Filtering Algorithm for Simultaneous Localization and Mapping that Provably Converges , 2003, IJCAI.

[12]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[13]  Gordon Wyeth,et al.  Persistent Navigation and Mapping using a Biologically Inspired SLAM System , 2010, Int. J. Robotics Res..

[14]  Jean-Arcady Meyer,et al.  Fast and Incremental Method for Loop-Closure Detection Using Bags of Visual Words , 2008, IEEE Transactions on Robotics.

[15]  Michael Bosse,et al.  Histogram Matching and Global Initialization for Laser-only SLAM in Large Unstructured Environments , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[16]  Javier González,et al.  Toward a Unified Bayesian Approach to Hybrid Metric--Topological SLAM , 2008, IEEE Transactions on Robotics.

[17]  Dorian Gálvez-López,et al.  Real-time loop detection with bags of binary words , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Wolfram Burgard,et al.  Improved Techniques for Grid Mapping With Rao-Blackwellized Particle Filters , 2007, IEEE Transactions on Robotics.

[19]  Barbara Caputo,et al.  The more you learn, the less you store: Memory-controlled incremental SVM for visual place recognition , 2010, Image Vis. Comput..

[20]  Vincent Lepetit,et al.  View-based Maps , 2010, Int. J. Robotics Res..

[21]  Steven Mills,et al.  Bag‐of‐words‐driven, single‐camera simultaneous localization and mapping , 2011, J. Field Robotics.

[22]  Paul Newman,et al.  Highly scalable appearance-only SLAM - FAB-MAP 2.0 , 2009, Robotics: Science and Systems.

[23]  Paul Newman,et al.  SLAM-Loop Closing with Visually Salient Features , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[24]  Jean-Arcady Meyer,et al.  Incremental vision-based topological SLAM , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Ian D. Reid,et al.  Mapping Large Loops with a Single Hand-Held Camera , 2007, Robotics: Science and Systems.

[26]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  François Michaud,et al.  Memory management for real-time appearance-based loop closure detection , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  Gordon Wyeth,et al.  Mapping a Suburb With a Single Camera Using a Biologically Inspired SLAM System , 2008, IEEE Transactions on Robotics.

[29]  Joel W. Burdick,et al.  Springer Tracts in Advanced Robotics , 2004 .

[30]  Richard C. Atkinson,et al.  Human Memory: A Proposed System and its Control Processes , 1968, Psychology of Learning and Motivation.

[31]  Paul Newman,et al.  Outdoor SLAM using visual appearance and laser ranging , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[32]  Michael Bosse,et al.  Simultaneous Localization and Map Building in Large-Scale Cyclic Environments Using the Atlas Framework , 2004, Int. J. Robotics Res..

[33]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[34]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[35]  Ananth Ranganathan,et al.  PLISS: Detecting and Labeling Places Using Online Change-Point Detection , 2010, Robotics: Science and Systems.

[36]  Henrik I. Christensen,et al.  Closing the Loop With Graphical SLAM , 2007, IEEE Transactions on Robotics.

[37]  Giulio Fontana,et al.  Rawseeds ground truth collection systems for indoor self-localization and mapping , 2009, Auton. Robots.

[38]  Juan D. Tardós,et al.  Hierarchical SLAM: real-time accurate mapping of large environments , 2005, IEEE Transactions on Robotics.

[39]  Winston Churchill,et al.  The New College Vision and Laser Data Set , 2009, Int. J. Robotics Res..

[40]  Paul Newman,et al.  Navigating, Recognizing and Describing Urban Spaces With Vision and Lasers , 2009, Int. J. Robotics Res..

[41]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[42]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[43]  Lina María Paz,et al.  Divide and Conquer: EKF SLAM in O(n) , 2008, IEEE Trans. Robotics.

[44]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Gordon Wyeth,et al.  Continuous appearance-based trajectory SLAM , 2011, 2011 IEEE International Conference on Robotics and Automation.

[46]  Tom Duckett,et al.  An adaptive appearance-based map for long-term topological localization of mobile robots , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[47]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..