Visual loop closing using multi-resolution SIFT grids in metric-topological SLAM

We present an image based simultaneous localization and mapping (SLAM) framework with online, appearance only loop closing. We adopt a layered approach with metric maps over small areas at the local level and a global, graph based abstract topological framework to build consistent maps over large distances. Rao-Blackwellised particle filtering and sparse bundle adjustment are efficiently coupled with a stereo vision based odometry module to construct conditionally independent `submaps' using SIFT features. By extracting keyframes from these submaps, a multiresolution dictionary of distinct features is built online to learn a generative model of appearance and perform loop closure. Creating such a dictionary also enables the system to distinguish between similar regions during loop closure without requiring any offline training, as has been described in other approaches. Furthermore, instead of occupancy or grid maps, we build 3D reconstructions of the world; a model we plan to use as input to a scene interpretation module for providing navigational cues to the visually impaired. We demonstrate the robustness of our SLAM system with indoor and outdoor experiments for full 6 degrees of freedom motion using only a stereo camera in hand, running at 1 Hz on a standard PC.

[1]  David G. Lowe,et al.  Shape indexing using approximate nearest-neighbour search in high-dimensional spaces , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Paul Newman,et al.  Probabilistic Appearance Based Navigation and Loop Closing , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[3]  Juan D. Tardós,et al.  Hierarchical SLAM: real-time accurate mapping of large environments , 2005, IEEE Transactions on Robotics.

[4]  Hugh F. Durrant-Whyte,et al.  Simultaneous map building and localization for an autonomous mobile robot , 1991, Proceedings IROS '91:IEEE/RSJ International Workshop on Intelligent Robots and Systems '91.

[5]  Jason Jianjun Gu,et al.  Registration uncertainty for robot self-localization in 3D , 2005, The 2nd Canadian Conference on Computer and Robot Vision (CRV'05).

[6]  David W. Murray,et al.  Improving the Agility of Keyframe-Based SLAM , 2008, ECCV.

[7]  James R. Bergen,et al.  Visual odometry for ground vehicle applications , 2006, J. Field Robotics.

[8]  Paul Newman,et al.  Outdoor SLAM using visual appearance and laser ranging , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[9]  Peter Cheeseman,et al.  On the Representation and Estimation of Spatial Uncertainty , 1986 .

[10]  Sebastian Thrun,et al.  FastSLAM 2.0: An Improved Particle Filtering Algorithm for Simultaneous Localization and Mapping that Provably Converges , 2003, IJCAI.

[11]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[12]  Kevin P. Murphy,et al.  Bayesian Map Learning in Dynamic Environments , 1999, NIPS.

[13]  Niko Sünderhauf,et al.  Towards using sparse bundle adjustment for robust stereo odometry in outdoor terrain , 2006 .

[14]  Ben J. A. Kröse,et al.  Hierarchical map building using visual landmarks and geometric constraints , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[15]  Javier González,et al.  Consistent observation grouping for generating metric-topological maps that improves robot localization , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[16]  Larry H. Matthies,et al.  Error modeling in stereo navigation , 1986, IEEE J. Robotics Autom..

[17]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[18]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  James J. Little,et al.  A Study of the Rao-Blackwellised Particle Filter for Efficient and Accurate Vision-Based SLAM , 2006, International Journal of Computer Vision.

[20]  David Nistér,et al.  Preemptive RANSAC for live structure and motion estimation , 2005, Machine Vision and Applications.

[21]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[22]  Javier González,et al.  Toward a Unified Bayesian Approach to Hybrid Metric--Topological SLAM , 2008, IEEE Transactions on Robotics.

[23]  Ian D. Reid,et al.  Mapping Large Loops with a Single Hand-Held Camera , 2007, Robotics: Science and Systems.