Loop Closure Detection by Algorithmic Information Theory: Implemented on Range and Camera Image Data

In this paper the problem of loop closing from depth or camera image information in an unknown environment is investigated. A sparse model is constructed from a parametric dictionary for every range or camera image as mobile robot observations. In contrast to high-dimensional feature-based representations, in this model, the dimension of the sensor measurements' representations is reduced. Considering the loop closure detection as a clustering problem in high-dimensional space, little attention has been paid to the curse of dimensionality in the existing state-of-the-art algorithms. In this paper, a representation is developed from a sparse model of images, with a lower dimension than original sensor observations. Exploiting the algorithmic information theory, the representation is developed such that it has the geometrically transformation invariant property in the sense of Kolmogorov complexity. A universal normalized metric is used for comparison of complexity based representations of image models. Finally, a distinctive property of normalized compression distance is exploited for detecting similar places and rejecting incorrect loop closure candidates. Experimental results show efficiency and accuracy of the proposed method in comparison to the state-of-the-art algorithms and some recently proposed methods.

[1]  Hong Zhang,et al.  BoRF: Loop-closure detection with scale invariant visual features , 2011, 2011 IEEE International Conference on Robotics and Automation.

[2]  Pere-Pau Vázquez,et al.  Using Normalized Compression Distance for image similarity measurement: an experimental study , 2011, The Visual Computer.

[3]  S. Mallat,et al.  Adaptive greedy approximations , 1997 .

[4]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[5]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[6]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[7]  Dorian Gálvez-López,et al.  Bags of Binary Words for Fast Place Recognition in Image Sequences , 2012, IEEE Transactions on Robotics.

[8]  Dorian Gálvez-López,et al.  Robust Place Recognition With Stereo Sequences , 2012, IEEE Transactions on Robotics.

[9]  Thomas B. Schön,et al.  Learning to close loops from range data , 2011, Int. J. Robotics Res..

[10]  Gordon Wyeth,et al.  SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights , 2012, 2012 IEEE International Conference on Robotics and Automation.

[11]  Jean-Arcady Meyer,et al.  Real-time visual loop-closure detection , 2008, 2008 IEEE International Conference on Robotics and Automation.

[12]  John K. Tsotsos,et al.  Histogram of Oriented Uniform Patterns for robust place recognition and categorization , 2012, Int. J. Robotics Res..

[13]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[15]  Alexandros Nanopoulos,et al.  Hubs in Space: Popular Nearest Neighbors in High-Dimensional Data , 2010, J. Mach. Learn. Res..

[16]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Markus Flierl,et al.  Graph-Preserving Sparse Nonnegative Matrix Factorization With Application to Facial Expression Recognition , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[18]  Jana Kosecka,et al.  Localization in Urban Environments Using a Panoramic Gist Descriptor , 2013, IEEE Transactions on Robotics.

[19]  E. D. Livshitz On the optimality of the Orthogonal Greedy Algorithm for µ-coherent dictionaries , 2012, J. Approx. Theory.

[20]  Ming Li,et al.  Clustering by compression , 2003, IEEE International Symposium on Information Theory, 2003. Proceedings..

[21]  Jean-Arcady Meyer,et al.  Fast and Incremental Method for Loop-Closure Detection Using Bags of Visual Words , 2008, IEEE Transactions on Robotics.

[22]  Juan Andrade-Cetto,et al.  Information-Based Compact Pose SLAM , 2010, IEEE Transactions on Robotics.

[23]  Paul Newman,et al.  Appearance-only SLAM at large scale with FAB-MAP 2.0 , 2011, Int. J. Robotics Res..

[24]  Cyrill Stachniss,et al.  Information-theoretic compression of pose graphs for laser-based SLAM , 2012, Int. J. Robotics Res..

[25]  Hongbin Zha,et al.  Coarse-to-fine vision-based localization by indexing scale-Invariant features , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[26]  Teresa A. Vidal-Calleja,et al.  Action Selection for Single-Camera SLAM , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[27]  Paul Newman,et al.  Describing, Navigating and Recognising Urban Spaces - Building an End-to-End SLAM System , 2007, ISRR.

[28]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[29]  Vincenzo Caglioti,et al.  An information-based exploration strategy for environment mapping with mobile robots , 2010, Robotics Auton. Syst..

[30]  Xuelong Li,et al.  Rank Preserving Sparse Learning for Kinect Based Scene Classification , 2013, IEEE Transactions on Cybernetics.

[31]  Wolfram Burgard,et al.  Place recognition in 3D scans using a combination of bag of words and point feature based relative pose estimation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  Keith R. Godfrey,et al.  The generation of binary and near-binary pseudorandom signals: an overview , 2002, IEEE Trans. Instrum. Meas..

[33]  Michael B. Wakin,et al.  Analysis of Orthogonal Matching Pursuit Using the Restricted Isometry Property , 2009, IEEE Transactions on Information Theory.

[34]  Emre Ertin,et al.  On the Relation Between Sparse Reconstruction and Parameter Estimation With Model Order Selection , 2010, IEEE Journal of Selected Topics in Signal Processing.

[35]  Pascal Frossard,et al.  Minimum Distance between Pattern Transformation Manifolds: Algorithm and Applications , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Pascal Frossard,et al.  Sparse Approximation Using M-Term Pursuit and Application in Image and Video Coding , 2012, IEEE Transactions on Image Processing.

[37]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[38]  Pascal Frossard,et al.  Optimal Image Alignment With Random Projections of Manifolds: Algorithm and Geometric Analysis , 2011, IEEE Transactions on Image Processing.

[39]  Paul Newman,et al.  Detecting Loop Closure with Scene Sequences , 2007, International Journal of Computer Vision.

[40]  Jane Labadin,et al.  Feature selection based on mutual information , 2015, 2015 9th International Conference on IT in Asia (CITA).

[41]  Mihai Datcu,et al.  A fast compression-based similarity measure with applications to content-based image retrieval , 2012, J. Vis. Commun. Image Represent..

[42]  Pascal Frossard,et al.  Low-rate and flexible image coding with redundant representations , 2006, IEEE Transactions on Image Processing.

[43]  Paul Newman,et al.  FAB-MAP 3D: Topological mapping with spatial and visual appearance , 2010, 2010 IEEE International Conference on Robotics and Automation.

[44]  Andreas Nüchter,et al.  Automatic appearance-based loop detection from three-dimensional laser data using the normal distributions transform , 2009 .

[45]  Bin Ma,et al.  The similarity metric , 2001, IEEE Transactions on Information Theory.