Learning Integrated Holism-Landmark Representations for Long-Term Loop Closure Detection

Loop closure detection is a critical component of large-scale simultaneous localization and mapping (SLAM) in loopy environments. It is challenging in long-term SLAM, where the appearance of the environment varies significantly across times of day, months, and even seasons. In this paper, we introduce a novel formulation that learns an integrated long-term representation from both holistic and landmark information, unifying two previous insights in a single framework: (1) holistic representations outperform keypoint-based representations, and (2) landmarks, as an intermediate representation, provide informative cues for detecting challenging locations. Our approach learns the representation by projecting input visual data into a low-dimensional space that preserves both the global consistency (to minimize representation error) and the local consistency (to preserve landmarks’ pairwise relationships) of the input data. To solve the formulated optimization problem, we develop a new algorithm with theoretically guaranteed convergence. Extensive experiments on two large-scale public benchmark data sets demonstrate the effectiveness of the proposed approach.
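
To make the idea of a projection that preserves both global and local consistency more concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes a simplified squared-norm objective: a PCA-style reconstruction term for global consistency plus a Laplacian (locality-preserving) term for local consistency, which together admit a closed-form eigenvector solution. The function name `learn_projection`, the Gaussian affinity, and the parameters `lam` and `sigma` are illustrative assumptions that do not come from the paper.

```python
import numpy as np


def learn_projection(X, r, lam=0.1, sigma=1.0):
    """Illustrative projection learning (NOT the paper's algorithm).

    X   : (d, n) array, one feature vector per column (assumed layout).
    r   : target dimensionality.
    lam : hypothetical weight on the local-consistency term.
    sigma : hypothetical bandwidth of the Gaussian affinity.

    Minimizes, over W with orthonormal columns,
        ||X - W W^T X||_F^2 + lam * tr(W^T X L X^T W),
    where L is the graph Laplacian of a Gaussian affinity matrix.
    """
    # Center the data so the reconstruction term matches PCA.
    X = X - X.mean(axis=1, keepdims=True)

    # Pairwise squared Euclidean distances between columns.
    sq = np.sum(X ** 2, axis=0)
    D2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X.T @ X), 0.0)

    # Gaussian affinity (assumption) and its unnormalized graph Laplacian.
    A = np.exp(-D2 / (2.0 * sigma ** 2))
    np.fill_diagonal(A, 0.0)
    L = np.diag(A.sum(axis=1)) - A

    # With W^T W = I, the reconstruction error equals
    # tr(X X^T) - tr(W^T X X^T W), so the objective reduces to
    # tr(W^T M W) + const with M = lam * X L X^T - X X^T.
    M = lam * (X @ L @ X.T) - X @ X.T
    M = (M + M.T) / 2.0  # symmetrize against round-off

    # eigh returns eigenvalues in ascending order; the minimizer is
    # spanned by the eigenvectors of the r smallest eigenvalues.
    _, eigvecs = np.linalg.eigh(M)
    return eigvecs[:, :r]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(8, 30))   # synthetic stand-in for image features
    W = learn_projection(X, r=3)
    Z = W.T @ (X - X.mean(axis=1, keepdims=True))  # low-dimensional codes
    print(W.shape, Z.shape)
```

With `lam=0` this sketch reduces to ordinary PCA; larger values of `lam` trade reconstruction accuracy for keeping neighboring inputs close in the projected space. The formulation in the paper may use different norms and requires an iterative solver, which this closed-form simplification does not reproduce.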
