Role of Deep Learning in Loop Closure Detection for Visual and Lidar SLAM: A Survey

Loop closure detection is of vital importance in the process of simultaneous localization and mapping (SLAM), as it helps to reduce the cumulative error of the robot’s estimated pose and generate a consistent global map. Many variations of this problem have been considered in the past and the existing methods differ in the acquisition approach of query and reference views, the choice of scene representation, and associated matching strategy. Contributions of this survey are many-fold. It provides a thorough study of existing literature on loop closure detection algorithms for visual and Lidar SLAM and discusses their insight along with their limitations. It presents a taxonomy of state-of-the-art deep learning-based loop detection algorithms with detailed comparison metrics. Also, the major challenges of conventional approaches are identified. Based on those challenges, deep learning-based methods were reviewed where the identified challenges are tackled focusing on the methods providing long-term autonomy in various conditions such as changing weather, light, seasons, viewpoint, and occlusion due to the presence of mobile objects. Furthermore, open challenges and future directions were also discussed.

[1]  Fawzi Nashashibi,et al.  Multi-vehicle cooperative localization using indirect vehicle-to-vehicle relative pose estimation , 2012, 2012 IEEE International Conference on Vehicular Electronics and Safety (ICVES 2012).

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[4]  MemonAzam Rafique,et al.  Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems , 2020 .

[5]  Hao Qin,et al.  Loop closure detection in SLAM by combining visual CNN features and submaps , 2018, 2018 4th International Conference on Control, Automation and Robotics (ICCAR).

[6]  Dorian Gálvez-López,et al.  Real-time loop detection with bags of binary words , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Surya P. N. Singh,et al.  A Pipeline for the Segmentation and Classification of 3D Point Clouds , 2010, ISER.

[8]  Andreas Zell,et al.  Improving Feature-based Visual SLAM by Semantics , 2018, 2018 IEEE International Conference on Image Processing, Applications and Systems (IPAS).

[9]  Paul Newman,et al.  1 year, 1000 km: The Oxford RobotCar dataset , 2017, Int. J. Robotics Res..

[10]  Guoquan Huang,et al.  Lightweight Unsupervised Deep Loop Closure , 2018, Robotics: Science and Systems.

[11]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[12]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Javier González,et al.  Scene structure registration for localization and mapping , 2016, Robotics Auton. Syst..

[14]  Horst-Michael Groß,et al.  IRON: A fast interest point descriptor for robust NDT-map matching and its application to robot localization , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  VincentPascal,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010 .

[16]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[17]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[18]  Jiwen Lu,et al.  PCANet: A Simple Deep Learning Baseline for Image Classification? , 2014, IEEE Transactions on Image Processing.

[19]  Sei Ikeda,et al.  Visual SLAM algorithms: a survey from 2010 to 2016 , 2017, IPSJ Transactions on Computer Vision and Applications.

[20]  U-Xuan Tan,et al.  Seed: A Segmentation-Based Egocentric 3D Point Cloud Descriptor for Loop Closure Detection , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21]  Fabio Tozeto Ramos,et al.  Robust place recognition with stereo cameras , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[22]  Wolfram Burgard,et al.  Robust visual SLAM across seasons , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  Daniel Cremers,et al.  LDSO: Direct Sparse Odometry with Loop Closure , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[24]  Hesheng Wang,et al.  Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems , 2020, Robotics Auton. Syst..

[25]  Ian D. Reid,et al.  Mapping Large Loops with a Single Hand-Held Camera , 2007, Robotics: Science and Systems.

[26]  Tim Bailey,et al.  Scan segments matching for pairwise 3D alignment , 2012, 2012 IEEE International Conference on Robotics and Automation.

[27]  Dorian Gálvez-López,et al.  CI-Graph simultaneous localization and mapping for three-dimensional reconstruction of large and complex environments using a multicamera system , 2010, J. Field Robotics.

[28]  Yiyi Liao,et al.  Graph Regularized Auto-Encoders for Image Representation , 2017, IEEE Transactions on Image Processing.

[29]  C. Qi Deep Learning on Point Sets for 3 D Classification and Segmentation , 2016 .

[30]  Nikos A. Vlassis,et al.  The global k-means clustering algorithm , 2003, Pattern Recognit..

[31]  Ali Kashif Bashir,et al.  Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 2013, ICIRA 2013.

[32]  Tom Duckett,et al.  Scan registration for autonomous mining vehicles using 3D‐NDT , 2007, J. Field Robotics.

[33]  Renaud Dubé,et al.  SegMatch: Segment based place recognition in 3D point clouds , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Jörg Stückler,et al.  Multi-resolution surfel maps for efficient dense 3D modeling and tracking , 2014, J. Vis. Commun. Image Represent..

[35]  Xiaqing Ding,et al.  LocNet: Global Localization in 3D Point Clouds for Mobile Vehicles , 2017, 2018 IEEE Intelligent Vehicles Symposium (IV).

[36]  Peter Biber,et al.  The normal distributions transform: a new approach to laser scan matching , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[37]  Fu Zhang,et al.  A fast, complete, point cloud based loop closure for LiDAR odometry and mapping , 2019, ArXiv.

[38]  Wolfram Burgard,et al.  3-D Mapping With an RGB-D Camera , 2014, IEEE Transactions on Robotics.

[39]  Yo-Sung Ho,et al.  Loop Closure Detection in Simultaneous Localization and Mapping Using Learning Based Local Patch Descriptor , 2018 .

[40]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[41]  Reinhard Klein,et al.  Efficient RANSAC for Point‐Cloud Shape Detection , 2007, Comput. Graph. Forum.

[42]  Lei Zhang,et al.  Optimized LOAM Using Ground Plane Constraints and SegMatch-Based Loop Detection , 2019, Sensors.

[43]  Shaoming Zhang,et al.  A Novel Loop Closure Detection Approach Using Simplified Structure for Low-Cost LiDAR , 2020, Sensors.

[44]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Vincent Lepetit,et al.  Keypoint recognition using randomized trees , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Andrew Zisserman,et al.  MLESAC: A New Robust Estimator with Application to Estimating Image Geometry , 2000, Comput. Vis. Image Underst..

[48]  Paul Newman,et al.  FAB-MAP 3D: Topological mapping with spatial and visual appearance , 2010, 2010 IEEE International Conference on Robotics and Automation.

[49]  Tom Drummond,et al.  Unified Loop Closing and Recovery for Real Time Monocular SLAM , 2008, BMVC.

[50]  Javier González,et al.  Fast place recognition with plane-based maps , 2013, 2013 IEEE International Conference on Robotics and Automation.

[51]  Daniel Cremers,et al.  Direct Sparse Odometry , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52]  Masahiro Tomono Loop detection for 3D LiDAR SLAM using segment-group matching , 2020, Adv. Robotics.

[53]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[54]  Yan Su,et al.  Loop closure detection for visual SLAM systems using convolutional neural network , 2017, 2017 23rd International Conference on Automation and Computing (ICAC).

[55]  Shilin Zhou,et al.  Convolutional neural network-based image representation for visual loop closure detection , 2015, 2015 IEEE International Conference on Information and Automation.

[56]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Alberto Ortiz,et al.  iBoW-LCD: An Appearance-Based Loop-Closure Detection Approach Using Incremental Bags of Binary Words , 2018, IEEE Robotics and Automation Letters.

[58]  Bernd Girod,et al.  Inverted Index Compression for Scalable Image Matching , 2010, 2010 Data Compression Conference.

[59]  Yahya Zweiri,et al.  Feature-based visual simultaneous localization and mapping: a survey , 2020, SN Applied Sciences.

[60]  John J. Leonard,et al.  Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age , 2016, IEEE Transactions on Robotics.

[61]  Thomas B. Schön,et al.  Learning to close loops from range data , 2011, Int. J. Robotics Res..

[62]  Limin Wang,et al.  Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice , 2014, Comput. Vis. Image Underst..

[63]  Javier Civera,et al.  Condition-Invariant Multi-View Place Recognition , 2019, ArXiv.

[64]  Javier Civera,et al.  Single-View Place Recognition under Seasonal Changes , 2018, ArXiv.

[65]  Charles K. Toth,et al.  CNN-BASED PLACE RECOGNITION TECHNIQUE FOR LIDAR SLAM , 2020 .

[66]  Tomohiro Shibata,et al.  High performance loop closure detection using bag of word pairs , 2016, Robotics Auton. Syst..

[67]  Tao Zhang,et al.  Unsupervised learning to detect loops using deep neural networks for visual SLAM system , 2017, Auton. Robots.

[68]  U SaputraMuhamad Risqi,et al.  Visual SLAM and Structure from Motion in Dynamic Environments , 2018 .

[69]  Michael Milford,et al.  Place Recognition with ConvNet Landmarks: Viewpoint-Robust, Condition-Robust, Training-Free , 2015, Robotics: Science and Systems.

[70]  Tao Zhang,et al.  Loop closure detection for visual SLAM systems using deep neural networks , 2015, 2015 34th Chinese Control Conference (CCC).

[71]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[72]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[73]  Michael Bosse,et al.  Place recognition using keypoint voting in large 3D lidar datasets , 2013, 2013 IEEE International Conference on Robotics and Automation.

[74]  Michael Bosse,et al.  Keypoint design and evaluation for place recognition in 2D lidar maps , 2009, Robotics Auton. Syst..

[75]  Paul Newman,et al.  Appearance-only SLAM at large scale with FAB-MAP 2.0 , 2011, Int. J. Robotics Res..

[76]  Andreas Nüchter,et al.  Automatic appearance‐based loop detection from three‐dimensional laser data using the normal distributions transform , 2009, J. Field Robotics.

[77]  David Nistér,et al.  Scalable Recognition with a Vocabulary Tree , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[78]  Andrew Markham,et al.  A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence , 2020, ArXiv.

[79]  Jie Li,et al.  Loop closure detection for visual SLAM using PCANet features , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[80]  Ian D. Reid,et al.  Article in Press Robotics and Autonomous Systems ( ) – Robotics and Autonomous Systems a Comparison of Loop Closing Techniques in Monocular Slam , 2022 .

[81]  John J. Leonard,et al.  Temporally scalable visual SLAM using a reduced pose graph , 2013, 2013 IEEE International Conference on Robotics and Automation.

[82]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[83]  Tao Zhang,et al.  Semi-direct monocular visual and visual-inertial SLAM with loop closure detection , 2019, Robotics Auton. Syst..

[84]  Vladlen Koltun,et al.  Fast Global Registration , 2016, ECCV.

[85]  Niko Sünderhauf,et al.  On the performance of ConvNet features for place recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[86]  Dirk Schulz,et al.  A fast histogram-based similarity measure for detecting loop closures in 3-D LIDAR data , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[87]  Jean-Arcady Meyer,et al.  Fast and Incremental Method for Loop-Closure Detection Using Bags of Visual Words , 2008, IEEE Transactions on Robotics.

[88]  Eduardo Mario Nebot,et al.  Scan-SLAM: Combining EKF-SLAM and Scan Correlation , 2005, FSR.

[89]  Jun Cheng,et al.  Loop Closure Detection Based on Improved Hybrid Deep Learning Architecture , 2019, 2019 IEEE International Conferences on Ubiquitous Computing & Communications (IUCC) and Data Science and Computational Intelligence (DSCI) and Smart Computing, Networking and Services (SmartCNS).

[90]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[91]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[92]  Ian D. Reid,et al.  Automatic Relocalization and Loop Closing for Real-Time Monocular SLAM , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[93]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[94]  Wolfram Burgard,et al.  A Tutorial on Graph-Based SLAM , 2010, IEEE Intelligent Transportation Systems Magazine.

[95]  Roland Siegwart,et al.  A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation , 2011, CVPR 2011.

[96]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[97]  Muhammad Sualeh,et al.  Simultaneous Localization and Mapping in the Epoch of Semantics: A Survey , 2018, International Journal of Control, Automation and Systems.

[98]  Dong Ye,et al.  Compressed Holistic ConvNet Representations for Detecting Loop Closures in Dynamic Environments , 2020, IEEE Access.

[99]  Sven Hellbach,et al.  Large scale place recognition in 2D LIDAR scans using Geometrical Landmark Relations , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[100]  Hannes Sommer,et al.  SegMap: Segment-based mapping and localization using data-driven descriptors , 2019, Int. J. Robotics Res..

[101]  Roland Siegwart,et al.  Robust Visual Place Recognition with Graph Kernels , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[102]  Michal Nowicki,et al.  Comparison of camera-based and 3D LiDAR-based loop closures across weather conditions , 2020, ArXiv.

[103]  Gim Hee Lee,et al.  PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[104]  Markus Vincze,et al.  Ensemble of shape functions for 3D object classification , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[105]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[106]  Hexin Zhang,et al.  A Loop Closure Detection Algorithm in Dynamic Scene , 2018, DEStech Transactions on Computer Science and Engineering.

[107]  Juan D. Tardós,et al.  Fast relocalisation and loop closing in keyframe-based SLAM , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[108]  Ian D. Reid,et al.  An image-to-map loop closing method for monocular SLAM , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[109]  Paul Newman,et al.  FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[110]  Baifan Chen,et al.  Loop Closure Detection Based on Multi-Scale Deep Feature Fusion , 2019, Applied Sciences.

[111]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[112]  Andreas Nüchter,et al.  Automatic Appearance-Based Loop Detection from 3 D Laser Data Using the Normal Distributions Transform , 2009 .

[113]  Dorian Gálvez-López,et al.  Bags of Binary Words for Fast Place Recognition in Image Sequences , 2012, IEEE Transactions on Robotics.

[114]  Renaud Dubé,et al.  SegMap: 3D Segment Mapping using Data-Driven Descriptors , 2018, Robotics: Science and Systems.

[115]  Sheng Li,et al.  Loop Closure Detection for Visual SLAM Fusing Semantic Information , 2019, 2019 Chinese Control Conference (CCC).

[116]  Juan D. Tardós,et al.  ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[117]  Patrick Rives,et al.  An Efficient Direct Approach to Visual SLAM , 2008, IEEE Transactions on Robotics.

[118]  Jayender Jagadeesan,et al.  Re-weighting and 1-Point RANSAC-Based P$n$nP Solution to Handle Outliers , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[119]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[120]  Grzegorz Cielniak,et al.  Semantically Assisted Loop Closure in SLAM Using NDT Histograms , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[121]  Paolo Valigi,et al.  Robust visual semi-semantic loop closure detection by a covisibility graph and CNN features , 2017, Robotics Auton. Syst..

[122]  Cyrill Stachniss,et al.  RangeNet ++: Fast and Accurate LiDAR Semantic Segmentation , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[123]  Damien Vivet,et al.  A Review of Visual-LiDAR Fusion based Simultaneous Localization and Mapping , 2020, Sensors.

[124]  Bertrand Douillard,et al.  On the segmentation of 3D LIDAR point clouds , 2011, 2011 IEEE International Conference on Robotics and Automation.

[125]  Hujun Bao,et al.  Robust monocular SLAM in dynamic environments , 2013, 2013 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).

[126]  Simon Lacroix,et al.  Loop closure detection using small-sized signatures from 3D LIDAR data , 2011, 2011 IEEE International Symposium on Safety, Security, and Rescue Robotics.

[127]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[128]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[129]  Dirk Wollherr,et al.  IBuILD: Incremental bag of Binary words for appearance based loop closure detection , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[130]  Hesheng Wang,et al.  Depth estimation for deformable object using a multi-layer neural network , 2017, 2017 IEEE International Conference on Real-time Computing and Robotics (RCAR).

[131]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[132]  Cyrill Stachniss,et al.  OverlapNet: Loop Closing for LiDAR-based SLAM , 2020, Robotics: Science and Systems.