Viewpoint-Invariant Loop Closure Detection Using Step-Wise Learning With Controlling Embeddings of Landmarks

Loop closure detection has been studied for many years, yet its reliability is still limited by factors such as illumination, weather, season, and viewpoint changes. Detecting loop closures across different viewpoints in particular remains an open research problem. This paper proposes an efficient solution to loop closure detection across viewpoints that uses landmarks instead of whole frames and deep learned features instead of handcrafted ones. A step-wise learning approach, different from traditional training, is used to train a deep CNN to produce highly abstract embeddings of input landmarks. This approach has advantages over traditional training and is applicable to other complex problems. Here it is used to force close embeddings for similar landmarks and a large gap between the embeddings of different landmarks. The proposed method targets viewpoint-invariant features, and the learned deep features are robust to viewpoint changes, occlusions, and illumination variations. The proposed visual SLAM system is tested on six publicly available datasets, and the results are compared with popular Bag of Words methods such as DBoW2 and DBoW3 and with the state-of-the-art deep learning methods AlexNet, ResNeXt, FlyNet, AFDPR, and a Siamese network. The results show that our method is efficient at finding loop closure candidates from different viewpoints. Code is available at https://github.com/IRMVLab/Step-wise-Learning.
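
The abstract does not spell out the training objective, so the following is only an illustrative sketch of the general idea of pulling embeddings of similar landmarks together and pushing embeddings of different landmarks apart. It uses a standard margin-based contrastive loss as a stand-in, which is an assumption and not the paper's step-wise learning procedure. The function names, the `margin` value, and the toy embeddings are all hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each embedding (row) to unit length."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def contrastive_loss(emb_a, emb_b, same, margin=1.0):
    """Margin-based contrastive loss over pairs of landmark embeddings.

    emb_a, emb_b: arrays of shape (n_pairs, dim).
    same: array of shape (n_pairs,), 1 for the same landmark, 0 otherwise.
    Pairs of the same landmark are penalised by their squared distance;
    pairs of different landmarks are penalised only while their distance
    is smaller than `margin` (an assumed hyperparameter).
    """
    same = np.asarray(same, dtype=float)
    dist = np.linalg.norm(emb_a - emb_b, axis=-1)
    pull = same * dist ** 2
    push = (1.0 - same) * np.maximum(margin - dist, 0.0) ** 2
    return float(np.mean(pull + push))


# Toy example: pair 0 is the same landmark, pair 1 is two different landmarks.
emb_a = l2_normalize(np.array([[2.0, 0.0], [0.0, 3.0]]))
emb_b = l2_normalize(np.array([[4.0, 0.0], [0.0, -1.0]]))
loss = contrastive_loss(emb_a, emb_b, same=[1, 0])
print(loss)
```

In this toy case the matching pair coincides after normalisation and the non-matching pair is already farther apart than the margin, so neither term contributes to the loss. A loss of this kind would only be one possible way to obtain the "close for similar, far for different" behaviour the abstract describes.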
