A novel fusing semantic- and appearance-based descriptors for visual loop closure detection

Abstract Loop-closure detection plays an important role in visual simultaneous localisation and mapping (SLAM); it is a module independent of the visual odometry and can effectively reduce the odometry's accumulated error, in addition to helping with relocalisation. With the development of deep learning methods in recent years, convolutional neural network models trained on major datasets have been improved for loop-closure detection. Presently, some high-level engineering problems still rely on auxiliary equipment, such as panoramic cameras and laser radar, which adds considerable cost, because loop-closure detection that relies on two-dimensional images alone is not applicable under the extreme appearance and viewpoint changes involved in such problems. Based on the two nearest neighbour vector of locally aggregated descriptors (TNNVLAD) method, a novel feature descriptor called two nearest neighbour local semantic tensor (TNNLoST) is proposed herein by combining the semantic features of high-level neural networks with dense descriptors. This approach introduces a semantic concept of the surrounding environment similar to human cognition, thus enabling better understanding of the environment. The proposed method was evaluated on publicly available benchmark datasets to demonstrate its performance.
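
The abstract does not spell out how the descriptor is computed, so the following is only a rough sketch of the general idea of aggregating dense local descriptors around semantic classes, not the authors' implementation. It assumes (hypothetically) that each dense descriptor carries a semantic label, that the cluster centres are the per-class descriptor means, and that every descriptor adds its residual to its two nearest centres before the result is L2-normalised. The function names `class_means`, `two_nn_aggregate` and `cosine_similarity` are illustrative only.

```python
import math


def class_means(descriptors, labels, n_classes):
    """Mean descriptor per semantic class (assumed stand-in for cluster centres)."""
    d = len(descriptors[0])
    sums = [[0.0] * d for _ in range(n_classes)]
    counts = [0] * n_classes
    for x, s in zip(descriptors, labels):
        counts[s] += 1
        for t in range(d):
            sums[s][t] += x[t]
    # Classes with no descriptors keep an all-zero centre.
    return [[v / max(c, 1) for v in row] for row, c in zip(sums, counts)]


def two_nn_aggregate(descriptors, centres):
    """Accumulate residuals to each descriptor's two nearest centres, then L2-normalise."""
    k, d = len(centres), len(centres[0])
    acc = [[0.0] * d for _ in range(k)]
    for x in descriptors:
        # Rank centres by squared Euclidean distance to this descriptor.
        ranked = sorted(
            range(k),
            key=lambda j: sum((a - b) ** 2 for a, b in zip(x, centres[j])),
        )
        for j in ranked[:2]:  # assumption: the two nearest centres are used
            for t in range(d):
                acc[j][t] += x[t] - centres[j][t]
    flat = [v for row in acc for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


def cosine_similarity(u, v):
    """Cosine similarity between two image-level descriptors."""
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0 or nv == 0:
        return 0.0
    return sum(a * b for a, b in zip(u, v)) / (nu * nv)
```

In a loop-closure setting, two images would each be reduced to one such vector, and a high cosine similarity between the vectors would flag a loop-closure candidate for later verification. The threshold and the verification step are left open here because the abstract does not describe them.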
