LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition

Retrieval-based place recognition is an efficient and effective solution for enabling re-localization within a prebuilt map or global data association for Simultaneous Localization and Mapping (SLAM). The accuracy of such an approach is heavily dependant on the quality of the extracted scenelevel representation. While end-to-end solutions, which learn a global descriptor from input point clouds, have demonstrated promising results, such approaches are limited in their ability to enforce desirable properties at the local feature level. In this paper, we demonstrate that the inclusion of an additional training signal (local consistency loss) can guide the network to learning local features which are consistent across revisits, hence leading to more repeatable global descriptors resulting in an overall improvement in place recognition performance. We formulate our approach in an end-to-end trainable architecture called LoGG3D-Net. Experiments on two large-scale public benchmarks (KITTI and MulRan) show that our method achieves mean F1max scores of 0.939 and 0.968 on KITTI and MulRan, respectively while operating in near real-time.

[1]  Sridha Sridharan,et al.  Elasticity Meets Continuous-Time: Map-Centric Dense 3D LiDAR SLAM , 2020, IEEE Transactions on Robotics.

[2]  Adrian Penate-Sanchez,et al.  Learning to See the Wood for the Trees: Deep Laser Localization in Urban and Natural Environments on a CPU , 2019, IEEE Robotics and Automation Letters.

[3]  Hannes Sommer,et al.  SegMap: Segment-based mapping and localization using data-driven descriptors , 2019, Int. J. Robotics Res..

[4]  Jacek Komorowski,et al.  MinkLoc3D: Point Cloud Based Large-Scale Place Recognition , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[5]  Sridha Sridharan,et al.  Elastic LiDAR Fusion: Dense Map-Centric Continuous-Time SLAM , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[6]  Vladlen Koltun,et al.  Fully Convolutional Geometric Features , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[7]  Hesheng Wang,et al.  LPD-Net: 3D Point Cloud Learning for Large-Scale Place Recognition and Environment Analysis , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[8]  Ayoung Kim,et al.  Scan Context: Egocentric Spatial Descriptor for Place Recognition Within 3D Point Cloud Map , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Piotr Koniusz,et al.  Power Normalizations in Fine-Grained Image, Few-Shot Image and Graph Classification , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[11]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Giorgos Tolias,et al.  Fine-Tuning CNN Image Retrieval with No Human Annotation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jinyong Jeong,et al.  MulRan: Multimodal Range Dataset for Urban Place Recognition , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Abel Gawel,et al.  Local Descriptor for Robust Place Recognition Using LiDAR Intensity , 2018, IEEE Robotics and Automation Letters.

[15]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[16]  Federico Tombari,et al.  SHOT: Unique signatures of histograms for surface and texture description , 2014, Comput. Vis. Image Underst..

[17]  Kaiqi Huang,et al.  Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Peter I. Corke,et al.  Visual Place Recognition: A Survey , 2016, IEEE Transactions on Robotics.

[19]  Li He,et al.  M2DP: A novel 3D point cloud descriptor and its application in loop closure detection , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20]  Krystian Mikolajczyk,et al.  Higher-Order Occurrence Pooling for Bags-of-Words: Visual Concept Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Song Han,et al.  Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution , 2020, ECCV.

[22]  Gim Hee Lee,et al.  PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  Qilong Wang,et al.  Is Second-Order Information Helpful for Large-Scale Visual Recognition? , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[24]  Sridha Sridharan,et al.  Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling , 2020, ArXiv.

[25]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).