论文信息 - Towards A Deep Insight into Landmark-based Visual Place Recognition: Methodology and Practice

Towards A Deep Insight into Landmark-based Visual Place Recognition: Methodology and Practice

In this paper, we address the problem of landmark-based visual place recognition. In the state-of-the-art method, accurate object proposal algorithms are first leveraged for generating a set of local regions containing particular landmarks with high confidence. Then, these candidate regions are represented by deep features and pairwise matching is performed in an exhaustive manner for the similarity measure. Despite its success, conventional object proposal methods usually produce massive landmark-dependent image patches exhibiting significant distribution variance in scale and overlap. As a result, the inconsistency in landmark distributions tends to produce biased similarity between pairwise images yielding the suboptimal performance. In order to gain an insight into the landmark-based place recognition scheme, we conduct a comprehensive study in which the influence of landmark scales and the proportion of overlap on the recognition performance is explored. More specifically, we thoroughly study the exhaustive search based landmark matching mechanism, and thus derive three-fold important observations in terms of the beneficial effect of specific landmark generation strategies. Inspired by the above observations, a simple yet effective dense sampling based scheme is presented for accurate place recognition in this paper. Different from the conventional object proposal strategy, we generate local landmarks of multiple scales with uniform distribution from entire image by dense sampling, and subsequently perform multi-scale fusion on the densely sampled landmarks for similarity measure. The experimental results on three challenging datasets demonstrate that the recognition performance can be significantly improved by our efficient method in which the landmarks are appropriately produced for accurate pairwise matching.

Jun Li | Bo Yang | Hong Zhang | Xiaosu Xu

[1] Peter I. Corke,et al. Visual Place Recognition: A Survey , 2016, IEEE Transactions on Robotics.

[2] Henrik Andreasson,et al. Lightweight, Viewpoint-Invariant Visual Place Recognition in Changing Environments , 2018, IEEE Robotics and Automation Letters.

[3] Hong Zhang,et al. BoRF: Loop-closure detection with scale invariant visual features , 2011, 2011 IEEE International Conference on Robotics and Automation.

[4] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5] Wolfram Burgard,et al. Robust Visual Localization Across Seasons , 2018, IEEE Transactions on Robotics.

[6] Hong Zhang,et al. Fast-SeqSLAM: A fast appearance based place recognition algorithm , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[7] Matthieu Guillaumin,et al. Non-maximum Suppression for Object Detection by Passing Messages Between Windows , 2014, ACCV.

[8] Paul Newman,et al. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance , 2008, Int. J. Robotics Res..

[9] Niko Sünderhauf,et al. On the performance of ConvNet features for place recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10] Gordon Wyeth,et al. SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights , 2012, 2012 IEEE International Conference on Robotics and Automation.

[11] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[12] Hong Zhang,et al. Towards improving the efficiency of sequence-based SLAM , 2013, 2013 IEEE International Conference on Mechatronics and Automation.

[13] Larry S. Davis,et al. Soft-NMS — Improving Object Detection with One Line of Code , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[14] Svetlana Lazebnik,et al. Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.

[15] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16] Michael Milford,et al. Place Recognition with ConvNet Landmarks: Viewpoint-Robust, Condition-Robust, Training-Free , 2015, Robotics: Science and Systems.

[17] Shilin Zhou,et al. Evaluation of Object Proposals and ConvNet Features for Landmark-based Visual Place Recognition , 2018, J. Intell. Robotic Syst..

[18] Tony Lindeberg,et al. Scale Invariant Feature Transform , 2012, Scholarpedia.

[19] Niko Sünderhauf,et al. Superpixel-based appearance change prediction for long-term navigation across seasons , 2014, Robotics Auton. Syst..