Domain-invariant NBV Planner for Active Cross-domain Self-localization

Pole-like landmarks have received increasing attention as a domain-invariant visual cue for visual robot self-localization across domains (e.g., seasons, times of day, and weather conditions). However, self-localization using pole-like landmarks can be ill-posed for a passive observer, as many viewpoints may not provide any pole-like landmark view. To alleviate this problem, we consider an active observer and explore a novel "domain-invariant" next-best-view (NBV) planner that attains consistent performance across domains (i.e., maintenance-free), without requiring the expensive tasks of training-data collection and retraining. In our approach, a novel multi-encoder deep convolutional neural network detects domain-invariant pole-like landmarks, which are then used as the sole input to a model-free deep reinforcement learning (DRL)-based domain-invariant NBV planner. Furthermore, we develop a practical system for active self-localization that combines sparse invariant landmarks with dense discriminative landmarks. Experiments demonstrate that the proposed method is effective in both efficient landmark detection and discriminative self-localization.
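To make the planner component concrete: a model-free DRL NBV planner of the kind described above maps a landmark observation to a score for each candidate viewpoint and greedily selects the best one. The sketch below is a minimal, hypothetical illustration (not the authors' network): the observation dimension, number of viewpoint candidates, and the dueling-style value/advantage decomposition Q(s,a) = V(s) + A(s,a) - mean_a A(s,a) are all assumptions, and randomly initialized weights stand in for a trained model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: the observation is a fixed-length vector
# summarizing detected pole-like landmarks (e.g., bearings/ranges),
# and actions are discrete candidate viewpoints.
OBS_DIM = 16      # assumed landmark-feature size
N_ACTIONS = 8     # assumed number of candidate viewpoints

# Randomly initialized weights stand in for a trained network.
W_shared = rng.normal(scale=0.1, size=(OBS_DIM, 32))
W_value = rng.normal(scale=0.1, size=(32, 1))
W_adv = rng.normal(scale=0.1, size=(32, N_ACTIONS))

def q_values(obs):
    """Dueling-style Q estimate: Q(s,a) = V(s) + A(s,a) - mean_a A(s,a)."""
    h = np.maximum(0.0, obs @ W_shared)   # shared ReLU feature layer
    v = h @ W_value                       # state value, shape (1,)
    a = h @ W_adv                         # per-action advantages, shape (N_ACTIONS,)
    return v + (a - a.mean())             # broadcasts to shape (N_ACTIONS,)

def next_best_view(obs):
    """Greedy NBV selection over the discrete viewpoint candidates."""
    return int(np.argmax(q_values(obs)))

obs = rng.normal(size=OBS_DIM)            # stand-in landmark observation
action = next_best_view(obs)
assert 0 <= action < N_ACTIONS
```

Subtracting the mean advantage makes the value/advantage split identifiable, which is the motivation behind dueling network architectures; in a full system these weights would be trained with experience replay rather than fixed at random.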