Boosting Self-localization with Graph Convolutional Neural Networks

Scene graph representation has recently attracted attention as a flexible and descriptive representation for visual robot self-localization. In a typical self-localization application, the objects, object features, and object relationships of the environment map are represented as nodes, node features, and edges, respectively, of a scene graph, which is then matched against a query scene graph using a graph matching engine. However, the computational, storage, and communication overhead of such a system grows in direct proportion to the dimensionality of the graph node features, which is often significant in large-scale applications. In this study, we demonstrate the feasibility of using a graph convolutional neural network (GCN) for training and prediction alongside a graph matching engine. However, visual features often do not translate well into graph features in modern graph convolution models, which degrades their performance. We therefore developed a novel knowledge transfer framework that introduces an arbitrary self-localization model as the teacher to train the GCN-based self-localization system, i.e., the student. The framework additionally facilitates lightweight storage and communication by using the compact output signals from the teacher model as training data. Results on the Oxford RobotCar datasets show that the proposed method outperforms existing comparative methods and the teacher self-localization systems.
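The teacher-student idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the symmetric-normalized propagation rule, the mean pooling, the cross-entropy distillation loss, and all names and numbers (`gcn_layer`, `distillation_loss`, the toy graph, the weights, the teacher output) are assumptions chosen for illustration only.

```python
import math

def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2 for a dense adjacency matrix (list of lists)."""
    n = len(adj)
    # Add self-loops so every node keeps its own features during propagation.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Plain dense matrix product on lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(a_norm, feats, weight):
    """One graph convolution: ReLU(a_norm @ feats @ weight)."""
    h = matmul(matmul(a_norm, feats), weight)
    return [[max(0.0, v) for v in row] for row in h]

def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_probs):
    """Cross-entropy of the student's prediction against the teacher's soft targets."""
    student_probs = softmax(student_logits)
    return -sum(t * math.log(q) for t, q in zip(teacher_probs, student_probs))

# Toy scene graph (hypothetical): 3 objects in a chain, 2-D node features.
adj = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
weight = [[0.5, -0.2], [0.1, 0.3]]  # hypothetical weights for 2 place classes

hidden = gcn_layer(normalize_adjacency(adj), feats, weight)
# Mean-pool node embeddings into graph-level logits over place classes.
graph_logits = [sum(col) / len(hidden) for col in zip(*hidden)]
teacher_probs = [0.9, 0.1]  # hypothetical compact output signal from a teacher model
loss = distillation_loss(graph_logits, teacher_probs)
```

In this sketch the student never sees the teacher's internal features, only its low-dimensional output vector, which is one way to read the abstract's claim about lightweight storage and communication.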

Koji Takeda | Kanji Tanaka
