论文信息 - Topometric Localization with Deep Learning

Topometric Localization with Deep Learning

Compared to LiDAR-based localization methods, which provide high accuracy but rely on expensive sensors, visual localization approaches only require a camera and thus are more cost-effective while their accuracy and reliability typically is inferior to LiDAR-based methods. In this work, we propose a vision-based localization approach that learns from LiDAR-based localization methods by using their output as training data, thus combining a cheap, passive sensor with an accuracy that is on-par with LiDAR-based localization. The approach consists of two deep networks trained on visual odometry and topological localization, respectively, and a successive optimization to combine the predictions of these two networks. We evaluate the approach on a new challenging pedestrian-based dataset captured over the course of six months in varying weather conditions with a high degree of noise. The experiments demonstrate that the localization errors are up to 10 times smaller than with traditional vision-based localization methods.

[1] Debashish Chakravarty,et al. DeepVO: A Deep Learning approach for Monocular Visual Odometry , 2016, ArXiv.

[2] W. Cleveland. Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[3] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Geoffrey E. Hinton,et al. On the importance of initialization and momentum in deep learning , 2013, ICML.

[5] Bolei Zhou,et al. Places: An Image Database for Deep Scene Understanding , 2016, ArXiv.

[6] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7] Roland Memisevic,et al. Learning Visual Odometry with a Convolutional Network , 2015, VISAPP.

[8] Tomás Pajdla,et al. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Sepp Hochreiter,et al. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[10] Wolfram Burgard,et al. Improved non-linear spline fitting for teaching trajectories to mobile robots , 2012, 2012 IEEE International Conference on Robotics and Automation.

[11] Wolfram Burgard,et al. Autonomous Robot Navigation in Highly Populated Pedestrian Zones , 2015, J. Field Robotics.

[12] Wolfram Burgard,et al. Lidar-based teach-and-repeat of mobile robot trajectories , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13] Luis Miguel Bergasa,et al. Fusion and binarization of CNN features for robust topological localization across seasons , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[14] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Emilio Garcia-Fidalgo,et al. Vision-based topological mapping and localization methods: A survey , 2015, Robotics Auton. Syst..

[17] Cordelia Schmid,et al. Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18] Peter I. Corke,et al. Visual Place Recognition: A Survey , 2016, IEEE Transactions on Robotics.

[19] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20] Xiaolin Hu,et al. Delving deeper into convolutional neural networks for camera relocalization , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[21] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] Takeo Kanade,et al. Visual topometric localization , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[23] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[24] Roberto Cipolla,et al. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[25] Geoffrey A. Hollinger,et al. Deep Learning for Laser Based Odometry Estimation , 2016 .

[26] Wolfram Burgard,et al. Relative Topometric Localization in Globally Inconsistent Maps , 2017, ISRR.

[27] Esa Rahtu,et al. Relative Camera Pose Estimation Using Convolutional Neural Networks , 2017, ACIVS.

[28] Michael Milford,et al. Convolutional Neural Network-based Place Recognition , 2014, ICRA 2014.

[29] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.