INDOOR LIDAR RELOCALIZATION BASED ON DEEP LEARNING USING A 3D MODEL

Abstract. Indoor localization, navigation and mapping systems rely heavily on initial sensor pose information to achieve high accuracy. Most existing indoor mapping and navigation systems cannot initialize the sensor pose automatically, and consequently they cannot relocalize or recover from a pose estimation failure. For many indoor environments, a map or a 3D model is available and can provide useful information for relocalization. This paper presents a novel relocalization method for lidar sensors in indoor environments that estimates the initial lidar pose using a CNN pose regression network trained on data derived from a 3D model. A set of synthetic lidar frames with known poses is generated from the 3D model. Each synthetic lidar frame is represented as a one-channel range image and used to train the CNN pose regression network from scratch to predict the initial sensor location and orientation. The network trained on synthetic range images is then used to estimate the lidar pose from real range images captured in the indoor environment. The results show that the proposed CNN regression network can learn from synthetic lidar data and estimate the pose of real lidar data with an accuracy of 1.9 m and 8.7 degrees.
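To make the one-channel range image representation concrete, the following is a minimal sketch of a spherical projection from a lidar point cloud to a range image. It is not the paper's implementation: the function name points_to_range_image, the image size (16 x 360), the vertical field of view (+/-15 degrees), and the 30 m maximum range are all assumptions chosen for illustration.

```python
import numpy as np

def points_to_range_image(points, height=16, width=360,
                          fov_up_deg=15.0, fov_down_deg=-15.0,
                          max_range=30.0):
    """Project an (N, 3) point cloud in the sensor frame to a
    one-channel range image of shape (height, width).

    All default parameters are illustrative assumptions, not values
    taken from the paper.
    """
    points = np.asarray(points, dtype=np.float64).reshape(-1, 3)
    ranges = np.linalg.norm(points, axis=1)

    # Discard points at the origin or beyond the assumed maximum range.
    valid = (ranges > 0.0) & (ranges <= max_range)
    points, ranges = points[valid], ranges[valid]

    # Horizontal angle in [-pi, pi] and vertical angle in [-pi/2, pi/2].
    azimuth = np.arctan2(points[:, 1], points[:, 0])
    elevation = np.arcsin(np.clip(points[:, 2] / ranges, -1.0, 1.0))

    fov_up = np.radians(fov_up_deg)
    fov_down = np.radians(fov_down_deg)

    # Columns cover the full 360 degrees; row 0 is the top of the field of view.
    cols = np.floor((azimuth + np.pi) / (2.0 * np.pi) * width).astype(int) % width
    rows = np.floor((fov_up - elevation) / (fov_up - fov_down) * height).astype(int)

    # Keep only points inside the vertical field of view.
    in_fov = (rows >= 0) & (rows < height)
    rows, cols, ranges = rows[in_fov], cols[in_fov], ranges[in_fov]

    # When several points fall into one pixel, keep the nearest return.
    image = np.full((height, width), np.inf)
    np.minimum.at(image, (rows, cols), ranges)
    image[np.isinf(image)] = 0.0  # pixels with no return are set to 0

    # Normalize to [0, 1] so the image can be fed to a CNN.
    return (image / max_range).astype(np.float32)
```

Under these assumptions, the same projection would be applied both to synthetic frames simulated from the 3D model and to real lidar frames, so that the network sees inputs of identical shape and scale during training and inference.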
