Monocular camera relocalization in 3D point-cloud dense map

Due to the low-cost, lightweight and widely available, approaches that use a single monocular camera to relocalize pose in 3D point-cloud dense map has gained significant interests in a wide range of applications. Relocalizing camera in a given map is of crucial importance for vision-based autonomous navigation. When the robot run in a new map or tracks lost, the robot's global localization cannot be obtained. We propose a novel approach, which relocalizes the pose of monocular camera with respect to a prebuilt dense map. A common monocular visual odometry system is employed to reconstruct a sparse set of 3D points based on local bundle adjustment. These reconstructed points are matched against the dense map to get the camera global pose with the particle filter (PF) algorithm and iterative closest point (ICP) algorithm. The particles state represents the potential initial pose. The ICP alignment result is used to update particles pose and important weight. Meanwhile, adaptive resampling is used to approximate the distribution of pose. Our monocular camera relocalization approach has several advantages. The improved PF algorithm is utilized to tackle the local convergence of ICP alignment. And our approach only relies on matching geometry between local reconstruction and dense map, it is robust to photometric appearance variance in the environment. In addition, the approach also can estimate the metric scale, which cannot be recovered from monocular visual odometry. We present real-world experiments demonstrating that our approach can relocalize the monocular camera and accurately estimate the metric scale in a given dense map. The results verify the effectiveness and robustness of this algorithm.

[1]  Dongxiao Li,et al.  Accurate RGB camera relocalization using regression forest , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[2]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[3]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[4]  Dorian Gálvez-López,et al.  Bags of Binary Words for Fast Place Recognition in Image Sequences , 2012, IEEE Transactions on Robotics.

[5]  Daniel Cremers,et al.  Direct Sparse Odometry , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Shaojie Shen,et al.  Relocalization, Global Optimization and Map Merging for Monocular Visual-Inertial SLAM , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Li Wang,et al.  Research on service robots robust relocalization algorithm based on 2D/3D map of indoor environment , 2017, 2017 18th International Conference on Advanced Robotics (ICAR).

[8]  Shaojie Shen,et al.  VINS-Mono: A Robust and Versatile Monocular Visual-Inertial State Estimator , 2017, IEEE Transactions on Robotics.

[9]  Daniel Cremers,et al.  LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[10]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Andrew W. Fitzgibbon,et al.  Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Hongbin Zha,et al.  PSDF Fusion: Probabilistic Signed Distance Function for On-the-fly 3D Data Fusion and Scene Reconstruction , 2018, ECCV.

[13]  Davide Scaramuzza,et al.  SVO: Fast semi-direct monocular visual odometry , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[14]  José Ruíz Ascencio,et al.  Visual simultaneous localization and mapping: a survey , 2012, Artificial Intelligence Review.

[15]  Juan D. Tardós,et al.  Visual-Inertial Monocular SLAM With Map Reuse , 2016, IEEE Robotics and Automation Letters.

[16]  Roland Siegwart,et al.  BRISK: Binary Robust invariant scalable keypoints , 2011, 2011 International Conference on Computer Vision.

[17]  Wolfram Burgard,et al.  Monocular camera localization in 3D LiDAR maps , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Roland Siegwart,et al.  Real-time visual-inertial mapping, re-localization and planning onboard MAVs in unknown environments , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[19]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[20]  Jizhong Xiao,et al.  6-DoF pose localization in 3D point-cloud dense maps using a monocular camera , 2013, 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[21]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[22]  Jörg Stückler,et al.  Keyframe-based visual-inertial online SLAM with relocalization , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.