2-Entity Random Sample Consensus for Robust Visual Localization: Framework, Methods, and Verifications

Robust and efficient visual localization is essential for numerous robotic applications. However, it remains a challenging problem especially when significant environmental or perspective changes present, as there are high percentage of outliers, i.e. incorrect feature matches, between the query image and the map. In this paper, we propose a novel 2-entity RANSAC framework using 3D-2D point and line feature matches for visual localization with the aid of inertial measurements and derive minimal closed form solutions using only 1 point 1 line or 2 point matches for both monocular and multi-camera system. The proposed 2-entity RANSAC can achieve higher robustness against outliers as multiple types of features are utilized and the number of matches needed to compute a pose is reduced. Furthermore, we propose a learning-based sampling strategy selection mechanism and a feature scoring network to be adaptive to different environmental characteristics such as structured and unstructured. Finally, both simulation and real-world experiments are performed to validate the robustness and effectiveness of the proposed method in scenarios with long-term and perspective changes $^{1}$ .

[1]  Bo Fu,et al.  2-Entity RANSAC for robust visual localization in changing environment , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Julius Ziegler,et al.  StereoScan: Dense 3d reconstruction in real-time , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[3]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[4]  Roland Siegwart,et al.  A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation , 2011, CVPR 2011.

[5]  Juan D. Tardós,et al.  Visual-Inertial Monocular SLAM With Map Reuse , 2016, IEEE Robotics and Automation Letters.

[6]  Michel Dhome,et al.  Determination of the Attitude of 3D Objects from a Single Perspective View , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Xiaqing Ding,et al.  Laser Map Aided Visual Inertial Localization in Changing Environment , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[9]  Marc Pollefeys,et al.  A Minimal Case Solution to the Calibrated Relative Pose Problem for the Case of Two Known Orientation Angles , 2010, ECCV.

[10]  Gim Hee Lee A Minimal Solution for Non-perspective Pose Estimation from Line Correspondences , 2016, ECCV.

[11]  Paul Newman,et al.  Made to measure: Bespoke landmarks for 24-hour, all-weather localisation with a camera , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Roland Siegwart,et al.  Robust Real-Time Visual Odometry with a Single Camera and an IMU , 2011, BMVC.

[13]  Roland Siegwart,et al.  Using multi-camera systems in robotics: Efficient solutions to the NPnP problem , 2013, 2013 IEEE International Conference on Robotics and Automation.

[14]  Laurent Kneip,et al.  OpenGV: A unified and generalized approach to real-time calibrated geometric vision , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Jianliang Tang,et al.  Complete Solution Classification for the Perspective-Three-Point Problem , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Peter F. Sturm,et al.  Pose estimation using both points and lines for geo-localization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[17]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[18]  Torsten Sattler,et al.  Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[19]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[20]  Woojin Chung,et al.  Localization of a Mobile Robot Using a Laser Range Finder in a Glass-Walled Environment , 2016, IEEE Transactions on Industrial Electronics.

[21]  Michel Antunes,et al.  Plane-based Odometry using an RGB-D Camera , 2013, BMVC.

[22]  Sudeep Sarkar,et al.  BLOGS: Balanced local and global search for non-degenerate two view epipolar geometry , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[23]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[24]  Ilan Shimshoni,et al.  Balanced Exploration and Exploitation Model Search for Efficient Epipolar Geometry Estimation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[26]  Hongdong Li,et al.  UPnP: An Optimal O(n) Solution to the Absolute Pose Problem with Universal Applicability , 2014, ECCV.

[27]  V. Lepetit,et al.  EPnP: An Accurate O(n) Solution to the PnP Problem , 2009, International Journal of Computer Vision.

[28]  Fadi Dornaika,et al.  Pose Estimation using Point and Line Correspondences , 1999, Real Time Imaging.

[29]  Xiaqing Ding,et al.  Topological local-metric framework for mobile robots navigation: a long term perspective , 2018, Autonomous Robots.

[30]  Shengyong Chen,et al.  Intelligent Collaborative Localization Among Air-Ground Robots for Industrial Environment Perception , 2019, IEEE Transactions on Industrial Electronics.

[31]  Zuzana Kukelova,et al.  Closed-Form Solutions to Minimal Absolute Pose Problems with Known Vertical Direction , 2010, ACCV.

[32]  Pedro Miraldo,et al.  A Minimal Closed-Form Solution for Multi-Perspective Pose Estimation using Points and Lines , 2018, ECCV.

[33]  Max A. Viergever,et al.  A deep learning framework for unsupervised affine and deformable image registration , 2018, Medical Image Anal..

[34]  Roland Siegwart,et al.  Dense visual-inertial navigation system for mobile robots , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[35]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[36]  Jiang Liu,et al.  A low-dimensional step pattern analysis algorithm with application to multimodal retinal image registration , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Peng Liu,et al.  A Deep Step Pattern Representation for Multimodal Retinal Image Registration , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).