PDQ-Net: Deep probabilistic dual quaternion network for absolute pose regression on SE(3)

Accurate absolute pose regression is one of the key challenges in robotics and computer vision. Existing direct regression methods suffer from two limitations. First, some noisy scenarios such as poor illumination conditions are likely to result in the uncertainty of pose estimation. Second, the output n-dimensional feature vector in the Euclidean space R n cannot be well mapped to SE (3) manifold. In this work, we propose a deep dual quaternion network that performs the absolute pose regression on SE (3) . We first develop an antipodally symmetric probability distribution over the unit dual quaternion on SE (3) to model uncertainties and then propose an intermediary differential representation space to replace the final output pose, which avoids the mapping problem from R n to SE (3) . In addi-tion, we introduce a backpropagation method that considers the continuousness and differentiability of the proposed intermediary space. Extensive experiments on the camera re-localization task on the Cambridge Landmarks and 7-Scenes datasets demonstrate that our method greatly improves the accuracy of the pose as well as the robustness in dealing with uncertainty and ambiguity, compared to the state-of-the-art.

[1]  Noah Snavely,et al.  Wide-Baseline Relative Camera Pose Estimation with Directional Learning , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Eric Brachmann,et al.  Visual Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision , 2021, 2021 International Conference on 3D Vision (3DV).

[3]  Uwe D. Hanebeck,et al.  Unscented Dual Quaternion Particle Filter for SE(3) Estimation , 2021, IEEE Control Systems Letters.

[4]  Nassir Navab,et al.  Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation , 2020, International Journal of Computer Vision.

[5]  Carlos Campos,et al.  ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual–Inertial, and Multimap SLAM , 2020, IEEE Transactions on Robotics.

[6]  G. Bianchi,et al.  Probabilistic orientation estimation with matrix Fisher distributions , 2020, NeurIPS.

[7]  Igor Gilitschenski,et al.  Deep Orientation Uncertainty Learning based on a Bingham Loss , 2020, ICLR.

[8]  Slobodan Ilic,et al.  6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference , 2020, ECCV.

[9]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[10]  Stephen Gould,et al.  Deep Declarative Networks , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Timothy Bretl,et al.  PoseRBPF: A Rao-Blackwellized Particle Filter for6D Object Pose Estimation , 2019, Robotics: Science and Systems.

[12]  Luca Carlone,et al.  A Quaternion-Based Certifiably Optimal Solution to the Wahba Problem With Outliers , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[13]  Hongbin Zha,et al.  Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Torsten Sattler,et al.  Understanding the Limitations of CNN-Based Absolute Camera Pose Regression , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Yi Zhou,et al.  On the Continuity of Rotation Representations in Neural Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Serge J. Belongie,et al.  Deep Fundamental Matrix Estimation without Correspondences , 2018, ECCV Workshops.

[17]  Sen Wang,et al.  End-to-end, sequence-to-sequence probabilistic visual odometry through deep neural networks , 2018, Int. J. Robotics Res..

[18]  Jan Kautz,et al.  Geometry-Aware Learning of Maps for Camera Localization , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[19]  Xiaolin Hu,et al.  Delving deeper into convolutional neural networks for camera relocalization , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[20]  Sen Wang,et al.  VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Juan D. Tardós,et al.  ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[22]  R. Cipolla,et al.  Modelling uncertainty in deep learning for camera relocalization , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[24]  Roberto Cipolla,et al.  Convolutional networks for real-time 6-DOF camera relocalization , 2015, ArXiv.

[25]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[26]  Gerhard Kurz,et al.  A new probability distribution for simultaneous representation of uncertain position and orientation , 2014, 17th International Conference on Information Fusion (FUSION).

[27]  Andrew W. Fitzgibbon,et al.  Scene Coordinate Regression Forests for Camera Relocalization in RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Guillaume Leclercq,et al.  3D kinematics using dual quaternions: theory and applications in neuroscience , 2013, Front. Behav. Neurosci..

[29]  J. Magnus On Differentiating Eigenvalues and Eigenvectors , 1985, Econometric Theory.

[30]  Christopher Bingham An Antipodally Symmetric Distribution on the Sphere , 1974 .

[31]  Ryan J. Buchanan On Configuration Space , 2023 .