Robust Online Calibration of LiDAR and Camera Based on Cross-Modal Graph Neural Network

Accurate spatial parameters of LiDAR and camera is a prerequisite for information consistency and robust online calibration is the foundation of long-term effective fusion in intelligent perception system. However, dynamic scene conditions and various hardware prior parameters pose great challenges to the robustness and generalization of existing models. To solve these problems, we propose an online calibration method based on a cross-modal graph neural network (GNN). In the data preprocessing stage, the influence of prior parameters on the inductive bias is reduced by a unified spherical space process strategy of 3-D points and 2-D pixels, which strengthens the generalization. In the graph network, the correlation of multiple windows inside the modal and the explicit correlation matrix across the modal are solved by modeling the robust matching process of human visual positioning. In multilevel graphic constraints, the precise relative position and orientation information is obtained by imposing nodes, edges, and embedding constraints on the graph structure. Extensive evaluations on KITTI and PandaSet suggest that the proposed method not only effectively improves the robustness in various scenes, but also enhances the generalization of the online calibration algorithm.

[1]  Ji-Heon Park,et al.  Drivable Dirt Road Region Identification Using Image and Point Cloud Semantic Segmentation Fusion , 2022, IEEE Transactions on Intelligent Transportation Systems.

[2]  Xiaowei Zhou,et al.  OnePose: One-Shot Object Pose Estimation without CAD Models , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Y. Aloimonos,et al.  DiffPoseNet: Direct Differentiable Camera Pose Estimation , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Xiaopei Wu,et al.  Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Zhiyong Zheng,et al.  Highly Robust Vehicle Lateral Localization Using Multilevel Robust Network , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[6]  Kun Jiang,et al.  PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving , 2021, 2021 IEEE International Intelligent Transportation Systems Conference (ITSC).

[7]  Edward Johns,et al.  AGO-Net: Association-Guided 3D Point Cloud Object Detection Network , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Dong Ye,et al.  LCCNet: LiDAR and Camera Self-Calibration using Cost Volume Network , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9]  Qingyong Hu,et al.  Learning Semantic Segmentation of Large-Scale Point Clouds With Random Sampling , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Martin R. Oswald,et al.  SOLD2: Self-supervised Occlusion-aware Line Description and Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Quanfu Fan,et al.  CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Ping Tan,et al.  Single-Shot is Enough: Panoramic Infrastructure Based Calibration of Multiple Cameras and 3D LiDARs , 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Fu Zhang,et al.  Pixel-Level Extrinsic Self Calibration of High Resolution LiDAR and Camera in Targetless Environments , 2021, IEEE Robotics and Automation Letters.

[14]  Jonathan T. Barron,et al.  iNeRF: Inverting Neural Radiance Fields for Pose Estimation , 2020, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  Klaus Dietmayer,et al.  Point Transformer , 2020, IEEE Access.

[16]  Wen Yang,et al.  Line-Based 2-D–3-D Registration and Camera Localization in Structured Environments , 2020, IEEE Transactions on Instrumentation and Measurement.

[17]  Honghai Liu,et al.  CalibRCNN: Calibrating Camera and LiDAR by Recurrent Convolutional Neural Network and Geometric Constraints , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Zhenyu Guo,et al.  RGGNet: Tolerance Aware LiDAR-Camera Online Calibration With Geometric Deep Learning and Generative Model , 2020, IEEE Robotics and Automation Letters.

[19]  Yu-Chiang Frank Wang,et al.  Convolution in the Cloud: Learning Deformable Kernels in 3D Graph Convolution Networks for Point Cloud Analysis , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Yubo Zhang,et al.  Online Camera-LiDAR Calibration with Sensor Semantic Information , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Wouter Van Gansbeke,et al.  Multi-Task Learning for Dense Prediction Tasks: A Survey , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Naiyan Wang,et al.  DMLO: Deep Matching LiDAR Odometry , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  Philip S. Yu,et al.  A Survey on Knowledge Graphs: Representation, Acquisition, and Applications , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[24]  Sridha Sridharan,et al.  Spatiotemporal Camera-LiDAR Calibration: A Targetless and Structureless Approach , 2020, IEEE Robotics and Automation Letters.

[25]  Quoc V. Le,et al.  EfficientDet: Scalable and Efficient Object Detection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Zhaohui Zheng,et al.  Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression , 2019, AAAI.

[27]  Jinhui Tang,et al.  Integrating Dense LiDAR-Camera Road Detection Maps by a Multi-Modal CRF Model , 2019, IEEE Transactions on Vehicular Technology.

[28]  Kun Jiang,et al.  Pixels and 3-D Points Alignment Method for the Fusion of Camera and LiDAR Data , 2019, IEEE Transactions on Instrumentation and Measurement.

[29]  Bo Chen,et al.  End-to-End Learnable Geometric Vision by Backpropagating PnP Optimization , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Bin Yang,et al.  Multi-Task Multi-Sensor Fusion for 3D Object Detection , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Jun Li,et al.  Unsupervised Feature Extraction in Hyperspectral Images Based on Wasserstein Generative Adversarial Network , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Bo Fu,et al.  LiDAR-Camera Calibration Under Arbitrary Configurations: Observability and Methods , 2019, IEEE Transactions on Instrumentation and Measurement.

[33]  Huosheng Hu,et al.  Building an Omnidirectional 3-D Color Laser Ranging System Through a Novel Calibration Method , 2019, IEEE Transactions on Industrial Electronics.

[34]  Roland Siegwart,et al.  From Coarse to Fine: Robust Hierarchical Localization at Large Scale , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Brendan Englot,et al.  LeGO-LOAM: Lightweight and Ground-Optimized Lidar Odometry and Mapping on Variable Terrain , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[36]  Kurt Keutzer,et al.  SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[37]  Jessica B. Hamrick,et al.  Relational inductive biases, deep learning, and graph networks , 2018, ArXiv.

[38]  Eduardo Ros,et al.  Joint direct estimation of 3D geometry and 3D motion using spatio temporal gradients , 2018, Pattern Recognit..

[39]  Takeshi Oishi,et al.  LiDAR and Camera Calibration Using Motions Estimated by Sensor Fusion Odometry , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[40]  K. Madhava Krishna,et al.  CalibNet: Geometrically Supervised Extrinsic Calibration using 3D Spatial Transformer Networks , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[41]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[42]  Huchuan Lu,et al.  Deep Mutual Learning , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43]  Hongdong Li,et al.  Accurate extrinsic calibration between monocular camera and sparse 3D Lidar points without markers , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[44]  Nick Schneider,et al.  RegNet: Multimodal sensor registration using deep neural networks , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[45]  Serge J. Belongie,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Daniel Cremers,et al.  Direct Sparse Odometry , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Bin Zhao,et al.  Extrinsic Calibration of 2-D Laser Rangefinder and Camera From Single Shot Based on Minimal Solution , 2016, IEEE Transactions on Instrumentation and Measurement.

[49]  Haryong Song,et al.  Robust Vision-Based Relative-Localization Approach Using an RGB-Depth Camera and LiDAR Sensor Fusion , 2016, IEEE Transactions on Industrial Electronics.

[50]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  S. Savarese,et al.  Automatic Extrinsic Calibration of Vision and Lidar by Maximizing Mutual Information , 2015, J. Field Robotics.

[52]  Roberto Cipolla,et al.  PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[53]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  Roland Siegwart,et al.  A Flexible Technique for Accurate Omnidirectional Camera Calibration and Structure from Motion , 2006, Fourth IEEE International Conference on Computer Vision Systems (ICVS'06).

[55]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[56]  Huihui Pan,et al.  Keypoint-Based LiDAR-Camera Online Calibration With Robust Geometric Network , 2022, IEEE Transactions on Instrumentation and Measurement.

[57]  Yan Zhuang,et al.  A Novel Lidar-Assisted Monocular Visual SLAM Framework for Mobile Robots in Outdoor Environments , 2022, IEEE Transactions on Instrumentation and Measurement.

[58]  Stephen Lin,et al.  Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).