Non-Learning Stereo-Aided Depth Completion Under Mis-Projection via Selective Stereo Matching

We propose a non-learning depth completion method for a sparse depth map captured using a light detection and ranging (LiDAR) sensor guided by a pair of stereo images. Generally, conventional stereo-aided depth completion methods have two limiations. (i) they assume the given sparse depth map is accurately aligned to the input image, whereas the alignment is difficult to achieve in practice; (ii) they have limited accuracy in the long range because the depth is estimated by pixel disparity. To solve the abovementioned limitations, we propose selective stereo matching (SSM) that searches the most appropriate depth value for each image pixel from its neighborly projected LiDAR points based on an energy minimization framework. This depth selection approach can handle any type of mis-projection. Moreover, SSM has an advantage in terms of long-range depth accuracy because it directly uses the LiDAR measurement rather than the depth acquired from the stereo. SSM is a discrete process; thus, we apply variational smoothing with binary anisotropic diffusion tensor (B-ADT) to generate a continuous depth map while preserving depth discontinuity across object boundaries. Experimentally, compared with the previous state-of-the-art stereo-aided depth completion, the proposed method reduced the mean absolute error (MAE) of the depth estimation to 0.65 times and demonstrated approximately twice more accurate estimation in the long range. Moreover, under various LiDAR-camera calibration errors, the proposed method reduced the depth estimation MAE to 0.34-0.93 times from previous depth completion methods.

[1]  Silvio Savarese,et al.  Automatic Targetless Extrinsic Calibration of a 3D Lidar and Camera by Maximizing Mutual Information , 2012, AAAI.

[2]  Kihong Park,et al.  High-Precision Depth Estimation with the 3D LiDAR and Stereo Fusion , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Stefano Soatto,et al.  Dense Depth Posterior (DDP) From Single Image and Sparse Range , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Takeo Kanade,et al.  Integrating LIDAR into Stereo for Fast and Improved Disparity Computation , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[5]  Sertac Karaman,et al.  Self-Supervised Sparse-to-Dense: Self-Supervised Depth Completion from LiDAR and Monocular Camera , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[6]  Qingquan Li,et al.  Transforming a 3-D LiDAR Point Cloud Into a 2-D Dense Depth Map Through a Parameter Self-Adaptive Framework , 2017, IEEE Transactions on Intelligent Transportation Systems.

[7]  Paul Newman,et al.  Real-time probabilistic fusion of sparse 3D LIDAR and dense stereo , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[8]  Kyungdon Joo,et al.  Non-Local Spatial Propagation Network for Depth Completion , 2020, ECCV.

[9]  Kyungdon Joo,et al.  Volumetric Propagation Network: Stereo-LiDAR Fusion for Long-Range Depth Estimation , 2021, IEEE Robotics and Automation Letters.

[10]  Mengmeng Wang,et al.  FCFR-Net: Feature Fusion based Coarse-to-Fine Residual Learning for Depth Completion , 2020, AAAI.

[11]  Takeshi Oishi,et al.  LiDAR and Camera Calibration Using Motions Estimated by Sensor Fusion Odometry , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Sebastian Thrun,et al.  An Application of Markov Random Fields to Range Sensing , 2005, NIPS.

[13]  Xinming Huang,et al.  DepthNet: Real-Time LiDAR Point Cloud Depth Completion for Autonomous Vehicles , 2020, IEEE Access.

[14]  Camillo J. Taylor,et al.  DFuseNet: Deep Fusion of RGB and Sparse Depth Information for Image Guided Dense Depth Completion , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[15]  Thomas Brox,et al.  Sparsity Invariant CNNs , 2017, 2017 International Conference on 3D Vision (3DV).

[16]  Didier Stricker,et al.  SSGP: Sparse Spatial Guided Propagation for Robust and Generic Interpolation , 2021, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[17]  Andrew J. Davison,et al.  DTAM: Dense tracking and mapping in real-time , 2011, 2011 International Conference on Computer Vision.

[18]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[19]  Michael Felsberg,et al.  Confidence Propagation through CNNs for Guided Sparse Depth Regression , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  M. Pollefeys,et al.  DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene From Sparse LiDAR Data and Single Color Image , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Rui Fan,et al.  PVStereo: Pyramid Voting Module for End-to-End Self-Supervised Stereo Matching , 2021, IEEE Robotics and Automation Letters.

[22]  Zheng Liu,et al.  Automatic calibration and registration of lidar and stereo camera without calibration objects , 2015, 2015 IEEE International Conference on Vehicular Electronics and Safety (ICVES).

[23]  Moritz Menze,et al.  Object scene flow , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[24]  Dacheng Tao,et al.  Adaptive Context-Aware Multi-Modal Network for Depth Completion , 2020, IEEE Transactions on Image Processing.

[25]  Raquel Urtasun,et al.  Efficient Joint Segmentation, Occlusion Labeling, Stereo and Flow Estimation , 2014, ECCV.

[26]  W. Freeman,et al.  Generalized Belief Propagation , 2000, NIPS.

[27]  Horst Bischof,et al.  Image Guided Depth Upsampling Using Anisotropic Total Generalized Variation , 2013, 2013 IEEE International Conference on Computer Vision.

[28]  Zejian Yuan,et al.  A Multi-Scale Guided Cascade Hourglass Network for Depth Completion , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[29]  Ruigang Yang,et al.  CSPN++: Learning Context and Resource Aware Convolutional Spatial Propagation Networks for Depth Completion , 2019, AAAI.

[30]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[31]  Lin Yan,et al.  Revisiting Sparsity Invariant Convolution: A Network for Image Guided Depth Completion , 2020, IEEE Access.

[32]  Antonin Chambolle,et al.  A First-Order Primal-Dual Algorithm for Convex Problems with Applications to Imaging , 2011, Journal of Mathematical Imaging and Vision.

[33]  Raúl Rojas,et al.  Large scale Semi-Global Matching on the CPU , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[34]  Stefano Soatto,et al.  Unsupervised Depth Completion From Visual Inertial Odometry , 2019, IEEE Robotics and Automation Letters.

[35]  Joaquim Salvi,et al.  Recent progress in coded structured light as a technique to solve the correspondence problem: a survey , 1998, Pattern Recognit..

[36]  Luc Van Gool,et al.  Sparse and Noisy LiDAR Completion with RGB Guidance and Uncertainty , 2019, 2019 16th International Conference on Machine Vision Applications (MVA).

[37]  Sergiu Nedevschi,et al.  Improving local stereo algorithms using binary shifted windows, fusion and smoothness constraint , 2015, 2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP).

[38]  Marc Pollefeys,et al.  Semantically Guided Depth Upsampling , 2016, GCPR.

[39]  Kihong Park,et al.  High-Precision Depth Estimation Using Uncalibrated LiDAR and Stereo Fusion , 2020, IEEE Transactions on Intelligent Transportation Systems.

[40]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, ACM Trans. Graph..

[41]  Takeshi Oishi,et al.  Real-Time Dense Depth Estimation Using Semantically-Guided LIDAR Data Propagation and Motion Stereo , 2019, IEEE Robotics and Automation Letters.

[42]  Raquel Urtasun,et al.  Learning Joint 2D-3D Representations for Depth Completion , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[43]  Hujun Bao,et al.  Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[44]  Mehrtash Harandi,et al.  Hierarchical Neural Architecture Search for Deep Stereo Matching , 2020, NeurIPS.

[45]  Hongdong Li,et al.  Noise-Aware Unsupervised Deep Lidar-Stereo Fusion , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Andreas Geiger,et al.  Automatic camera and range sensor calibration using a single shot , 2012, 2012 IEEE International Conference on Robotics and Automation.

[47]  Takeshi Oishi,et al.  Discontinuous and Smooth Depth Completion With Binary Anisotropic Diffusion Tensor , 2020, IEEE Robotics and Automation Letters.

[48]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.