Transforming a 3-D LiDAR Point Cloud Into a 2-D Dense Depth Map Through a Parameter Self-Adaptive Framework

The 3-D LiDAR scanner and the 2-D charge-coupled device (CCD) camera are two typical types of sensors for surrounding-environment perceiving in robotics or autonomous driving. Commonly, they are jointly used to improve perception accuracy by simultaneously recording the distances of surrounding objects, as well as the color and shape information. In this paper, we use the correspondence between a 3-D LiDAR scanner and a CCD camera to rearrange the captured LiDAR point cloud into a dense depth map, in which each 3-D point corresponds to a pixel at the same location in the RGB image. In this paper, we assume that the LiDAR scanner and the CCD camera are accurately calibrated and synchronized beforehand so that each 3-D LiDAR point cloud is aligned with its corresponding RGB image. Each frame of the LiDAR point cloud is then projected onto the RGB image plane to form a sparse depth map. Then, a self-adaptive method is proposed to upsample the sparse depth map into a dense depth map, in which the RGB image and the anisotropic diffusion tensor are exploited to guide upsampling by reinforcing the RGB-depth compactness. Finally, convex optimization is applied on the dense depth map for global enhancement. Experiments on the KITTI and Middlebury data sets demonstrate that the proposed method outperforms several other relevant state-of-the-art methods in terms of visual comparison and root-mean-square error measurement.

[1]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  Michael Ying Yang,et al.  Joint Object Segmentation and Depth Upsampling , 2015, IEEE Signal Processing Letters.

[3]  Yao Wang,et al.  Color-Guided Depth Recovery From RGB-D Data Using an Adaptive Autoregressive Model , 2014, IEEE Transactions on Image Processing.

[4]  Kun Li,et al.  Depth Recovery Using an Adaptive Color-Guided Auto-Regressive Model , 2012, ECCV.

[5]  Jitendra Malik,et al.  Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Long Chen,et al.  A novel way to organize 3D LiDAR point cloud as 2D depth map height map and surface normal map , 2015, 2015 IEEE International Conference on Robotics and Biomimetics (ROBIO).

[7]  Ruimin Hu,et al.  Kinect depth map based enhancement for low light surveillance image , 2013, 2013 IEEE International Conference on Image Processing.

[8]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Christian Schaller,et al.  Time-of-Flight - A New Modality for Radiotherapy , 2011 .

[10]  Bin Dai,et al.  Velodyne-based curb detection up to 50 meters away , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[11]  Heiko Hirschmüller,et al.  Evaluation of Cost Functions for Stereo Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Richard Szeliski,et al.  A Comparative Study of Energy Minimization Methods for Markov Random Fields with Smoothness-Based Priors , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Christoph Stiller,et al.  Velodyne SLAM , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[15]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[16]  Cheng Wang,et al.  Using Mobile LiDAR Data for Rapidly Updating Road Markings , 2015, IEEE Transactions on Intelligent Transportation Systems.

[17]  Jaebum Choi,et al.  Hybrid map-based SLAM using a Velodyne laser scanner , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[18]  Xing Mei,et al.  Depth Map Upsampling via Compressive Sensing , 2013, 2013 2nd IAPR Asian Conference on Pattern Recognition.

[19]  Ming Yang,et al.  3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Nicholas Roy,et al.  Tracking-Based Semi-supervised Learning , 2012 .

[21]  Paul E. Rybski,et al.  Vision-based 3D bicycle tracking using deformable part model and Interacting Multiple Model filter , 2011, 2011 IEEE International Conference on Robotics and Automation.

[22]  Long Chen,et al.  Sparse depth map upsampling with RGB image and anisotropic diffusion tensor , 2015, 2015 IEEE Intelligent Vehicles Symposium (IV).

[23]  Ming-Yu Liu,et al.  Joint Geodesic Upsampling of Depth Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Reinhard Koch,et al.  Time‐of‐Flight Cameras in Computer Graphics , 2010, Comput. Graph. Forum.

[25]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[26]  Shipeng Li,et al.  Texture-assisted Kinect depth inpainting , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[27]  Cristiano Premebida,et al.  Pedestrian detection combining RGB and dense LIDAR data , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  Jitendra Malik,et al.  Learning Rich Features from RGB-D Images for Object Detection and Segmentation , 2014, ECCV.

[29]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Bin Dai,et al.  Performance of global descriptors for velodyne-based urban object recognition , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[31]  Horst Bischof,et al.  Image Guided Depth Upsampling Using Anisotropic Total Generalized Variation , 2013, 2013 IEEE International Conference on Computer Vision.

[32]  Qingxiong Yang,et al.  Stereo Matching Using Tree Filtering , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Thierry Chateau,et al.  Pedestrian Detection and Tracking in an Urban Environment Using a Multilayer Laser Scanner , 2010, IEEE Transactions on Intelligent Transportation Systems.

[34]  Shi Chen,et al.  Using edit distance and junction feature to detect and recognize arrow road marking , 2014, 17th International IEEE Conference on Intelligent Transportation Systems (ITSC).

[35]  Adam C. Winstanley,et al.  Background Foreground Segmentation for SLAM , 2011, IEEE Transactions on Intelligent Transportation Systems.