Deep Convolutional Grid Warping Network for Joint Depth Map Upsampling

Depth maps play an important role in the representation of 3D information. They are often simultaneously acquired with color images; however, their resolution is significantly lower than that of color images owing to hardware limitations. In this paper, we propose a novel approach to upsample depth maps by using geometric deformation instead of pixel value refinement, which is employed in a majority of existing methods. This approach, known as grid warping, displaces the position of blurred pixels around the edge towards the center of the edge. The displacement vector for warping is obtained from an analysis of the corresponding high-resolution color image. Furthermore, we propose an edge signal and displacement vector modeling for a more effective analysis. The experimental results show that the proposed method significantly improves the quantitative and visual performance, as compared to state-of-the-art methods. The source codes of the proposed method will be available at https://github.com/yym064/DeepGridWarp.

[1]  Dowan Kim,et al.  Trilateral filter construction for depth map upsampling , 2013, IVMSP 2013.

[2]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Sebastian Thrun,et al.  A Noise‐aware Filter for Real‐time Depth Upsampling , 2008 .

[4]  Xiaojin Gong,et al.  Guided Depth Enhancement via Anisotropic Diffusion , 2013, PCM.

[5]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[6]  Byung Tae Oh,et al.  Depth map upsampling with a confidence-based joint guided filter , 2019, Signal Process. Image Commun..

[7]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, ACM Trans. Graph..

[8]  Seung-Won Jung,et al.  Enhancement of Image and Depth Map Using Adaptive Joint Trilateral Filter , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[10]  Feng Liu,et al.  Depth Enhancement via Low-Rank Matrix Completion , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Kai-Lung Hua,et al.  Extended Guided Filtering for Depth Map Upsampling , 2016, IEEE MultiMedia.

[12]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[14]  Michael J. Black,et al.  A Naturalistic Open Source Movie for Optical Flow Evaluation , 2012, ECCV.

[15]  Hang Su,et al.  Pixel-Adaptive Convolutional Neural Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Huchuan Lu,et al.  Saliency Detection via Graph-Based Manifold Ranking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Lin Yang,et al.  Evaluating and Improving the Depth Accuracy of Kinect for Windows v2 , 2015, IEEE Sensors Journal.

[18]  Minh N. Do,et al.  Depth Video Enhancement Based on Weighted Mode Filtering , 2012, IEEE Transactions on Image Processing.

[19]  Amnon Shashua,et al.  Stereo-Assist: Top-down stereo for driver assistance systems , 2010, 2010 IEEE Intelligent Vehicles Symposium.

[20]  Narendra Ahuja,et al.  Joint Image Filtering with Deep Convolutional Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Andrey S. Krylov,et al.  Deblurred Images Post-Processing by Poisson Warping , 2015, IEEE Signal Processing Letters.

[22]  Kai-Lung Hua,et al.  Joint trilateral filtering for depth map super-resolution , 2013, 2013 Visual Communications and Image Processing (VCIP).

[23]  Andrey S. Krylov,et al.  Single parameter post-processing method for image deblurring , 2017, 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA).

[24]  Gregory Shakhnarovich,et al.  Deep Back-Projection Networks for Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25]  Bashar Zogheib,et al.  Approximations of the Standard Normal Distribution , 2009 .

[26]  Jean Ponce,et al.  Deformable kernel networks for guided depth map upsampling , 2019, ArXiv.

[27]  Heiko Hirschmüller,et al.  Evaluation of Cost Functions for Stereo Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Michael F. Cohen,et al.  Digital photography with flash and no-flash image pairs , 2004, ACM Trans. Graph..

[29]  Vincent Dumoulin,et al.  Deconvolution and Checkerboard Artifacts , 2016 .

[30]  Zhengyou Zhang,et al.  Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[31]  Jian Sun,et al.  Guided Image Filtering , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Craig Gotsman,et al.  Enhancement by image-dependent warping , 1999, IEEE Trans. Image Process..

[33]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[34]  Michael S. Brown,et al.  High quality depth map upsampling for 3D-TOF cameras , 2011, 2011 International Conference on Computer Vision.

[35]  Xiaoou Tang,et al.  Depth Map Super-Resolution by Deep Multi-Scale Guidance , 2016, ECCV.

[36]  Sebastian Thrun,et al.  An Application of Markov Random Fields to Range Sensing , 2005, NIPS.

[37]  Minh N. Do,et al.  A revisit to MRF-based depth map super-resolution and enhancement , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[38]  Xiaoou Tang,et al.  Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[39]  Kyoung Mu Lee,et al.  Enhanced Deep Residual Networks for Single Image Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[40]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.