Global Auto-Regressive Depth Recovery via Iterative Non-Local Filtering

Existing depth sensing techniques fall short in resolution, completeness, and accuracy, and the difficulty of capturing high-resolution depth data in turn limits the performance of 3-D broadcasting systems. In this paper, we present a framework for obtaining high-quality depth images and multi-view depth videos from simple acquisition systems. We first propose a single depth image recovery algorithm based on auto-regressive (AR) correlations. Under the global AR model, we derive a fixed-point iteration algorithm that efficiently solves the resulting large-scale quadratic programming problem; each iteration is equivalent to a nonlocal filtering process with residue feedback. We then extend the approach to AR-based multi-view depth video recovery, where each depth map is recovered from low-quality measurements with the help of the corresponding color image, depth maps from neighboring views, and depth maps of temporally adjacent frames. The AR coefficients are designed on nonlocal spatiotemporal neighborhoods to improve recovery performance. We further discuss the connections between our model and other methods, such as graph-based tools, and demonstrate that our algorithms enjoy the advantages of both global and local methods. Experimental results on the Middlebury datasets and on other captured datasets show that our method improves the recovery of depth images and multi-view depth videos compared with state-of-the-art approaches.
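To make the idea of "nonlocal filtering with residue feedback" concrete, the following is a minimal 1-D sketch and not the algorithm derived in the paper. It assumes a simplified objective, 0.5·‖P(d − d0)‖² + 0.5·λ·‖(I − A)d‖², where P selects the observed samples and A holds row-normalized AR coefficients computed from a guide signal (standing in for the color image). It minimizes this objective with a plain gradient (Richardson) iteration, used here as a stand-in for the paper's fixed-point scheme. The names `ar_coefficients` and `recover_depth`, the Gaussian weighting, and all parameter values are illustrative assumptions.

```python
import numpy as np

def ar_coefficients(guide, radius=2, sigma=0.1):
    """Row-normalized AR weights from a guide signal (assumed Gaussian form)."""
    n = guide.size
    A = np.zeros((n, n))
    for i in range(n):
        lo, hi = max(0, i - radius), min(n, i + radius + 1)
        for j in range(lo, hi):
            if j != i:
                diff = guide[i] - guide[j]
                A[i, j] = np.exp(-diff ** 2 / (2.0 * sigma ** 2))
        A[i] /= A[i].sum()  # each sample is predicted by a weighted average
    return A

def recover_depth(d0, mask, A, lam=1.0, num_iters=2000):
    """Gradient iteration on the simplified AR objective (illustrative only)."""
    n = d0.size
    M = np.eye(n) - A
    # Step size 1/L, with L an upper bound on the largest Hessian eigenvalue.
    step = 1.0 / (1.0 + lam * np.linalg.norm(M, 2) ** 2)
    # Start from the observations, filling holes with the observed mean.
    d = np.where(mask, d0, d0[mask].mean())
    for _ in range(num_iters):
        ar_residual = d - A @ d          # d minus its filtered version A d
        data_residual = mask * (d - d0)  # mismatch on observed samples only
        # Feed both residuals back into the estimate; the term
        # ar_residual - A.T @ ar_residual equals (I - A)^T (I - A) d.
        d = d - step * (data_residual + lam * (ar_residual - A.T @ ar_residual))
    return d

# Toy example: a step edge in depth, observed at every fourth sample.
n = 40
guide = np.where(np.arange(n) < n // 2, 0.0, 1.0)
depth = 1.0 + guide
mask = np.zeros(n, dtype=bool)
mask[::4] = True
d0 = np.where(mask, depth, 0.0)

A = ar_coefficients(guide)
d_hat = recover_depth(d0, mask, A)
print(np.abs(d_hat - depth).max())
```

Because the AR weights nearly vanish across the guide edge, the two sides of the step are filled in almost independently, so the recovered edge stays sharp. On real images the dense matrix `A` would be replaced by a sparse or implicit filtering operator.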
