Kinect Depth Recovery Using a Color-Guided, Region-Adaptive, and Depth-Selective Framework

Considering that the existing depth recovery approaches have different limitations when applied to Kinect depth data, in this article, we propose to integrate their effective features including adaptive support region selection, reliable depth selection, and color guidance together under an optimization framework for Kinect depth recovery. In particular, we formulate our depth recovery as an energy minimization problem, which solves the depth hole filling and denoising simultaneously. The energy function consists of a fidelity term and a regularization term, which are designed according to the Kinect characteristics. Our framework inherits and improves the idea of guided filtering by incorporating structure information and prior knowledge of the Kinect noise model. Through analyzing the solution to the optimization framework, we also derive a local filtering version that provides an efficient and effective way of improving the existing filtering techniques. Quantitative evaluations on our developed synthesized dataset and experiments on real Kinect data show that the proposed method achieves superior performance in terms of recovery accuracy and visual quality.

[1]  Michael F. Cohen,et al.  Digital photography with flash and no-flash image pairs , 2004, ACM Trans. Graph..

[2]  Guido M. Cortelazzo,et al.  Microsoft Kinect™ Range Camera , 2012 .

[3]  Henry Fuchs,et al.  Encumbrance-free telepresence system with real-time 3D capture and display using commodity depth cameras , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[4]  Richard Barrett,et al.  Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods , 1994, Other Titles in Applied Mathematics.

[5]  Minh N. Do,et al.  Depth Video Enhancement Based on Weighted Mode Filtering , 2012, IEEE Transactions on Image Processing.

[6]  Tat-Jen Cham,et al.  High-quality Kinect depth filtering for real-time 3D telepresence , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[7]  Michael Schmeing,et al.  Color Segmentation Based Depth Image Filtering , 2012, WDIA.

[8]  Guangming Shi,et al.  Structure guided fusion for depth map inpainting , 2013, Pattern Recognit. Lett..

[9]  Dong Tian,et al.  Depth map processing with iterative joint multilateral filtering , 2010, 28th Picture Coding Symposium.

[10]  Jianfei Cai,et al.  Kinect Shadow Detection and Classification , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[11]  Chongyu Chen,et al.  A color-guided, region-adaptive and depth-selective unified framework for Kinect depth recovery , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[12]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, ACM Trans. Graph..

[13]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[14]  Kun Li,et al.  Depth Recovery Using an Adaptive Color-Guided Auto-Regressive Model , 2012, ECCV.

[15]  Michael S. Brown,et al.  High quality depth map upsampling for 3D-TOF cameras , 2011, 2011 International Conference on Computer Vision.

[16]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[17]  Hai Yang,et al.  ACM Transactions on Intelligent Systems and Technology - Special Section on Urban Computing , 2014 .

[18]  Christopher Joseph Pal,et al.  Learning Conditional Random Fields for Stereo , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.

[20]  Henry Fuchs,et al.  Reducing interference between multiple structured light depth sensors using motion , 2012, 2012 IEEE Virtual Reality Workshops (VRW).

[21]  Dmitriy Vatolin,et al.  Real-time depth map occlusion filling and scene background restoration for projected-pattern based depth cameras , 2011 .

[22]  Markus H. Gross,et al.  FreeCam: A Hybrid Camera System for Interactive Free-Viewpoint Video , 2011, VMV.

[23]  Ahmet M. Kondoz,et al.  Adaptive sharpening of depth maps for 3D-TV , 2010 .

[24]  Dong Tian,et al.  Joint trilateral filtering for depth map compression , 2010, Visual Communications and Image Processing.

[25]  Németh Zoltán Autodesk 3ds Max , 2015 .

[26]  Hui Lin,et al.  Depth image enhancement for Kinect using region growing and bilateral filter , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[27]  Hans-Peter Seidel,et al.  Coherent Spatiotemporal Filtering, Upsampling and Rendering of RGBZ Videos , 2012, Comput. Graph. Forum.

[28]  Shipeng Li,et al.  Texture-assisted Kinect depth inpainting , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[29]  Luis Salgado,et al.  Efficient spatio-temporal hole filling strategy for Kinect depth maps , 2012, Electronic Imaging.