Refinement of depth maps generated by low-cost depth sensors

With the introduction of the Microsoft Kinect into the gaming industry and release of Kinect-based application development kits, a whole new industry has evolved around the Kinect with applications for gaming, gesture recognition for controlling devices, 3-D communication, etc. coming to the fore. The popularity of the Kinect sensor can be attributed to its low cost and the real-time depth map generation capability. However, owing to Kinect's lack of sophistication, the depth maps generated have a lot of artifacts like poorly generated object boundaries and missing depth values and misalignment between the depth map and the color image. In this paper, a novel technique for depth map correction and post-processing is proposed. To fill the missing depth pixels in the generated depth map, we use a color-aware Gaussian-weighted averaging filter which estimates the missing depth values using the surrounding good depth pixels and by using the corresponding color image as a guide to prevent filtering across object boundaries. A depth map enhancement algorithm is then proposed based on multi-step upsampling-based anisotropic diffusion with structure aware weights using the SSIM distortion metric. This post-processing technique corrects and sharpens the boundaries of the objects of the depth map and ensures local depth smoothness within the objects. In all of the proposed techniques, an effort is made to reduce the complexity and keep the execution real-time.

[1]  Zhi-Hui Kong,et al.  CMOS-MEMS capacitive sensors for intra-cranial pressure monitoring: Sensor fabrication & system design , 2012, 2012 International SoC Design Conference (ISOCC).

[2]  Dmitriy Vatolin,et al.  Real-time depth map occlusion filling and scene background restoration for projected-pattern based depth cameras , 2011 .

[3]  Ahmet M. Kondoz,et al.  Utilisation of edge adaptive upsampling in compression of depth map videos for enhanced free-viewpoint rendering , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[4]  Luis Salgado,et al.  Efficient spatio-temporal hole filling strategy for Kinect depth maps , 2012, Electronic Imaging.

[5]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[7]  Michael S. Brown,et al.  High quality depth map upsampling for 3D-TOF cameras , 2011, 2011 International Conference on Computer Vision.

[8]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[9]  Dmitriy Vatolin,et al.  Temporal filtering for depth maps generated by Kinect depth camera , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[10]  A.M. Kondoz,et al.  Bit-Rate Adaptive Downsampling for the Coding of Multi-View Video with Depth Information , 2008, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[11]  Y.-S. Ho,et al.  Depth reconstruction filter for depth coding , 2009 .

[12]  Lifeng Sun,et al.  A novel upsampling scheme for depth map compression in 3DTV system , 2010, 28th Picture Coding Symposium.

[13]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, ACM Trans. Graph..

[14]  Om Prakash Gangwal,et al.  Real-time implementation of depth map post-processing for 3D-TV in dedicated hardware , 2010, 2010 Digest of Technical Papers International Conference on Consumer Electronics (ICCE).