Real-time implementation of depth map post-processing for 3D-TV in dedicated hardware

Depth maps generated automatically from video are usually misaligned with the objects in the original image and are produced at lower resolutions. We propose applying a joint-bilateral filter to smooth the depth map within objects and upsample it to the original image resolution, while keeping object edges in the depth map aligned with the original image. We performed algorithmic and ASIC-specific optimizations to achieve a real-time implementation in dedicated hardware while preserving high-quality results. Upsampling depth maps from 240×135 at 60 Hz to 1920×1080 at 60 Hz requires 0.35 mm² at 200 MHz for a dedicated hardware implementation in 45 nm CMOS technology.
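To make the filtering step concrete, the following is a minimal software sketch of joint-bilateral upsampling in its generic form. It is not the optimized hardware pipeline described above. The function name `joint_bilateral_upsample`, the parameters `radius`, `sigma_s` and `sigma_r`, the Gaussian kernels, and the choice of the block-centre pixel as the guide sample for each low-resolution depth value are all assumptions made for illustration.

```python
import math


def joint_bilateral_upsample(depth_lo, guide_hi, scale,
                             radius=1, sigma_s=1.0, sigma_r=16.0):
    """Upsample depth_lo by an integer factor `scale`, guided by guide_hi.

    depth_lo: low-resolution depth map, list of rows (H_lo x W_lo).
    guide_hi: full-resolution luminance image (H_lo*scale x W_lo*scale).
    All parameter names and defaults are illustrative assumptions.
    """
    h_lo, w_lo = len(depth_lo), len(depth_lo[0])
    h_hi, w_hi = len(guide_hi), len(guide_hi[0])
    out = [[0.0] * w_hi for _ in range(h_hi)]

    for y in range(h_hi):
        for x in range(w_hi):
            # Output pixel centre expressed in low-resolution coordinates.
            cy = (y + 0.5) / scale - 0.5
            cx = (x + 0.5) / scale - 0.5
            y0, x0 = y // scale, x // scale  # containing low-res pixel
            num = den = 0.0
            for j in range(y0 - radius, y0 + radius + 1):
                for i in range(x0 - radius, x0 + radius + 1):
                    if not (0 <= j < h_lo and 0 <= i < w_lo):
                        continue
                    # Assumption: the guide value of a low-res depth sample
                    # is the high-res pixel at the centre of its block.
                    gy = min(j * scale + scale // 2, h_hi - 1)
                    gx = min(i * scale + scale // 2, w_hi - 1)
                    dist2 = (j - cy) ** 2 + (i - cx) ** 2
                    diff = guide_hi[y][x] - guide_hi[gy][gx]
                    # Spatial weight times range weight on the guide image.
                    w = (math.exp(-dist2 / (2.0 * sigma_s ** 2))
                         * math.exp(-diff * diff / (2.0 * sigma_r ** 2)))
                    num += w * depth_lo[j][i]
                    den += w
            # Fall back to nearest-neighbour if every weight underflowed.
            out[y][x] = num / den if den > 0.0 else float(depth_lo[y0][x0])
    return out


# Toy input: a slightly noisy two-level depth map and a guide image with a
# sharp luminance edge between columns 3 and 4.
depth_lo = [[10, 14, 200, 204],
            [10, 14, 200, 204]]
guide_hi = [[0, 0, 0, 0, 255, 255, 255, 255] for _ in range(4)]
upsampled = joint_bilateral_upsample(depth_lo, guide_hi, scale=2)
```

In this toy case the depth values on each side of the luminance edge are averaged only with samples whose guide value is similar, so the result is smoothed within each region but stays sharp at the edge. For the resolutions quoted above the scale factor would be 8 in both directions (1920/240 and 1080/135), and 1920×1080 pixels at 60 Hz is about 124.4 million output pixels per second, which at 200 MHz leaves roughly 1.6 clock cycles per output pixel if blanking intervals are ignored.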
