Edge and motion-adaptive median filtering for multi-view depth map enhancement

We present a novel multi-view depth map enhancement method deployed as a post-processing of initially estimated depth maps, which are incoherent in the temporal and inter-view dimensions. The proposed method is based on edge and motion-adaptive median filtering and allows for an improved quality of virtual view synthesis. To enforce the spatial, temporal and inter-view coherence in the multiview depth maps, the median filtering is applied to 4-dimensional windows that consist of the spatially neighbor depth map values taken at different viewpoints and time instants. These windows have locally adaptive shapes in a presence of edges or motion to preserve sharpness and realistic rendering. We show that our enhancement method leads to a reduction of a coding bit-rate required for representation of the depth maps and also to a gain in the quality of synthesized views at an arbitrary virtual viewpoint. At the same time, the method carries a low additional computational complexity.

[1]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[2]  Anthony Vetro,et al.  Extensions of H.264/AVC for Multiview Video Compression , 2006, 2006 International Conference on Image Processing.

[3]  Andrea Fusiello Image-based Rendering * , 2003 .

[4]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[5]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Harry Shum,et al.  Image-based rendering , 2006, Found. Trends Comput. Graph. Vis..

[7]  Yo-Sung Ho,et al.  Multi-view Depth Map Estimation Enhancing Temporal Consistency , 2008 .

[8]  Minh N. Do,et al.  Joint encoding of the depth image based representation using shape-adaptive wavelets , 2008, 2008 15th IEEE International Conference on Image Processing.

[9]  Peter Eisert,et al.  Rate-distortion-optimized predictive compression of dynamic 3D mesh sequences , 2006, Signal Process. Image Commun..

[10]  A. Aydin Alatan,et al.  Region-Based Dense Depth Extraction from Multi-View Video , 2007 .

[11]  Peter H. N. de With,et al.  Depth-Image Compression Based on an R-D Optimized Quadtree Decomposition for the Transmission of Multiview Images , 2007, 2007 IEEE International Conference on Image Processing.

[12]  Yo-Sung Ho,et al.  Segment-Based Multi-View Depth Map Estimation Using Belief Propagation from Dense Multi-View Video , 2008, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.