Gradient-domain-based enhancement of multi-view depth video

Multi-view depth is an emerging and attractive 3D representation in recent years, which acts a significant role in rendering numerous viewing angles from a small number of given input views. The performance of those depth-based 3D video applications is strongly dependent on the quality of multi-view depth. However, because of the limitations of depth acquisition and estimation, the quality of multi-view depth suffers from artifacts in spatial dimension and inconsistency in both the temporal and inter-view dimensions. In this paper, we propose a gradient-domain based enhancement method for multi-view depth. Being different from the traditional enhancement methods which intended to process one or two above dimensions, our proposal exploits the coherence of both temporal and inter-view dimensions in addition of the spatial one. It is very challenging to obtain the stable characteristics in multi-dimensions. To solve this problem, we propose to investigate the characteristics in the gradient domain rather than the intensity domain. The enhanced multi-view depth is obtained through the minimization of energy function under the constraint of a joint gradient field (JGF), which is estimated from multiple dimensions through motion estimation and geometric mapping. Therefore, the enhanced multi-view depth is a global optimization in multiple dimensions that ensures the consistency in temporal, inter-view and spatial domains. Furthermore, we also propose an enhancement structure to indicate the process order of depth frames. The experimental results suggest that the proposed method can enhance multi-view depth with desired sharpness and consistency.

[1]  Concha Bielza,et al.  A review on evolutionary algorithms in Bayesian network learning and inference tasks , 2013, Inf. Sci..

[2]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[3]  Thomas Wiegand,et al.  3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[4]  Levent Onural,et al.  Three-Dimensional Television: Capture, Transmission, Display , 2007 .

[5]  Li Hong,et al.  Segment-based stereo matching using graph cuts , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[6]  Rudy Lauwereins,et al.  Real-Time and Accurate Stereo: A Scalable Approach With Bitwise Fast Voting on CUDA , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Minh N. Do,et al.  Depth Video Enhancement Based on Weighted Mode Filtering , 2012, IEEE Transactions on Image Processing.

[8]  Tae-Sun Choi,et al.  Optimal depth estimation by combining focus measures using genetic programming , 2011, Inf. Sci..

[9]  Harry Shum,et al.  Gradient Profile Prior and Its Applications in Image Super-Resolution and Enhancement , 2011, IEEE Transactions on Image Processing.

[10]  Liang Zhang Fast Stereo Matching Algorithm for Intermediate View Reconstruction of Stereoscopic Television Images , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, SIGGRAPH 2007.

[13]  Ahmet M. Kondoz,et al.  Edge adaptive upsampling of depth map videos for enhanced free-viewpoint video quality , 2009 .

[14]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[15]  Dani Lischinski,et al.  Gradient Domain High Dynamic Range Compression , 2023 .

[16]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[17]  Nikolaos Grammalidis,et al.  Disparity and occlusion estimation for multiview image sequences using dynamic programming , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[18]  William T. Freeman,et al.  Learning low-level vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[19]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Shuyuan Yang,et al.  High resolution range-reflectivity estimation of radar targets via compressive sampling and Memetic Algorithm , 2013, Inf. Sci..

[21]  Aljoscha Smolic,et al.  3DAV exploration of video-based rendering technology in MPEG , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Cheng Wang,et al.  A global optimization algorithm for target set selection problems , 2014, Inf. Sci..

[23]  Hyunjin Kim,et al.  Communication-aware task scheduling and voltage selection for total energy minimization in a multiprocessor system using Ant Colony Optimization , 2011, Inf. Sci..

[24]  Teresa Bernarda Ludermir,et al.  Combining Uncertainty Sampling methods for supporting the generation of meta-examples , 2012, Inf. Sci..

[25]  Lu Yu,et al.  Temporal consistency enhancement on depth sequences , 2010, 28th Picture Coding Symposium.

[26]  Markus Flierl,et al.  A Variational Bayesian Inference Framework for Multiview Depth Image Enhancement , 2012, 2012 IEEE International Symposium on Multimedia.

[27]  Aljoscha Smolic,et al.  An overview of available and emerging 3D video formats and depth enhanced stereo as efficient generic solution , 2009, 2009 Picture Coding Symposium.

[28]  L. Hong,et al.  Segment-based stereo matching using graph cuts , 2004, CVPR 2004.

[29]  Yue Gao,et al.  k-Partite graph reinforcement and its application in multimedia information retrieval , 2012, Inf. Sci..

[30]  Sebastian Schwarz,et al.  Adaptive depth filtering for HEVC 3D video coding , 2012, 2012 Picture Coding Symposium.

[31]  Erhan Ekmekcioglu,et al.  Content Adaptive Enhancement of Multi-View Depth Maps for Free Viewpoint Video , 2011, IEEE Journal of Selected Topics in Signal Processing.

[32]  Yu-Lin He,et al.  Bayesian classifiers based on probability density estimation and their applications to simultaneous fault diagnosis , 2014, Inf. Sci..

[33]  Seung-Won Jung,et al.  Enhancement of Image and Depth Map Using Adaptive Joint Trilateral Filter , 2013, IEEE Transactions on Circuits and Systems for Video Technology.