Graph-based multiview depth estimation using segmentation

This paper presents a new depth estimation method for multiview systems with arbitrary camera locations. The method exploits the graph cuts method, where vertices of the graph represent segments used for controlling the trade-off between the quality of depth maps and the time of estimation, while preserving the original resolution of a depth map. Moreover, the inter-view consistency of the depth maps, crucial for free-viewpoint television systems, is ensured by introduction of suitable connections in the optimized graph. It makes the proposed method the first that allows generation of spatially-consistent multiview depth maps using segmentation-based estimation. A new method of the adaptive calculation of the smoothing coefficient was also presented. The performance of the proposed algorithm was tested and compared with the state-of-the-art DERS method, showing an significant improvement, both in terms of the depth maps fidelity and the time of estimation.

[1]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Qifei Wang Computational Models for Multiview Dense Depth Maps of Dynamic Scene , 2015, ArXiv.

[3]  Takanori Senoh,et al.  New visual coding exploration in MPEG: Super-MultiView and Free Navigation in Free viewpoint TV , 2016, SD&A.

[4]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Feng Wu,et al.  Estimation of Virtual View Synthesis Distortion Toward Virtual View Position , 2016, IEEE Transactions on Image Processing.

[6]  Zixiang Xiong,et al.  A gradient-based approach for interference cancelation in systems with multiple Kinect cameras , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[7]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[8]  Krzysztof Wegner,et al.  A practical approach to acquisition and processing of free viewpoint video , 2015, 2015 Picture Coding Symposium (PCS).

[9]  Peter Schelkens,et al.  Spatio-Temporally Consistent Color and Structure Optimization for Multiview Video Color Correction , 2015, IEEE Transactions on Multimedia.

[10]  Li Rui,et al.  VR glasses and leap motion trends in education , 2016, 2016 11th International Conference on Computer Science & Education (ICCSE).

[11]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[12]  Masayuki Tanimoto FTV standardization in MPEG , 2014, 2014 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[13]  Peter Eisert,et al.  Real-time generation of multi-view video plus depth content using mixed narrow and wide baseline , 2014, J. Vis. Commun. Image Represent..

[14]  Krzysztof Wegner,et al.  Poznan University of Technology test multiview video sequences acquired with circular camera arrangement – “Poznan Team” and “Poznan Blocks” sequences , 2015 .

[15]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[16]  Gauthier Lafruit,et al.  Multi-view wide baseline depth estimation robust to sparse input sampling , 2016, 2016 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[17]  Marc Pollefeys,et al.  Simplified Belief Propagation for Multiple View Reconstruction , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[18]  Òscar Divorra Escoda,et al.  Depth estimation based on multiview matching with depth/color segmentation and memory efficient Belief Propagation , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[19]  Margrit Gelautz,et al.  Graph-based surface reconstruction from stereo pairs using image segmentation , 2005 .

[20]  Yo-Sung Ho,et al.  High-quality multi-view depth generation using multiple color and depth cameras , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[21]  Gauthier Lafruit,et al.  Multi-camera epipolar plane image feature detection for robust view synthesis , 2015, 2015 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[22]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Li Hong,et al.  Segment-based stereo matching using graph cuts , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..