Interactive object segmentation for mono and stereo applications: Geodesic prior induced graph cut energy minimization

This study proposes an interactive multi label object segmentation method and applications on mono and stereo images. The general segmentation problem is approached by an energy minimization on a Markov Random Field (MRF). The minimum energy potential labelling is the primary goal of the multi label segmentation algorithm. User inputs are used to determine object location and geodesic prior induced iterative graph cut energy minimization is used to define object boundaries. Segmented objects on mono images are used to generate stereo pairs for viewing on 3D displays. Segmented object pairs on stereo images are used for depth adjustment in order to achieve better visual quality. The assignment of relative depths on multiple objects is necessary for stereo image pair synthesis using conventional depth image based rendering (DIBR) techniques.

[1]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[3]  Aljoscha Smolic,et al.  Nonlinear disparity mapping for stereoscopic 3D , 2010, ACM Trans. Graph..

[4]  Olga Veksler,et al.  Markov random fields with efficient approximations , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[5]  Sven J. Dickinson,et al.  TurboPixels: Fast Superpixels Using Geometric Flows , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Andrew Blake,et al.  GeoS: Geodesic Image Segmentation , 2008, ECCV.

[7]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[8]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[9]  D. Greig,et al.  Exact Maximum A Posteriori Estimation for Binary Images , 1989 .

[10]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[11]  A. Aydin Alatan,et al.  Efficient graph-based image segmentation via speeded-up turbo pixels , 2010, 2010 IEEE International Conference on Image Processing.

[12]  Jitendra Malik,et al.  Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[13]  Scott Cohen,et al.  Geodesic graph cut for interactive image segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Kemal Ugur,et al.  Interactive 2D 3D image conversion method for mobile devices , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[15]  Filippo Speranza,et al.  Surrogate depth maps for stereoscopic imaging: different edge types , 2007, Electronic Imaging.

[16]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Wenxian Yang,et al.  User-Friendly Interactive Image Segmentation Through Unified Combinatorial User Inputs , 2010, IEEE Transactions on Image Processing.