User assisted stereo image segmentation

The wide availability of stereoscopic 3D displays created a considerable market for content producers. This encouraged researchers to focus on methods to alter and process the content for various purposes. This study concentrates on user assisted image segmentation and proposes a method to extend previous techniques on monoscopic image segmentation to stereoscopic footage with minimum effort. User assistance is required to indicate the representative locations of an image as object and background regions. An MRF based energy minimization technique is utilized where user inputs are applied only on one of the stereoscopic pairs. A key contribution of the proposed study is the elimination of dense disparity estimation by introducing a sparse feature matching idea. Segmentation results are evaluated by objective metrics on a ground truth stereo segmentation dataset and it can be concluded that competitive results with minimum user interaction have been obtained even without dense disparity estimation.

[1]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[2]  Scott Cohen,et al.  LIVEcut: Learning-based interactive video segmentation by evaluation of multiple propagated cues , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[4]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[5]  Matthias Zwicker,et al.  Stereoscopic 3D copy & paste , 2010, SIGGRAPH 2010.

[6]  Andrew Blake,et al.  GeoS: Geodesic Image Segmentation , 2008, ECCV.

[7]  A. Aydin Alatan,et al.  Efficient graph-based image segmentation via speeded-up turbo pixels , 2010, 2010 IEEE International Conference on Image Processing.

[8]  Guillermo Sapiro,et al.  Video SnapCut: robust video object cutout using localized classifiers , 2009, ACM Trans. Graph..

[9]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[10]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11]  Cevahir Çigla,et al.  Efficient edge-preserving stereo matching , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[12]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[13]  A. Aydin Alatan,et al.  Interactive object segmentation for mono and stereo applications: Geodesic prior induced graph cut energy minimization , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[14]  Scott Cohen,et al.  StereoCut: Consistent interactive object selection in stereo image pairs , 2011, 2011 International Conference on Computer Vision.

[15]  Xing Mei,et al.  On building an accurate stereo matching system on graphics hardware , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[16]  Jian Sun,et al.  Lazy snapping , 2004, SIGGRAPH 2004.

[17]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.