Convexity constrained efficient superpixel and supervoxel extraction

This paper presents an efficient superpixel (SP) and supervoxel (SV) extraction method that aims improvements over the state-of-the-art in terms of both accuracy and computational complexity. Segmentation performance is improved through convexity constrained distance utilization, whereas computational efficiency is achieved by replacing complete region processing by a boundary adaptation technique. Starting from the uniformly distributed, rectangular (cubical) equal size (volume) superpixels (supervoxels), region boundaries are iteratively adapted towards object edges. Adaptation is performed by assigning the boundary pixels to the most similar neighboring SPs (SVs). At each iteration, SP (SV) regions are updated; hence, progressively converging to compact pixel groups. Detailed experimental comparisons against the state-of-the-art competing methods validate the performance of the proposed technique considering both accuracy and speed. HighlightsAn iterative superpixel extraction method is proposed where only boundary pixels are updated for an efficient computation.Color term in the cost function enables strong adaptation on object boundaries and distance term enforces a geometry constrain.Extension of the method on spatio-temporal volume shows region adaptation in video footage.Superior or competing quantitative performance is observed in comparison with the state-of-the-art techniques.

[1]  Mei Han,et al.  Efficient hierarchical graph-based video segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Sven J. Dickinson,et al.  TurboPixels: Fast Superpixels Using Geometric Flows , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Gunilla Borgefors,et al.  Distance transformations in digital images , 1986, Comput. Vis. Graph. Image Process..

[5]  Luc Vincent,et al.  Watersheds in Digital Spaces: An Efficient Algorithm Based on Immersion Simulations , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Camille Couprie,et al.  Power Watershed: A Unifying Graph-Based Optimization Framework , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Ronen Basri,et al.  Fast multiscale image segmentation , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[8]  Kemal Ugur,et al.  Interactive 2D 3D image conversion method for mobile devices , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[9]  A. Aydin Alatan,et al.  User assisted disparity remapping for stereo images , 2013, Signal Process. Image Commun..

[10]  Paria Mehrani,et al.  Superpixels and Supervoxels in an Energy Optimization Framework , 2010, ECCV.

[11]  Dorin Comaniciu,et al.  Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Luc Van Gool,et al.  SEEDS: Superpixels Extracted via Energy-Driven Sampling , 2012, ECCV.

[13]  Umar Mohammed,et al.  Superpixel lattices , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  A. Aydin Alatan,et al.  User assisted stereo image segmentation , 2012, 2012 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[15]  Stefano Soatto,et al.  Motion segmentation with occlusions on the superpixel graph , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[16]  A. Aydin Alatan,et al.  Interactive object segmentation for mono and stereo applications: Geodesic prior induced graph cut energy minimization , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[17]  Cevahir Çigla,et al.  Super pixel extraction via convexity induced boundary adaptation , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[18]  Xiaochun Cao,et al.  Topology Preserved Regular Superpixel , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[19]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[20]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Chenliang Xu,et al.  Evaluation of super-voxel methods for early video processing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Scott Cohen,et al.  Geodesic graph cut for interactive image segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  A. Aydin Alatan,et al.  Efficient graph-based image segmentation via speeded-up turbo pixels , 2010, 2010 IEEE International Conference on Image Processing.

[24]  Jitendra Malik,et al.  Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[25]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[26]  Jean Ponce,et al.  Computer Vision: A Modern Approach , 2002 .

[27]  James M. Rehg,et al.  Motion Coherent Tracking with Multi-label MRF optimization , 2010, BMVC.

[28]  Andrew Blake,et al.  GeoS: Geodesic Image Segmentation , 2008, ECCV.

[29]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Fernand Meyer,et al.  An Overview of Morphological Segmentation , 2001, Int. J. Pattern Recognit. Artif. Intell..

[31]  Hongbin Zha,et al.  Structure-sensitive superpixels via geodesic distance , 2011, ICCV.