论文信息 - Graph-Based Segmentation for RGB-D Data Using 3-D Geometry Enhanced Superpixels

Graph-Based Segmentation for RGB-D Data Using 3-D Geometry Enhanced Superpixels

With the advances of depth sensing technologies, color image plus depth information (referred to as RGB-D data hereafter) is more and more popular for comprehensive description of 3-D scenes. This paper proposes a two-stage segmentation method for RGB-D data: 1) oversegmentation by 3-D geometry enhanced superpixels and 2) graph-based merging with label cost from superpixels. In the oversegmentation stage, 3-D geometrical information is reconstructed from the depth map. Then, a K-means-like clustering method is applied to the RGB-D data for oversegmentation using an 8-D distance metric constructed from both color and 3-D geometrical information. In the merging stage, treating each superpixel as a node, a graph-based model is set up to relabel the superpixels into semantically-coherent segments. In the graph-based model, RGB-D proximity, texture similarity, and boundary continuity are incorporated into the smoothness term to exploit the correlations of neighboring superpixels. To obtain a compact labeling, the label term is designed to penalize labels linking to similar superpixels that likely belong to the same object. Both the proposed 3-D geometry enhanced superpixel clustering method and the graph-based merging method from superpixels are evaluated by qualitative and quantitative results. By the fusion of color and depth information, the proposed method achieves superior segmentation performance over several state-of-the-art algorithms.

[1] Chun-Shien Lu,et al. Unsupervised texture segmentation via wavelet transform , 1997, Pattern Recognit..

[2] Qionghai Dai,et al. Free-Viewpoint Video of Human Actors Using Multiple Handheld Kinects , 2013, IEEE Transactions on Cybernetics.

[3] Dieter Fox,et al. RGB-D mapping: Using Kinect-style depth cameras for dense 3D modeling of indoor environments , 2012, Int. J. Robotics Res..

[4] Dar-Ren Chen,et al. Watershed segmentation for breast tumor in 2-D sonography. , 2004, Ultrasound in medicine & biology.

[5] C. J. Taylor,et al. Segmentation and Analysis of RGB-D data , 2010 .

[6] Lei Zhang,et al. Active contours with selective local or global segmentation: A new formulation and level set method , 2010, Image Vis. Comput..

[7] Yann LeCun,et al. Indoor Semantic Segmentation using depth information , 2013, ICLR.

[8] Daniel P. Huttenlocher,et al. Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[9] Frank Dellaert,et al. Planar Segmentation of RGBD Images Using Fast Linear Fitting and Markov Chain Monte Carlo , 2012, 2012 Ninth Conference on Computer and Robot Vision.

[10] Song-Chun Zhu,et al. What are Textons? , 2005 .

[11] Michael M. Richter,et al. Color image segmentation guided by a color gradient network , 2007, Pattern Recognit. Lett..

[12] Rui Seara,et al. Image segmentation by histogram thresholding using fuzzy sets , 2002, IEEE Trans. Image Process..

[13] Jitendra Malik,et al. Recovering human body configurations: combining segmentation and recognition , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[14] Pascal Fua,et al. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] B. S. Manjunath,et al. Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[16] N. Kingsbury. Complex Wavelets for Shift Invariant Analysis and Filtering of Signals , 2001 .

[17] Ting Yu,et al. Simultaneous image segmentation and 3D plane fitting for RGB-D sensors — An iterative framework , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[18] Anil K. Jain,et al. Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[19] Thorsten Joachims,et al. Semantic Labeling of 3D Point Clouds for Indoor Scenes , 2011, NIPS.

[20] Vladimir Kolmogorov,et al. An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[21] Lei Zhang,et al. Active contours driven by local image fitting energy , 2010, Pattern Recognit..

[22] Anton Osokin,et al. Fast Approximate Energy Minimization with Label Costs , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23] Siddheswar Ray,et al. Determination of Number of Clusters in K-Means Clustering and Application in Colour Image Segmentation , 2000 .

[24] Dieter Fox,et al. RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[25] Jianbo Shi,et al. Spectral segmentation with multiscale graph decomposition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[26] Sebastian Nowozin,et al. Higher-Order Correlation Clustering for Image Segmentation , 2011, NIPS.

[27] Jan-Michael Frahm,et al. Exploring High-Level Plane Primitives for Indoor 3D Reconstruction with a Hand-held RGB-D Camera , 2012, ACCV Workshops.

[28] Jitendra Malik,et al. Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[29] Ling Shao,et al. Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[30] Josef Kittler,et al. Histogram-based segmentation in a perceptually uniform color space , 1998, IEEE Trans. Image Process..

[31] Serge Beucher,et al. THE WATERSHED TRANSFORMATION APPLIED TO IMAGE SEGMENTATION , 2009 .

[32] Yiannis Aloimonos,et al. Segmenting “simple” objects using RGB-D , 2012, 2012 IEEE International Conference on Robotics and Automation.

[33] Paria Mehrani,et al. Superpixels and Supervoxels in an Energy Optimization Framework , 2010, ECCV.

[34] Gareth Funka-Lea,et al. Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[35] Charless C. Fowlkes,et al. Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36] Andrew Blake,et al. "GrabCut" , 2004, ACM Trans. Graph..

[37] Olga Veksler,et al. Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[38] Bernhard G Herrmann,et al. Segmentation in vertebrates: clock and gradient finally joined. , 2004, Genes & development.

[39] Umar Mohammed,et al. Superpixel lattices , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[40] Dieter Fox,et al. RGB-D Object Recognition: Features, Algorithms, and a Large Scale Benchmark , 2013, Consumer Depth Cameras for Computer Vision.

[41] Shih-Fu Chang,et al. Segmentation using superpixels: A bipartite graph partitioning approach , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[42] Alexei A. Efros,et al. Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[43] Michael Werman,et al. The Quadratic-Chi Histogram Distance Family , 2010, ECCV.

[44] Xiaojin Gong,et al. Rock detection via superpixel graph cuts , 2012, 2012 19th IEEE International Conference on Image Processing.

[45] Michael Werman,et al. Fusing Time-of-Flight Depth and Color for Real-Time Segmentation and Tracking , 2009, Dyn3D.

[46] Cordelia Schmid,et al. Constructing models for content-based image retrieval , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[47] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[48] Ling Shao,et al. Learning Discriminative Representations from RGB-D Video Data , 2013, IJCAI.

[49] Sheng Tang,et al. Accurate Estimation of Human Body Orientation From RGB-D Sensors , 2013, IEEE Transactions on Cybernetics.

[50] Sven J. Dickinson,et al. TurboPixels: Fast Superpixels Using Geometric Flows , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51] Xuelong Li,et al. Depth-Aware Image Seam Carving , 2013, IEEE Transactions on Cybernetics.

[52] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[53] Stefano Soatto,et al. Really Quick Shift: Image Segmentation on a GPU , 2010, ECCV Workshops.

[54] Sven Behnke,et al. Real-Time Plane Segmentation Using RGB-D Cameras , 2012, RoboCup.

[55] Kun Li,et al. 3-D Geometry Enhanced Superpixels for RGB-D Data , 2013, PCM.

[56] Tien Yin Wong,et al. Efficient optic cup localization based on superpixel classification for glaucoma diagnosis in digital fundus images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[57] Amar Mitiche,et al. Multiregion Image Segmentation by Parametric Kernel Graph Cuts , 2011, IEEE Transactions on Image Processing.

[58] P. Nagabhushan,et al. Depth-wise segmentation of 3D images using dense depth maps , 2013 .

[59] Stefano Soatto,et al. Quick Shift and Kernel Methods for Mode Seeking , 2008, ECCV.

[60] Allen M. Waxman,et al. Adaptive 3-D Object Recognition from Multiple Views , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[61] Stefano Soatto,et al. Class segmentation and object localization with superpixel neighborhoods , 2009, 2009 IEEE 12th International Conference on Computer Vision.