论文信息 - Task-Specific Image Partitioning

Task-Specific Image Partitioning

Image partitioning is an important preprocessing step for many of the state-of-the-art algorithms used for performing high-level computer vision tasks. Typically, partitioning is conducted without regard to the task in hand. We propose a task-specific image partitioning framework to produce a region-based image representation that will lead to a higher task performance than that reached using any task-oblivious partitioning framework and existing supervised partitioning framework, albeit few in number. The proposed method partitions the image by means of correlation clustering, maximizing a linear discriminant function defined over a superpixel graph. The parameters of the discriminant function that define task-specific similarity/dissimilarity among superpixels are estimated based on structured support vector machine (S-SVM) using task-specific training data. The S-SVM learning leads to a better generalization ability while the construction of the superpixel graph used to define the discriminant function allows a rich set of features to be incorporated to improve discriminability and robustness. We evaluate the learned task-aware partitioning algorithms on three benchmark datasets. Results show that task-aware partitioning leads to better labeling performance than the partitioning computed by the state-of-the-art general-purpose and supervised partitioning algorithms. We believe that the task-specific image partitioning paradigm is widely applicable to improving performance in high-level image understanding tasks.

[1] Stephen Gould,et al. Single image depth estimation from predicted semantic labels , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2] Tomer Hertz,et al. Pairwise Clustering and Graphical Models , 2003, NIPS.

[3] Jianbo Shi,et al. Learning spectral graph segmentation , 2005, AISTATS.

[4] Sang Uk Lee,et al. Learning full pairwise affinities for spectral segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5] Alexei A. Efros,et al. Recovering Surface Layout from an Image , 2007, International Journal of Computer Vision.

[6] Michael Werman,et al. Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[7] Stefano Soatto,et al. Quick Shift and Kernel Methods for Mode Seeking , 2008, ECCV.

[8] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9] 智一吉田,et al. Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[10] Antonio Criminisi,et al. TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[11] William M. Rand,et al. Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[12] Jianbo Shi,et al. Spectral segmentation with multiscale graph decomposition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13] Pushmeet Kohli,et al. Associative hierarchical CRFs for object class image segmentation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[14] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[15] Jitendra Malik,et al. Learning affinity functions for image segmentation: combining patch-based and gradient-based approaches , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[16] Stefano Soatto,et al. Class segmentation and object localization with superpixel neighborhoods , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17] Marina Meila,et al. Comparing clusterings: an axiomatic view , 2005, ICML.

[18] Daphne Koller,et al. Efficiently selecting regions for scene understanding , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19] H. Sebastian Seung,et al. Maximin affinity learning of image segmentation , 2009, NIPS.

[20] Michael I. Jordan,et al. Learning Spectral Clustering , 2003, NIPS.

[21] Thorsten Joachims,et al. Supervised clustering with support vector machines , 2005, ICML.

[22] Hossein Mobahi,et al. Natural Image Segmentation with Adaptive Texture and Boundary Encoding , 2009, ACCV.

[23] Christopher K. I. Williams,et al. Pascal Visual Object Classes Challenge Results , 2005 .

[24] Jitendra Malik,et al. Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[25] Thorsten Joachims,et al. Training structural SVMs when exact inference is intractable , 2008, ICML '08.

[26] Paria Mehrani,et al. Superpixels and Supervoxels in an Energy Optimization Framework , 2010, ECCV.

[27] Jitendra Malik,et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[28] Charless C. Fowlkes,et al. Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29] M. R. Rao,et al. The partition problem , 1993, Math. Program..

[30] Jianguo Zhang,et al. The PASCAL Visual Object Classes Challenge , 2006 .

[31] Anthony Wirth,et al. Correlation Clustering , 2010, Encyclopedia of Machine Learning and Data Mining.

[32] Fernando Pereira,et al. Structured Learning with Approximate Inference , 2007, NIPS.

[33] Stephen Gould,et al. Decomposing a scene into geometric and semantically consistent regions , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[34] Eric P. Xing,et al. Polyhedral outer approximations with application to natural language parsing , 2009, ICML '09.

[35] Alexei A. Efros,et al. Improving Spatial Support for Objects via Multiple Segmentations , 2007, BMVC.

[36] Xavier Cufí,et al. Yet Another Survey on Image Segmentation: Region and Boundary Information Integration , 2002, ECCV.

[37] Ben Taskar,et al. Learning structured prediction models: a large margin approach , 2005, ICML.

[38] Tsuhan Chen,et al. Learning class-specific affinities for image labelling , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[39] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[40] Pushmeet Kohli,et al. Robust Higher Order Potentials for Enforcing Label Consistency , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[41] Ben Taskar,et al. Talking pictures: Temporal grouping and dialog-supervised person recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[42] Dorin Comaniciu,et al. Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[43] Sebastian Nowozin,et al. Solution stability in linear programming relaxations: graph partitioning and unsupervised learning , 2009, ICML '09.

[44] Andrea Vedaldi,et al. Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.