Convex Optimization for Scene Understanding

In this paper we give a convex optimization approach for scene understanding. Since segmentation, object recognition and scene labeling strongly benefit from each other we propose to solve these tasks within a single convex optimization problem. In contrast to previous approaches we do not rely on pre-processing techniques such as object detectors or super pixels. The central idea is to integrate a hierarchical label prior and a set of convex constraints into the segmentation approach, which combine the three tasks by introducing high-level scene information. Instead of learning label co-occurrences from limited benchmark training data, the hierarchical prior comes naturally with the way humans see their surroundings.

[1]  Jing Yuan,et al.  Convex Multi-class Image Labeling by Simplex-Constrained Total Variation , 2009, SSVM.

[2]  Daniel Cremers,et al.  A Convex Approach to Minimal Partitions , 2012, SIAM J. Imaging Sci..

[3]  Horst Bischof,et al.  Efficient Minimization of the Non-local Potts Model , 2011, SSVM.

[4]  Jing Yuan,et al.  TV-Based Multi-Label Image Segmentation with Label Cost Prior , 2010, BMVC.

[5]  Lena Gorelick,et al.  Minimizing Energies with Hierarchical Costs , 2012, International Journal of Computer Vision.

[6]  Daniel Cremers,et al.  A Survey and Comparison of Discrete and Continuous Multi-label Optimization Approaches for the Potts Model , 2013, International Journal of Computer Vision.

[7]  Daniel Cremers,et al.  A Survey and Comparison of Discrete and Continuous Multilabel Segmentation Approaches , 2013 .

[8]  Pushmeet Kohli,et al.  Graph Cut Based Inference with Co-occurrence Statistics , 2010, ECCV.

[9]  Jan-Michael Frahm,et al.  Fast Global Labeling for Real-Time Stereo Using Multiple Plane Sweeps , 2008, VMV.

[10]  Antonin Chambolle,et al.  A First-Order Primal-Dual Algorithm for Convex Problems with Applications to Imaging , 2011, Journal of Mathematical Imaging and Vision.

[11]  Daniel Cremers,et al.  Convex Relaxation for Multilabel Problems with Product Label Spaces , 2010, ECCV.

[12]  Kang Zhao Approximation from locally finite-dimensional shift-invariant spaces , 1996 .

[13]  Daniel Cremers,et al.  A Co-occurrence Prior for Continuous Multi-label Optimization , 2013, EMMCVPR.

[14]  Daniel Cremers,et al.  Tight convex relaxations for vector-valued labeling problems , 2011, 2011 International Conference on Computer Vision.

[15]  Hao Su,et al.  Objects as Attributes for Scene Classification , 2010, ECCV Workshops.

[16]  Antonin Chambolle,et al.  Diagonal preconditioning for first order primal-dual algorithms in convex optimization , 2011, 2011 International Conference on Computer Vision.

[17]  L. Ambrosio,et al.  Functions of Bounded Variation and Free Discontinuity Problems , 2000 .

[18]  Philip H. S. Torr,et al.  What, Where and How Many? Combining Object Detectors and CRFs , 2010, ECCV.

[19]  Hao Su,et al.  Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.

[20]  Alan L. Yuille,et al.  Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[22]  Sanja Fidler,et al.  Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.