论文信息 - Learning Graphs to Model Visual Objects across Different Depictive Styles

Learning Graphs to Model Visual Objects across Different Depictive Styles

Visual object classification and detection are major problems in contemporary computer vision. State-of-art algorithms allow thousands of visual objects to be learned and recognized, under a wide range of variations including lighting changes, occlusion, point of view and different object instances. Only a small fraction of the literature addresses the problem of variation in depictive styles (photographs, drawings, paintings etc.). This is a challenging gap but the ability to process images of all depictive styles and not just photographs has potential value across many applications. In this paper we model visual classes using a graph with multiple labels on each node; weights on arcs and nodes indicate relative importance (salience) to the object description. Visual class models can be learned from examples from a database that contains photographs, drawings, paintings etc. Experiments show that our representation is able to improve upon Deformable Part Models for detection and Bag of Words models for classification.

[1] Martin A. Fischler,et al. The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[2] Timothy F. Cootes,et al. Active Appearance Models , 1998, ECCV.

[3] Daniel Snow,et al. Efficient Deformable Template Detection and Localization without User Initialization , 2000, Comput. Vis. Image Underst..

[4] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[5] Gabriela Csurka,et al. Visual categorization with bags of keypoints , 2002, eccv 2004.

[6] Daniel P. Huttenlocher,et al. Pictorial Structures for Object Recognition , 2004, International Journal of Computer Vision.

[7] Daniel P. Huttenlocher,et al. Spatial priors for part-based recognition using statistical models , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8] Thomas Hofmann,et al. Large Margin Methods for Structured and Interdependent Output Variables , 2005, J. Mach. Learn. Res..

[9] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[10] Yali Amit,et al. POP: Patchwork of Parts Models for Object Recognition , 2007, International Journal of Computer Vision.

[11] Bernt Schiele,et al. Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[12] Eli Shechtman,et al. Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[13] Andrew Zisserman,et al. Image Classification using Random Forests and Ferns , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[14] Andrew J. Davison,et al. Active Matching , 2008, ECCV.

[15] Vladimir Kolmogorov,et al. Feature Correspondence Via Graph Matching: Models and Global Optimization , 2008, ECCV.

[16] Andrew Blake,et al. Multiscale Categorical Object Recognition Using Contour Fragments , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Joseph J. Lim,et al. Recognition using regions , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[18] Cordelia Schmid,et al. Bandit Algorithms for Tree Search , 2007, UAI.

[19] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] Thorsten Joachims,et al. Cutting-plane training of structural SVMs , 2009, Machine Learning.

[21] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[22] Stephen J. McKenna,et al. Classifying Textile Designs Using Bags of Shapes , 2010, 2010 20th International Conference on Pattern Recognition.

[23] Ben Taskar,et al. Cascaded Models for Articulated Pose Estimation , 2010, ECCV.

[24] Stephen J. McKenna,et al. Classifying Textile Designs using Region Graphs , 2010, BMVC.

[25] Andrea Vedaldi,et al. Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[26] Thomas Deselaers,et al. ClassCut for Unsupervised Class Segmentation , 2010, ECCV.

[27] Alexei A. Efros,et al. Data-driven visual similarity for cross-domain image matching , 2011, ACM Trans. Graph..

[28] Yi Yang,et al. Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[29] Yoram Singer,et al. Pegasos: primal estimated sub-gradient solver for SVM , 2011, Math. Program..

[30] Fei-Fei Li,et al. Action Recognition with Exemplar Based 2.5D Graph Matching , 2012, ECCV.

[31] Fei-Fei Li,et al. Object-Centric Spatial Pooling for Image Classification , 2012, ECCV.

[32] Matthieu Guillaumin,et al. Segmentation Propagation in ImageNet , 2012, ECCV.

[33] Jitendra Malik,et al. Multi-component Models for Object Detection , 2012, ECCV.

[34] Jean Ponce,et al. Learning Graphs to Match , 2013, 2013 IEEE International Conference on Computer Vision.

[35] Shaogang Gong,et al. Sketch Recognition by Ensemble Matching of Structured Features , 2013, BMVC.

[36] Jian Dong,et al. Subcategory-Aware Object Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[37] Qi Wu,et al. Modelling Visual Objects Invariant to Depictive Style , 2013, BMVC.

[38] Rui Hu,et al. A performance evaluation of gradient field HOG descriptor for sketch based image retrieval , 2013, Comput. Vis. Image Underst..