A Novel Context-Aware Topic Model for Category Discovery in Natural Scenes

Automatic category discovery from images is a challenging problem in computer vision community especially from natural scene images due to the great variability in them. This paper proposes a novel context-aware topic model for category discovery in complex natural scenes. The proposed model constructs a generative probabilistic procedure from three-level features consisting of patch, region and the entire image by introducing latent topic variables to every patch and every region. Additionally, a new kind of scene context prior, namely, the spatial preference of categories, is also modeled using only a few parameters to reduce the ambiguity of categories in scene images. By regarding “topics” as “categories”, category discovery is thus converted to the inference of the proposed probabilistic model, which will further be addressed under a Gibbs-EM framework effectively. Experimental results on two benchmark datasets comprising MSRC-v2 and SIFT Flow show its effectiveness and the advantages comparing with other methods.

[1]  Li Fei-Fei,et al.  Towards total scene understanding: Classification, annotation and segmentation in an automatic framework , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  W. Eric L. Grimson,et al.  Spatial Latent Dirichlet Allocation , 2007, NIPS.

[3]  Christos Faloutsos,et al.  Unsupervised modeling of object categories using link analysis techniques , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[5]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Tsuhan Chen,et al.  Semantic-Shift for Unsupervised Object Detection , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[7]  Silvio Savarese,et al.  Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[8]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[9]  Yong Jae Lee,et al.  Object-Graphs for Context-Aware Visual Category Discovery , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Tsuhan Chen,et al.  Unsupervised Image Categorization and Object Localization using Topic Models and Correspondences between Images , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[11]  Yong Jae Lee,et al.  Learning the easy things first: Self-paced visual category discovery , 2011, CVPR 2011.

[12]  Pietro Perona,et al.  Learning Object Categories From Internet Image Searches , 2010, Proceedings of the IEEE.

[13]  Alexei A. Efros,et al.  Using Multiple Segmentations to Discover Objects and their Extent in Image Collections , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Antonio Criminisi,et al.  TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context , 2007, International Journal of Computer Vision.

[15]  Christoph H. Lampert,et al.  Unsupervised Object Discovery: A Comparison , 2010, International Journal of Computer Vision.

[16]  Gang Hua,et al.  Context aware topic model for scene recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Yong Jae Lee,et al.  Foreground Focus: Unsupervised Learning from Partially Matching Images , 2009, International Journal of Computer Vision.

[18]  Zhuowen Tu,et al.  Unsupervised object class discovery via saliency-guided multiple class learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Palaiahnakote Shivakumara,et al.  A Novel Topic-Level Random Walk Framework for Scene Image Co-segmentation , 2014, ECCV.

[20]  Ce Liu,et al.  Unsupervised Joint Object Discovery and Segmentation in Internet Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Fei-Fei Li,et al.  Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[22]  Fei-Fei Li,et al.  Image Segmentation with Topic Random Field , 2010, ECCV.

[23]  Svetlana Lazebnik,et al.  Superparsing , 2010, International Journal of Computer Vision.

[24]  Yong Jae Lee,et al.  Shape discovery from unlabeled image collections , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[26]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[27]  Jianxiong Xiao,et al.  Characterizing Layouts of Outdoor Scenes Using Spatial Topic Processes , 2013, 2013 IEEE International Conference on Computer Vision.

[28]  Gang Hua,et al.  Spatial-DiscLDA for visual recognition , 2011, CVPR 2011.