Latent topic model for image annotation by modeling topic correlation

For the task of image annotation, traditional probabilistic topic models based on Latent Dirichlet Allocation (LDA) [1], assume that an image is a mixture of latent topics. An inevitable limitation of LDA is the inability to model topic correlation since topic proportions of an image are generated independently. Motivated by Correlated Topic Model (CTM) [2] which derives from natural language processing to model topic correlation of a document, we extend the popular LDA based models (corrLDA [3], sLDA-bin [4], trmmLDA [5]) to CTM based models (corrCTM, sCTM-bin, trmmCTM). We present a comprehensive comparison between CTM based and LDA based models on three benchmark datasets, illustrating the superior annotation performance of proposed CTM based models, by means of propagating topic correlation among image features and annotation words.

[1]  Chong Wang,et al.  Simultaneous image classification and annotation , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Samy Bengio,et al.  A Discriminative Kernel-Based Approach to Rank Images from Text Queries , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Hagai Attias,et al.  Topic regression multi-modal Latent Dirichlet Allocation for image annotation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Cordelia Schmid,et al.  TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  Yang Yu,et al.  Automatic image annotation using group sparsity , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Chong-Wah Ngo,et al.  A revisit of Generative Model for Automatic Image Annotation using Markov Random Fields , 2009, CVPR.

[7]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[8]  Hagai Attias,et al.  Supervised topic model for automatic image annotation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Chong Wang,et al.  Simultaneous image classification and annotation , 2009, CVPR.

[10]  C. V. Jawahar,et al.  Image Annotation Using Metric Learning in Semantic Neighbourhoods , 2012, ECCV.

[11]  R. Lienhart,et al.  Correlated Topic Models for Image Retrieval , 2008 .

[12]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[13]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[14]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[15]  Zhi-Hua Zhou,et al.  Learning a distance metric from multi-instance multi-label data , 2009, CVPR.