Correlated Topic Models for Image Retrieval

In our previous work [4] we have shown that the representation of images by the Latent Dirichlet Allocation (LDA) model combined with an appropriate similarity measure is suitable for performing large-scale image retrieval in a realworld database. The LDA model, however, relies on the assumption that all topics are independent of each other – something that is obviously not true in most cases. In this work we study a recently proposed model, the Correlated Topic Model (CTM) [1], in the context of large-scale image retrieval. This approach is able to explicitly model such correlations of topics. We experimentally evaluate the proposed retrieval approach on a real-world large-scale database consisting of more than 246,000 images and compare the performance to related approaches.

[1]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[4]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[5]  Rainer Lienhart,et al.  Image retrieval on large-scale image databases , 2007, CIVR '07.

[6]  Rainer Lienhart,et al.  PLSA on Large Scale Image Databases , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[7]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.