2LDA: Segmentation for Recognition

Following the trend of “segmentation for recognition”, we present 2LDA, a novel generative model to automatically segment an image in 2 segments, background and foreground, while inferring a latent Dirichlet allocation (LDA) topic distribution on both segments. The idea is to merge two separate modules, LDA and the segmentation module, explicitly considering (and exchanging) the uncertainty between them. The resulting model adds spatial relationships to LDA, which in turn helps in using the topics to segment an image. The experimental results show that, unlike LDA, our model can be used to recognize objects, and also outperforms the state of the art algorithms.

[1]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[2]  Cordelia Schmid,et al.  Spatial Weighting for Bag-of-Features , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  B. Frey,et al.  Transformation-Invariant Clustering Using the EM Algorithm , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Fei-Fei Li,et al.  Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[8]  Nebojsa Jojic,et al.  LOCUS: learning object classes with unsupervised segmentation , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[9]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.

[10]  N. Jojic,et al.  Capturing image structure with probabilistic index maps , 2004, CVPR 2004.