Probabilistic visual concept trees

This paper presents probabilistic visual concept trees, a model for large visual semantic taxonomy structures and its use in visual concept detection. Organizing visual semantic knowledge systematically is one of the key challenges towards large-scale concept detection, and one that is complementary to optimizing visual classification for individual concepts. Semantic concepts have traditionally been treated as isolated nodes, a densely-connected web, or a tree. Our analysis shows that none of these models are sufficient in modeling the typical relationships on a real-world visual taxonomy, and these relationships belong to three broad categories -- semantic, appearance and statistics. We propose probabilistic visual concept trees for modeling a taxonomy forest with observation uncertainty. As a Bayesian network with parameter constraints, this model is flexible enough to account for the key assumptions in all three types of taxonomy relations, yet it is robust enough to accommodate expansion or deletion in a taxonomy. Our evaluation results on a large web image dataset show that the classification accuracy has considerably improved upon baselines without, or with only a subset of concept relationships

[1]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[2]  John R. Smith,et al.  IBM Research TRECVID-2009 Video Retrieval System , 2009, TRECVID.

[3]  Richard S. Zemel,et al.  Latent topic random fields: Learning using a taxonomy of labels , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  L. Williams,et al.  Contents , 2020, Ophthalmology (Rochester, Minn.).

[5]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[6]  Shih-Fu Chang,et al.  Kernel Sharing With Joint Boosting For Multi-Class Concept Detection , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Rong Yan,et al.  Multi-concept learning with large-scale multimedia lexicons , 2008, 2008 15th IEEE International Conference on Image Processing.