Ontological Inference Framework with Joint Ontology Construction and Learning for Image Understanding

Lack of human prior knowledge is one of the main reasons that semantic gap still remains when it comes to automatic multimedia understanding. In this work, we exploit the ontological structure of target concepts and propose an universal ontological inference framework for image understanding. The framework explicitly utilizes subclass and co-occurrence relation to effectively refine the coarse concept detections. Moreover, we show how to automatically construct and learn the underlying ontology required by the framework. As can be shown by experiments, the result is an effective and robust algorithm that characterizes well the structure of the target concepts and outperforms the state-of-the-art methods.

[1]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[2]  Tao Mei,et al.  Building a comprehensive ontology to refine video concept detection , 2007, MIR '07.

[3]  Dan Roth,et al.  Learning and Inference over Constrained Output , 2005, IJCAI.

[4]  Tao Mei,et al.  Correlative multi-label video annotation , 2007, ACM Multimedia.

[5]  Cordelia Schmid,et al.  Semantic Hierarchies for Visual Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Jianping Fan,et al.  Incorporating concept ontology to enable probabilistic concept reasoning for multi-level image annotation , 2006, MIR '06.

[7]  Thomas Hofmann,et al.  Support vector machine learning for interdependent and structured output spaces , 2004, ICML.

[8]  Sunita Sarawagi,et al.  Discriminative Methods for Multi-labeled Classification , 2004, PAKDD.

[9]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Xian-Sheng Hua,et al.  Two-Dimensional Multilabel Active Learning with an Efficient Online Adaptation Model for Image Classification , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[12]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[13]  Ming-Wei Chang,et al.  Learning and Inference with Constraints , 2008, AAAI.