Ontology enhanced web image retrieval: aided by wikipedia & spreading activation theory

Ontology, as an effective approach to bridge the semantic gap in various domains, has attracted a lot of interests from multimedia researchers. Among the numerous possibilities enabled by ontology, we are particularly interested in exploiting ontology for a better understanding of media task (particularly, images) on the World Wide Web. To achieve our goal, two open issues are inevitably involved: 1) How to avoid the tedious manual work for ontology construction? 2) What are the effective inference models when using an ontology? Recent works[11, 16] about ontology learned from Wikipedia has been reported in conferences targeting the areas of knowledge management and artificial intelligent. There are also reports of different inference models being investigated [5, 13, 15]. However, so far there has not been any comprehensive solution. In this paper, we look at these challenges and attempt to provide a general solution to both questions. Through a careful analysis of the online encyclopedia Wikipedia's categorization and page content, we choose it as our knowledge source and propose an automatic ontology construction approach. We prove that it is a viable way to build ontology under various domains. To address the inference model issue, we provide a novel understanding of the ontology and consider it as a type of semantic network, which is similar to brain models in the cognitive research field. Spreading Activation Techniques, which have been proved to be a correct information processing model in the semantic network, are consequently introduced for inference. We have implemented a prototype system with the developed solutions for web image retrieval. By comprehensive experiments on the canine category of the animal kingdom, we show that this is a scalable architecture for our proposed methods.

[1]  Ah-Hwee Tan,et al.  OntoSearch: A Full-Text Search Engine for the Semantic Web , 2006, AAAI.

[2]  John R. Anderson A Spreading Activation Theory of Memory , 1988 .

[3]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[4]  Ian H. Witten,et al.  A knowledge-based search engine powered by wikipedia , 2007, CIKM '07.

[5]  Liang-Tien Chia,et al.  Image retrieval with a multi-modality ontology , 2007, Multimedia Systems.

[6]  Chong-Wah Ngo,et al.  Ontology-enriched semantic space for video search , 2007, ACM Multimedia.

[7]  Cordelia Schmid,et al.  Coloring Local Feature Extraction , 2006, ECCV.

[8]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[9]  Liang-Tien Chia,et al.  Does ontology help in image retrieval?: a comparison between keyword, text ontology and multi-modality ontology approaches , 2006, MM '06.

[10]  Clement T. Yu,et al.  Using semantic contents and WordNet in image retrieval , 1997, SIGIR '97.

[11]  Cordelia Schmid,et al.  Semantic Hierarchies for Visual Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Jianping Fan,et al.  Incorporating concept ontology to enable probabilistic concept reasoning for multi-level image annotation , 2006, MIR '06.

[13]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[14]  Óscar Corcho,et al.  A Semantic Portal for the International Affairs Sector , 2004, EKAW.

[15]  David A. Forsyth,et al.  Animals on the Web , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16]  Rada Mihalcea,et al.  Wikify!: linking documents to encyclopedic knowledge , 2007, CIKM '07.

[17]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[18]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, CVPR Workshops.