Multiple Ontology-Based Indexing of Multimedia Documents on the World Wide Web

In order to cope with the growing need to search multimedia documents with precision on the Web, we propose a multimedia conceptual indexing framework incorporating semantic relations between annotation words. To do this, we utilize our DOM Tree-based Webpage segmentation algorithm to automatically extract surrounding textual information of the multimedia documents in Webpages. Next, we employ knowledge represented in multiple ontologies to discover the latent semantic dimensions of the surrounding textual information. As a consequence, indexes (represented as semantic networks) are constructed where nodes of each network capture words that exist in the ontologies and edges represent the semantic relations that hold between those words. To address the semantic heterogeneity problem between the produced networks, we employ a multi-level merging algorithm that combines heterogeneous networks into a more coherent network. Additionally, we utilize concept-relatedness measures to address the issue of unrecognized entities by the ontologies. We evaluate the techniques of the proposed framework using three different multimedia dataset types. Experimental results indicate that the proposed techniques are effective and precise.

[1]  Gerhard Weikum,et al.  SOFIE: a self-organizing framework for information extraction , 2009, WWW '09.

[2]  Yongdong Zhang,et al.  Image Search Reranking With Query-Dependent Click-Based Relevance Feedback , 2014, IEEE Transactions on Image Processing.

[3]  Umar Manzoor,et al.  Ontology based image retrieval , 2012, 2012 International Conference for Internet Technology and Secured Transactions.

[4]  Christoph Meinel,et al.  Content Based Lecture Video Retrieval Using Speech and Video Text Information , 2014, IEEE Transactions on Learning Technologies.

[5]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[6]  Xuelong Li,et al.  Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search , 2013, IEEE Transactions on Image Processing.

[7]  Jer Lang Hong,et al.  Webpage segmentation for extracting images and their surrounding contextual information , 2009, MM '09.

[8]  Karn Patanukhom,et al.  Key frame extraction for text based video retrieval using Maximally Stable Extremal Regions , 2015, 2015 1st International Conference on Industrial Networks and Intelligent Systems (INISCom).

[9]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[10]  Mohammed Maree,et al.  A Coupled Statistical/Semantic Framework for Merging Heterogeneous Domain-Specific Ontologies , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[11]  Mohammed Maree,et al.  Addressing semantic heterogeneity through multiple knowledge base assisted merging of domain-specific ontologies , 2015, Knowl. Based Syst..

[12]  Fabio Persia,et al.  Content-Based Multimedia Retrieval , 2015, Data Management in Pervasive Systems.

[13]  Adrian Popescu,et al.  SemRetriev: an ontology driven image retrieval system , 2007, CIVR '07.

[14]  Liang-Tien Chia,et al.  Wikipedia-assisted concept thesaurus for better web media understanding , 2010, MIR '10.