Introducing Context and Reasoning in Visual Content Analysis: An Ontology-Based Framework

The amount of multimedia content produced and made available on the World Wide Web, and in professional and, not least, personal collections, is constantly growing, resulting in equally increasing needs in terms of efficient and effective ways to access it. Enabling smooth access at a level that meets user expectations and needs has been the holy grail in content-based retrieval for decades as it is intertwined with the so-called semantic gap between the features that can be extracted from such content through automatic analysis and the conveyed meaning as perceived by the end users. Numerous efforts towards more reliable and effective visual content analysis that target the extraction of user-oriented content descriptions have been reported, addressing a variety of domains and applications, and following diverse methodologies. Among the reported literature, knowledge-based approaches utilising explicit, a priori, knowledge constitute a popular choice aiming at analysis methods decoupled from application-specific implementations. Such knowledge may address various aspects including visual characteristics and numerical representations, topological knowledge about the examined domain, contextual knowledge, as well as knowledge driving the selection and execution of the processing steps required. Among the different knowledge representations adopted in the reported literature, ontologies, being the key enabling technology of the Semantic Web (SW) vision for knowledge sharing and reuse through machine processable metadata, have been favoured in recent efforts. Indicative state-of-the-art approaches include, among others, the work presented in Little and Hunter (2004), and Hollink, Little and Hunter (2005), where ontologies have been used to represent objects of the

[1]  Ivan E. Sutherland,et al.  Sketchpad a Man-Machine Graphical Communication System , 1899, Outstanding Dissertations in the Computer Sciences.

[2]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[3]  R M Haralick,et al.  The consistent labeling problem: part I. , 1979, IEEE transactions on pattern analysis and machine intelligence.

[4]  Robert M. Haralick,et al.  The Consistent Labeling Problem: Part I , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Sadaaki Miyamoto,et al.  Fuzzy Sets in Information Retrieval and Cluster Analysis , 1990, Theory and Decision Library.

[6]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[7]  Diane Horton,et al.  Language use in context , 1996, CACM.

[8]  Thomas H. Kolbe,et al.  Constraints for Object Recognition in Aerial Images - Handling of Unobserved Features , 1998, CP.

[9]  Roman Barták,et al.  Constraint Programming: In Pursuit of the Holy Grail , 1999 .

[10]  Anil K. Jain,et al.  Detecting sky and vegetation in outdoor images , 1999, Electronic Imaging.

[11]  K. Ramchandran,et al.  A factor graph framework for semantic indexing and retrieval in video , 2000, 2000 Proceedings Workshop on Content-based Access of Image and Video Libraries.

[12]  Rohini K. Srihari,et al.  Show&Tell: A Semi-Automated Image Annotation System , 2000, IEEE Multim..

[13]  P. Beek,et al.  Text of 15938-5 FCD Information Technology-Multimedia Content Description Interface-Pard 5 Multimedia Description Schemes , 2001 .

[14]  Shih-Fu Chang,et al.  MPEG-7 MDS Content Description Tools and Applications , 2001, CAIP.

[15]  Thomas Sikora,et al.  The MPEG-7 visual standard for content description-an overview , 2001, IEEE Trans. Circuits Syst. Video Technol..

[16]  Giorgos Stamou,et al.  Context-sensitive semantic query expansion , 2002, Proceedings 2002 IEEE International Conference on Artificial Intelligence Systems (ICAIS 2002).

[17]  Nicola Guarino,et al.  Sweetening Ontologies with DOLCE , 2002, EKAW.

[18]  Thomas S. Huang,et al.  Factor graph framework for semantic video indexing , 2002, IEEE Trans. Circuits Syst. Video Technol..

[19]  Krzysztof R. Apt,et al.  Principles of constraint programming , 2003 .

[20]  Antonio Torralba,et al.  Using the Forest to See the Trees: A Graphical Model Relating Features, Objects, and Scenes , 2003, NIPS.

[21]  Jiebo Luo,et al.  Natural object detection in outdoor scenes based on probabilistic spatial context models , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[22]  Otthein Herzog,et al.  Content-based Image Retrieval by Ontology-based Object Recognition , 2004 .

[23]  Bertrand Le Saux,et al.  Image Classifiers for Scene Analysis , 2004, ICCVG.

[24]  Jane Hunter,et al.  Rules-By-Example - A Novel Approach to Semantic Indexing and Querying of Images , 2004, SEMWEB.

[25]  Monique Thonnat,et al.  A Weakly Supervised Approach for Semantic Image Indexing and Retrieval , 2005, CIVR.

[26]  Michael G. Strintzis,et al.  Knowledge-assisted semantic video object detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Jane Hunter,et al.  Evaluating the application of semantic inferencing rules to image annotation , 2005, K-CAP '05.

[28]  Jiebo Luo,et al.  Beyond pixels: Exploiting camera metadata for photo classification , 2005, Pattern Recognit..

[29]  Steffen Staab,et al.  Semantic Annotation of Images and Videos for Multimedia Analysis , 2005, ESWC.

[30]  Jiebo Luo,et al.  Exploiting context for semantic scene classification , 2005 .

[31]  Bernd Neumann,et al.  Scene Interpretation as a Configuration Task , 2005, Künstliche Intell..

[32]  Steffen Staab,et al.  Knowledge representation and semantic annotation of multimedia content , 2006 .

[33]  Steffen Staab,et al.  M-OntoMat-Annotizer: Image Annotation Linking Ontologies and Multimedia Low-Level Features , 2006, KES.

[34]  Bernd Neumann,et al.  On scene interpretation with description logics , 2006, Image Vis. Comput..