Lexical Mediation for Ontology-Based Annotation of Multimedia

In the last decade, the annotation of multimedia has evolved toward the use of ontologies as a way to bridge the semantic gap between the low-level features of media objects and high-level concepts. In many cases, the annotation terms refer to structured ontologies. Such ontologies, however, are often lightweight, domain-oriented knowledge bases, whereas the use of wide-coverage commonsense ontologies would improve interoperability and knowledge sharing, with beneficial effects on search and navigation. In this chapter, we present an approach to the semantic annotation of media objects based on meaning negotiation, which uses natural language lexical terms as the interface and employs large-scale commonsense ontologies. As a test case, we apply the annotation to narrative media objects, using a meta-ontology, called Drammar, to describe their structure. We present the annotation schema, the software architecture for integrating several large-scale ontologies, and the lexical interface for negotiating the ontological term. We also describe an evaluation of the proposed approach, conducted through experiments with annotators.
