What Are You Trying to Say? Format-Independent Semantic-Aware Streaming and Delivery

Users automatically associate many layers of meaning with the media content they consume, yet computers have barely begun to scrape the surface of this information. For example, consider the passage above. The subtle exchange of glances between Elizabeth and her father would be readily apparent to most human observers, but it is unlikely that a computer processing a video of the scene would be able to recognise their meaning. Furthermore, while the double-entendre in Mr Bennett’s remark would be clear to most human listeners, algorithmic recoginition of this or other modes of speech are in their infancy (Paleari & Huet, 2008). Other research communities are developing means to communicate such semantic information (whether computed or manually generated) in ways that are able to transcend the original context of the information.This work—originating from Knowledge Representation, but more popularly known as the Semantic Web—has provided languages such as the Resource Description Framework (RDF) (Beckett, 2004) and Web Ontology Language (OWL) (Dean & Schreiber, 2004) which can be used to express concepts in such a way that “this picture has many buildings” may also imply that “it is a cityscape”, and “it contains man-made objects.” Recent multimedia coding formats developed by MPEG and ITU-T such as Scalable Video Coding (SVC) (ISO/IEC, 2007) and Scalable-to-Lossless Coding (SLS) (ISO/IEC, 2004a) offer the ability to dynamically adapt their bitrate to changing conditions. Current systems perform this adaptation on the basis of static channel parameters such as terminal and network capabilities (Timmerer et al., 2006) or dynamic estimation of channel capacity (Chou, 2006). There are, in fact, numerous examples of using content semantics to identify the best way to adapt content to dynamic conditions: Section 2 describes this in further detail. However, while others have proposed specific semantics to be used in the delivery process, there exists no generic system for connecting arbitrary semantics to the adaptation/delivery process. 17

[1]  Philip A. Chou,et al.  Rate-distortion optimized streaming of packetized media , 2006, IEEE Transactions on Multimedia.

[2]  Peter Fankhauser,et al.  XML data integration with OWL: experiences and challenges , 2004, 2004 International Symposium on Applications and the Internet. Proceedings..

[3]  Wesley De Neve,et al.  BFlavor: A harmonized approach to media resource adaptation, inspired by MPEG-21 BSDL and XFlavor , 2006, Signal Process. Image Commun..

[4]  Bu-Sung Lee,et al.  Event on demand with MPEG-21 video adaptation system , 2006, MM '06.

[5]  C. M. Sperberg-McQueen,et al.  Extensible markup language , 1997 .

[6]  Joseph Thomas-Kerr,et al.  Reconfigurable Media Coding: Self-Describing Multimedia Bitstreams , 2007, 2007 IEEE Workshop on Signal Processing Systems.

[7]  Nicola Cranley,et al.  Incorporating user perception in adaptive video streaming systems , 2006 .

[8]  André Kaup,et al.  An MPEG-7 tool for compression and streaming of XML data , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[9]  Liam Murphy,et al.  User-perceived quality-aware adaptive delivery of MPEG-4 content , 2003, NOSSDAV '03.

[10]  Alberto Del Bimbo,et al.  Semantic adaptation of sport videos with user-centred performance analysis , 2006, IEEE Transactions on Multimedia.

[11]  Alexander Eichhorn Modelling dependency in multimedia streams , 2006, MM '06.

[12]  Joseph Thomas-Kerr,et al.  Building Babel: freeing multimedia processing and delivery from hard-coded formats , 2009 .

[13]  Rik Van de Walle,et al.  Is That a Fish in Your Ear? A Universal Metalanguage for Multimedia , 2007, IEEE MultiMedia.

[14]  Joseph Thomas-Kerr,et al.  Format-Independent Rich Media Delivery Using the Bitstream Binding Language , 2008, IEEE Transactions on Multimedia.

[15]  Christian Timmerer,et al.  Digital Item Adaptation – Coding Format Independence , 2006 .

[16]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[17]  Benoit Huet,et al.  Toward emotion indexing of multimedia excerpts , 2008, 2008 International Workshop on Content-Based Multimedia Indexing.

[18]  Ralf Lämmel,et al.  Towards an engineering discipline for GRAMMARWARE Draft as of August 17 , 2003 , 2003 .

[19]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[20]  Alexandros Eleftheriadis,et al.  XFlavor: bridging bits and objects in media representation , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[21]  Nicola Guarino,et al.  Sweetening Ontologies with DOLCE , 2002, EKAW.

[22]  Scott Boag,et al.  XQuery 1.0 : An XML Query Language , 2007 .

[23]  Michael J. Witbrock,et al.  An Introduction to the Syntax and Content of Cyc , 2006, AAAI Spring Symposium: Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering.

[24]  Joseph Thomas-Kerr,et al.  Enhancing Interoperability via Generic Multimedia Syntax Translation , 2006, 2006 Second International Conference on Automated Production of Cross Media Content for Multi-Channel Distribution (AXMEDIS'06).

[25]  Bernd Girod,et al.  Low-complexity rate-distortion optimized video streaming , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[26]  H. Kurokawa,et al.  Adaptive multimedia playout method based on semantic structure of media stream , 2004, IEEE International Symposium on Communications and Information Technology, 2004. ISCIT 2004..

[27]  Jane Hunter,et al.  Adding Multimedia to the Semantic Web: Building an MPEG-7 ontology , 2001, SWWS.