Building Babel: freeing multimedia processing and delivery from hard-coded formats

The amount of multimedia content available via the Internet, and the number of formats in which it is encoded, stored and delivered continues to grow rapidly. So too the number and diversity of the devices and software applications which produce, process and consume such content. This constantly changing landscape presents an increasing challenge to interoperability, since more and more software and hardware must be upgraded as new formats are developed. However, many of the operations performed on multimedia content are similar across coding formats. In recognising this, this thesis proposes several approaches to format-independent media processing, with an emphasis on content delivery. This considerably simplifies interoperability, since support for a new content format may be provided by disseminating a data file, rather than requiring application and device providers to extend and modify their software and hardware. A fundamental requirement for format-independence is the ability to describe the structure of any given format in a way that exposes how it may be fragmented for delivery or processing, and how other data important to the processing (for instance temporal or scalability parameters) can be extracted from the binary data. Several

[1]  H. Schwarz,et al.  Overview of the Scalable H.264/MPEG4-AVC Extension , 2006, 2006 International Conference on Image Processing.

[2]  Hermann Hellwagner,et al.  A knowledge and component based multimedia adaptation framework , 2004, IEEE Sixth International Symposium on Multimedia Software Engineering.

[3]  Peter J. Ashenden,et al.  The Designer's Guide to VHDL , 1995 .

[4]  Alexandros Eleftheriadis,et al.  Constrained and general dynamic rate shaping of compressed digital video , 1995, Proceedings., International Conference on Image Processing.

[5]  Mihaela van der Schaar,et al.  The MPEG-4 fine-grained scalable video coding method for multimedia streaming over IP , 2001, IEEE Trans. Multim..

[6]  Ralf Lämmel,et al.  Towards an engineering discipline for GRAMMARWARE Draft as of August 17 , 2003 , 2003 .

[7]  Fernando Pereira,et al.  MPEG-A: multimedia application formats , 2005, IEEE MultiMedia.

[8]  Jaehwan Kim,et al.  RTP Payload Format for MPEG-4 Audio/Visual Streams , 2011, RFC.

[9]  Benoit Huet,et al.  Toward emotion indexing of multimedia excerpts , 2008, 2008 International Workshop on Content-Based Multimedia Indexing.

[10]  Silvia Pfeiffer The Ogg Encapsulation Format Version 0 , 2003, RFC.

[11]  Anthony Vetro,et al.  MPEG-21 digital item adaptation: enabling universal multimedia access , 2004, IEEE MultiMedia.

[12]  Nicola Guarino,et al.  Sweetening Ontologies with DOLCE , 2002, EKAW.

[13]  Ulrich H. Reimers,et al.  DVB-The Family of International Standards for Digital Video Broadcasting , 2004, Proceedings of the IEEE.

[14]  Hermann Hellwagner,et al.  Automatic adaptation of streaming multimedia content in a dynamic and distributed environment , 2005, IEEE International Conference on Image Processing 2005.

[15]  Christian Timmerer,et al.  Digital item adaptation: overview of standardization and research activities , 2005, IEEE Transactions on Multimedia.

[16]  Dragan Gasevic,et al.  Bridging knowledge bases' heterogeneity using XML/XSLT approach , 2005, 2005 IEEE International Conference on e-Technology, e-Commerce and e-Service.

[17]  Henning Schulzrinne,et al.  RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.

[18]  Jerry D. Gibson,et al.  Structures for SNR scalable speech coding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Wesley De Neve,et al.  An optimized MPEG-21 BSDL framework for the adaptation of scalable bitstreams , 2007, J. Vis. Commun. Image Represent..

[20]  Alexandros Eleftheriadis,et al.  Flavor: a language for media representation , 1997, MULTIMEDIA '97.

[21]  H. Lan,et al.  SWRL : A semantic Web rule language combining OWL and ruleML , 2004 .

[22]  Hermann Hellwagner,et al.  Generic Streaming of Multimedia Content , 2005, EuroIMSA.

[23]  Vivek K. Goyal,et al.  Robust low-delay audio coding using multiple descriptions , 2005, IEEE Transactions on Speech and Audio Processing.

[24]  André Kaup,et al.  An MPEG-7 tool for compression and streaming of XML data , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[25]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[26]  Marina Bosi,et al.  Introduction to Digital Audio Coding and Standards , 2004, J. Electronic Imaging.

[27]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[28]  J. Scott Houchin,et al.  File format technology in JPEG 2000 enables flexible use of still and motion sequences , 2002, Signal Process. Image Commun..

[29]  Roch Lefebvre,et al.  The adaptive multirate wideband speech codec (AMR-WB) , 2002, IEEE Trans. Speech Audio Process..

[30]  Bran Selic,et al.  A model-driven approach to content repurposing , 2004, IEEE MultiMedia.

[31]  Liam Murphy,et al.  User-perceived quality-aware adaptive delivery of MPEG-4 content , 2003, NOSSDAV '03.

[32]  M. Amielh,et al.  Bitstream Syntax Description Language: Application of XML-Schema to Multimedia Content , 2002 .

[33]  Peter Fankhauser,et al.  XML data integration with OWL: experiences and challenges , 2004, 2004 International Symposium on Applications and the Internet. Proceedings..

[34]  Wesley De Neve,et al.  BFlavor: A harmonized approach to media resource adaptation, inspired by MPEG-21 BSDL and XFlavor , 2006, Signal Process. Image Commun..

[35]  Mirina Grosz,et al.  World Wide Web Consortium , 2010 .

[36]  Stefanos Kollias,et al.  Multimedia Content and the Semantic Web , 2005, Multimedia Content and the Semantic Web.

[37]  Susanto Rahardja,et al.  A fine granular scalable to lossless audio coder , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[38]  Ian Burnett,et al.  An Introduction to MPEG-21 , 2006 .

[39]  Scott Boag,et al.  XQuery 1.0 : An XML Query Language , 2007 .

[40]  Alexander Eichhorn Modelling dependency in multimedia streams , 2006, MM '06.

[41]  Neel Sundaresan,et al.  Efficient representation and streaming of XML content over the Internet medium , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[42]  Michael J. Witbrock,et al.  An Introduction to the Syntax and Content of Cyc , 2006, AAAI Spring Symposium: Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering.

[43]  Alberto Del Bimbo,et al.  Semantic adaptation of sport videos with user-centred performance analysis , 2006, IEEE Transactions on Multimedia.

[44]  Peter Deutsch,et al.  GZIP file format specification version 4.3 , 1996, RFC.

[45]  Jerry R. Hobbs,et al.  Time in OWL-S , 2004 .

[46]  Rik Van de Walle,et al.  Is That a Fish in Your Ear? A Universal Metalanguage for Multimedia , 2007, IEEE MultiMedia.

[47]  Christian Timmerer,et al.  Transport mechanisms for metadata-driven distributed multimedia adaptation , 2005, 2005 1st International Conference on Multimedia Services Access Networks, 2005. MSAN '05..

[48]  Silvia Pfeiffer,et al.  Annodex: a simple architecture to enable hyperlinking, search & retrieval of time--continuous data on the Web , 2003, MIR '03.

[49]  Stephan Wenger,et al.  H.264/AVC over IP , 2003, IEEE Trans. Circuits Syst. Video Technol..

[50]  Jonathan Robie,et al.  Editors , 2003 .

[51]  Sylvain Devillers An Extension of BSDL for Multimedia Bitstream Syntax Description , 2003, Euro-Par.

[52]  Oliver Becker Streaming Transformations for XML-STX , 2003, XMIDX.

[53]  Bu-Sung Lee,et al.  Event on demand with MPEG-21 video adaptation system , 2006, MM '06.

[54]  Dietmar Jannach,et al.  A Multimedia Adaptation Framework based on Semantic Web Technology , 2005 .

[55]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[56]  Sassan Ahmadi,et al.  On the architecture, operation, and applications of VMR-WB: the new cdma2000 wideband speech coding standard , 2006, IEEE Communications Magazine.

[57]  Hideaki Kimata,et al.  RTP Payload Format for MPEG-4 Audio/Visual Streams , 2000, RFC.

[58]  Joel Waldfogel,et al.  Introduction , 2010, Inf. Econ. Policy.

[59]  Roni Even,et al.  RTP Payload Format for H.264 Video , 2011, RFC.

[60]  王志刚,et al.  Darwin Streaming server的研究与应用 , 2008 .

[61]  YouTube研究会 YouTube活用パーフェクト入門 : broadcast yourself , 2006 .

[62]  Vivek K. Goyal,et al.  RTP Payload Format for MPEG1/MPEG2 Video , 1996, RFC.

[63]  Gordon E. Moore,et al.  Progress in digital integrated electronics , 1975 .

[64]  Benjamin W. Wah,et al.  LSP-based multiple-description coding for real-time low bit-rate voice over IP , 2005, IEEE Transactions on Multimedia.

[65]  Yukari Shirota Applying XML and XSLT techniques to a personalized distance learning system for business mathematical education , 2004, 18th International Conference on Advanced Information Networking and Applications, 2004. AINA 2004..

[66]  Sophie Devillers,et al.  Information Technology-Multimedia Framework (MPEG-21)-Part 7: Digital Item Adaptation , 2006 .

[67]  M. Grube,et al.  Applications of MPEG-4: digital multimedia broadcasting , 2001, IEEE Trans. Consumer Electron..

[68]  Wesley De Neve,et al.  Using Bitstream Structure Descriptions for the Exploitation of Multi-layered Temporal Scalability in H.264/AVC's Base Specification , 2005, PCM.

[69]  Dan Brickley,et al.  Rdf vocabulary description language 1.0 : Rdf schema , 2004 .

[70]  H. Kurokawa,et al.  Adaptive multimedia playout method based on semantic structure of media stream , 2004, IEEE International Symposium on Communications and Information Technology, 2004. ISCIT 2004..

[71]  Stefan Winkler,et al.  Perceived Audiovisual Quality of Low-Bitrate Multimedia Content , 2006, IEEE Transactions on Multimedia.

[72]  Alexandros Eleftheriadis,et al.  XFlavor: bridging bits and objects in media representation , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[73]  Alexandros Eleftheriadis,et al.  Flavor: a formal language for audio-visual object representation , 2004, MULTIMEDIA '04.

[74]  Nicola Cranley,et al.  Incorporating user perception in adaptive video streaming systems , 2006 .

[75]  Lynda Hardman,et al.  That obscure object of desire: multimedia metadata on the Web, Part-1 , 2004, IEEE MultiMedia.

[76]  Christian Timmerer,et al.  Digital Item Adaptation – Coding Format Independence , 2006 .

[77]  R.M. Tol,et al.  TV anytime: STORit on myTV , 2000, 2000 Digest of Technical Papers. International Conference on Consumer Electronics. Nineteenth in the Series (Cat. No.00CH37102).

[78]  C. Michael Sperberg-McQueen,et al.  World Wide Web Consortium , 2009, Encyclopedia of Database Systems.

[79]  Steven J. DeRose,et al.  Xml pointer language (xpointer) , 1998 .

[80]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[81]  Thomas Wiegand,et al.  3GPP compliant adaptive wireless video streaming using H.264/AVC , 2005, IEEE International Conference on Image Processing 2005.

[82]  T. V. Lakshman,et al.  VBR video: tradeoffs and potentials , 1998, Proc. IEEE.

[83]  Steven J. DeRose,et al.  XML Path Language (XPath) , 1999 .

[84]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[85]  Bernd Girod,et al.  Advances in channel-adaptive video streaming , 2002, Proceedings. International Conference on Image Processing.

[86]  Jane Hunter,et al.  Adding Multimedia to the Semantic Web: Building an MPEG-7 ontology , 2001, SWWS.

[87]  Hong Va Leong,et al.  Semantic-based approach to streaming XML contents using Xstream , 2003, Proceedings 27th Annual International Computer Software and Applications Conference. COMPAC 2003.

[88]  Bernd Girod,et al.  Low-complexity rate-distortion optimized video streaming , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[89]  Philip A. Chou,et al.  Rate-distortion optimized streaming of packetized media , 2006, IEEE Transactions on Multimedia.

[90]  Jane Hunter,et al.  Enhancing the semantic interoperability of multimedia through a core ontology , 2003, IEEE Trans. Circuits Syst. Video Technol..

[91]  Gregory K. Wallace,et al.  The JPEG still picture compression standard , 1992 .