Watch-and-comment as a paradigm toward ubiquitous interactive video editing

The literature reports research efforts allowing the editing of interactive TV multimedia documents by end-users. In this article we propose complementary contributions relative to end-user generated interactive video, video tagging, and collaboration. In earlier work we proposed the watch-and-comment (WaC) paradigm as the seamless capture of an individual's comments so that corresponding annotated interactive videos be automatically generated. As a proof of concept, we implemented a prototype application, the WaCTool, that supports the capture of digital ink and voice comments over individual frames and segments of the video, producing a declarative document that specifies both: different media stream structure and synchronization. In this article, we extend the WaC paradigm in two ways. First, user-video interactions are associated with edit commands and digital ink operations. Second, focusing on collaboration and distribution issues, we employ annotations as simple containers for context information by using them as tags in order to organize, store and distribute information in a P2P-based multimedia capture platform. We highlight the design principles of the watch-and-comment paradigm, and demonstrate related results including the current version of the WaCTool and its architecture. We also illustrate how an interactive video produced by the WaCTool can be rendered in an interactive video environment, the Ginga-NCL player, and include results from a preliminary evaluation.

[1]  Leon Cruickshank,et al.  Interacting with Digital Media at Home via a Second Screen , 2007, Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007).

[2]  Ethan V. Munson,et al.  Inkteractors: interacting with digital ink , 2008, SAC '08.

[3]  Dick C. A. Bulterman Using SMIL to encode interactive, peer-level multimedia annotations , 2003, DocEng '03.

[4]  Luiz Fernando Gomes Soares,et al.  Composer: Authoring Tool for iTV Programs , 2008, EuroITV.

[5]  Abigail Sellen,et al.  I saw this and thought of you: some social uses of camera phones , 2005, CHI Extended Abstracts.

[6]  Pablo César,et al.  An Architecture for Non-intrusive User Interfaces for Interactive Digital Television , 2007, EuroITV.

[7]  Herng-Yow Chen,et al.  Exploring media correlation and synchronization for navigated hypermedia documents , 2005, MULTIMEDIA '05.

[8]  Pablo César,et al.  Usages of the Secondary Screen in an Interactive Television Environment: Control, Enrich, Share, and Transfer Television Content , 2008, EuroITV.

[9]  Jakob Nielsen,et al.  Finding usability problems through heuristic evaluation , 1992, CHI.

[10]  Gregory D. Abowd,et al.  The Human Experience , 2002, IEEE Pervasive Comput..

[11]  Rudinei Goularte,et al.  Enhancing Multimodal Annotations with Pen-Based Information , 2007, Ninth IEEE International Symposium on Multimedia Workshops (ISMW 2007).

[12]  Kris Luyten,et al.  Telebuddies: social stitching with interactive television , 2006, CHI EA '06.

[13]  Maria da Graça Campos Pimentel,et al.  Interactive multimedia annotations: enriching and extending content , 2004, DocEng '04.

[14]  Michael Kai Petersen,et al.  Semantic Modelling Using TV-Anytime Genre Metadata , 2007, EuroITV.

[15]  Peter C. Wright,et al.  The use of think-aloud evaluation methods in design , 1991, SGCH.

[16]  David Geerts Comparing voice chat and text chat in a communication tool for interactive television , 2006, NordiCHI '06.

[17]  Pablo César,et al.  Benefits of structured multimedia documents in IDTV: the end-user enrichment system , 2006, DocEng '06.

[18]  David A. Shamma,et al.  Watch what I watch: using community activity to understand content , 2007, MIR '07.

[19]  Jeremy M. Thorne,et al.  Awareness and conversational context-sharing to enrich TV-based communication , 2008, CIE.

[20]  Maria da Graça Campos Pimentel,et al.  Supporting multimedia capture in mobile computing environments through a peer-to-peer platform , 2008, SAC '08.

[21]  Hiroshi Ishii,et al.  mediaBlocks: physical containers, transports, and controls for online media , 1998, SIGGRAPH.

[22]  Abigail Sellen,et al.  Understanding videowork , 2007, CHI.

[23]  Newton Lee,et al.  ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP) , 2007, CIE.

[24]  Mohan S. Kankanhalli,et al.  Metadata handling: A video perspective , 2006, TOMCCAP.

[25]  Maria da Graça Campos Pimentel,et al.  Prototyping Applications to Document Human Experiences , 2007, IEEE Pervasive Computing.

[26]  Pablo César,et al.  An architecture for viewer-side enrichment of TV content , 2006, MM '06.

[27]  Shipeng Li,et al.  Interactive video authoring and sharing based on two-layer templates , 2006, HCM '06.

[28]  Crysta J. Metcalf,et al.  The uses of social television , 2008, CIE.

[29]  Lie Lu,et al.  P-Karaoke: personalized karaoke system , 2004, MULTIMEDIA '04.

[30]  John Minor Ross,et al.  Developing web-based video training modules to aid students learning multimedia skills , 2005 .

[31]  Tim Regan,et al.  Media center buddies: instant messaging around a media center , 2004, NordiCHI '04.

[32]  Rogério Ferreira Rodrigues,et al.  Live editing of hypermedia documents , 2006, DocEng '06.

[33]  James H. Aylor,et al.  Computer for the 21st Century , 1999, Computer.

[34]  Orit Shaer,et al.  The tangible video editor: collaborative video editing with active tokens , 2007, Tangible and Embedded Interaction.

[35]  Patrick Schmitz,et al.  Community annotation and remix: a research platform and pilot deployment , 2006, HCM '06.

[36]  Shingo Uchihashi,et al.  A semi-automatic approach to home video editing , 2000, UIST '00.

[37]  Ravin Balakrishnan,et al.  Fluid interaction techniques for the control and annotation of digital video , 2003, UIST '03.

[38]  Maria da Graça Campos Pimentel,et al.  Ubiquitous Interactive Video Editing Via Multimodal Annotations , 2008, EuroITV.

[39]  Pablo César,et al.  The ambulant annotator: empowering viewer-side enrichment of multimedia content , 2006, DocEng '06.

[40]  Nele Van den Ende,et al.  Towards Content-Aware Coding: User Study , 2007, EuroITV.

[41]  Robert E. Kraut,et al.  Watching together: integrating text chat with video , 2007, CHI.

[42]  Hemmeryckx-DeleersnijderBart,et al.  Awareness and conversational context-sharing to enrich TV-based communication , 2008 .

[43]  Ba Tu Truong,et al.  Video abstraction: A systematic review and classification , 2007, TOMCCAP.