Simplifying Video Editing with Intelligent Interaction

Digital video is becoming increasingly ubiquitous. However, editing video remains difficult for several reasons: video is a time-based medium, it has dual tracks of audio and video, and current tools force users to work at the smallest level of detail. In this paper, we describe several visualization and interaction techniques that use video metadata, including a transcript, to mitigate the problems of editing in this domain. We implemented these techniques in Silver, an authoring tool designed to make it easier for novice users to edit video. To help users visualize video, Silver provides multiple views with different semantic content and at different levels of abstraction, including storyboard, editable transcript, and timeline views. Silver offers intelligent editing operations that help users resolve the inconsistencies that arise because of the different boundaries in audio and video. We conducted a preliminary user study to investigate the effectiveness of the system. Participants successfully edited video after only a short tutorial, both with and without intelligent editing assistance. Our research suggests several ways in which video editing tools could use metadata to assist users in the reuse and composition of video.
