Alternative Surrogates for Video Objects in a Digital Library: Users' Perspectives on Their Relative Usability

In a digital environment, it is feasible to integrate multimedia materials into a library collection with ease. However, it seems likely that nontextual surrogates for multimedia objects, e.g., videos, could effectively augment textual representations of those objects. In this study, five video surrogates were evaluated in relation to their usefulness and usability in accomplishing specific tasks. The surrogates (storyboards with text or audio keywords, slide shows with text or audio keywords, fast forward) were created for each of seven video segments. Ten participants, all of whom watch videos at least monthly and search for videos at least occasionally, viewed the surrogates for seven video segments and provided comments about the strengths and weaknesses of each. In addition, they performed a series of tasks (gist determination, object recognition, action recognition, and visual gist determination) with three surrogates selected from those available. No surrogate was universally judged "best," but the fast forward surrogate garnered the most support, particularly from experienced video users. The participants expressed their understanding of video gist as composed of three components: topicality, the story of the video, and the visual gist of the video. They identified several real-world tasks for which they regularly use video collections. The viewing compaction rates used in these surrogates supported adequate performance, but users expressed a desire for more control over surrogate speed and sequencing. Further development of these surrogates is warranted by these results, as well as the development of mechanisms for surrogate display.

[1]  Boon-Lock Yeo,et al.  Retrieving and visualizing video , 1997, CACM.

[2]  Mary Czerwinski,et al.  Data mountain: using spatial memory for document management , 1998, UIST '98.

[3]  Abby Goodrum Evaluation of Text-Based and Image-Based Representations for Moving Image Documents , 1997 .

[4]  Walter Kintsch,et al.  Toward a model of text comprehension and production. , 1978 .

[5]  Gary Marchionini,et al.  Key frame preview techniques for video browsing , 1998, DL '98.

[6]  Michael Mills,et al.  A magnifier tool for video data , 1992, CHI.

[7]  Yoshinobu Tonomura,et al.  VideoMAP and VideoSpaceIcon: tools for anatomizing video content , 1993, INTERCHI.

[8]  Takeo Kanade,et al.  Video skimming and characterization through the combination of image and language understanding , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[9]  Gary Marchionini,et al.  Dynamic key frame presentation techniques for augmenting video browsing , 1998, AVI '98.

[10]  Brian C. O'Connor,et al.  Access to Moving Image Documents: Background Concepts and Proposals for Surrogates for film and video Works , 1985, J. Documentation.

[11]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[12]  Andrew Heybey,et al.  I/Browse: the Bellcore video library tool kit , 1996, Electronic Imaging.

[13]  Mark T. Maybury Intelligent multimedia information retrieval , 1997 .

[14]  Gary Marchionini,et al.  Multimodal surrogates for video browsing , 1999, DL '99.

[15]  Laurence B. Heilprin Toward foundations of information science , 1985 .

[16]  Harold Borko,et al.  Abstracting Concepts and Methods , 1975 .

[17]  Abby Goodrum,et al.  Multidimensional scaling of video surrogates , 2001, J. Assoc. Inf. Sci. Technol..

[18]  Gary Marchionini,et al.  Comprehension and Object Recognition Capabilities for Presentations of Simultaneous Video Key Frame Surrogates , 1997, ECDL.

[19]  William S. Cooper,et al.  A definition of relevance for information retrieval , 1971, Inf. Storage Retr..

[20]  Spencer W. Thomas Lessons learned from building a terabyte digital video library , 1999 .

[21]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[22]  Gary Marchionini,et al.  Agileviews: A Human-Centered Framework for Interfaces to Information Spaces. , 2000 .

[23]  Anoop Gupta,et al.  Browsing digital video , 2000, CHI.

[24]  Michael G. Christel,et al.  Improving Access to a Digital Video Library , 1997, INTERACT.

[25]  Mark E. Rorvig A method for automatically abstracting visual documents , 1993 .

[26]  Michael G. Christel,et al.  Evolving video skims into useful multimedia abstractions , 1998, CHI.

[27]  “ Previewing Video Data : Browsing Key Frames Rates Using a Video Slide Show Interface , .

[28]  Edward Lee Elliott Watch-grab-arrange-see : thinking with motion images via streams and collages , 1993 .

[29]  Yihong Gong,et al.  Lessons Learned from Building a Terabyte Digital Video Library , 1999, Computer.

[30]  James M. Turner,et al.  Determining the subject content of still and moving image documents for storage and retrieval : an experimental investigation , 1994 .

[31]  Jakob Nielsen,et al.  Measuring usability: preference vs. performance , 1994, CACM.

[32]  Michael A. Smith,et al.  Video skimming and characterization through the combination of image and language understanding techniques , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[33]  Alan F. Smeaton Content-based access to digital video: the Físchlár system and the TREC video track , 2001 .