Addressing the challenge of visual information access from digital image and video libraries

While it would seem that digital video libraries should benefit from access mechanisms directed to their visual contents, years of TREC video retrieval evaluation (TRECVID) research have shown that text search against transcript narrative text provides almost all the retrieval capability, even with visually oriented generic topics. A within-subjects study involving 24 novice participants on TRECVID 2004 tasks again confirms this result. The study shows that satisfaction is greater and performance is significantly better on specific and generic information retrieval tasks from news broadcasts when transcripts are available for search. Additional runs with 7 expert users reveal different novice and expert interaction patterns with the video library interface, helping explain the novices' lack of success with image search and visual feature browsing for visual information needs. Analysis of TRECVID visual features well suited for particular tasks provides additional insights into the role of automated feature classification for digital image and video libraries

[1]  Alan F. Smeaton,et al.  Designing the User Interface for the Físchlár Digital Video Library , 2006, J. Digit. Inf..

[2]  James C. French,et al.  An application of multiple viewpoints to content-based image retrieval , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[3]  Marcel Worring,et al.  Interaction in Content-Based Image Retrieval: The Evaluation of the State-of-the-Art Review , 2000, VISUAL.

[4]  Alexander G. Hauptmann,et al.  Successful approaches in the TREC video retrieval evaluations , 2004, MULTIMEDIA '04.

[5]  Jakob Nielsen,et al.  Heuristic Evaluation of Prototypes (individual) , 2022 .

[6]  Kerry Rodden,et al.  How do people manage their digital photographs? , 2003, CHI '03.

[7]  Michael G. Christel,et al.  Finding the right shots: assessing usability and performance of a digital video library interface , 2004, MULTIMEDIA '04.

[8]  Kevin Li,et al.  Faceted metadata for image search and browsing , 2003, CHI '03.

[9]  Eero Sormunen,et al.  End-User Searching Challenges Indexing Practices in the Digital Newspaper Photo Archive , 2004, Information Retrieval.

[10]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Joemon M. Jose,et al.  An adaptive technique for content-based image retrieval , 2006, Multimedia Tools and Applications.

[12]  Stéphane Marchand-Maillet,et al.  Managing video collections at large , 2004, CVDB '04.

[13]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[14]  Peter G. B. Enser Pictorial information retrieval , 1995 .

[15]  Sara Shatford,et al.  Analyzing the Subject of a Picture: A Theoretical Approach , 1986 .

[16]  Jakob Nielsen,et al.  Usability inspection methods , 1994, CHI 95 Conference Companion.

[17]  Peter G. B. Enser,et al.  Retrieval of Archival Moving Imagery - CBIR Outside the Frame? , 2002, CIVR.

[18]  Joemon M. Jose,et al.  Spatial querying for image retrieval: a user-oriented evaluation , 1998, SIGIR '98.

[19]  Ching-chih Chen,et al.  Automated semantic annotation and retrieval based on sharable ontology and case-based learning techniques , 2003, 2003 Joint Conference on Digital Libraries, 2003. Proceedings..

[20]  Tno Tpd TRECVID 2004 - An Introduction , 2004 .

[21]  Ben Shneiderman,et al.  Clarifying Search: A User-Interface Framework for Text Searches , 1997, D Lib Mag..

[22]  Gary Marchionini,et al.  The relative effectiveness of concept-based versus content-based video retrieval , 2004, MULTIMEDIA '04.