Content-based video retrieval: Three example systems from TRECVid

The growth in available online video material over the Internet is generally combined with user-assigned tags or content description, which is the mechanism by which we then access such video. However, user-assigned tags have limitations for retrieval and often we want access where the content of the video itself is directly matched against a user's query rather than against some manually assigned surrogate tag. Content-based video retrieval techniques are not yet scalable enough to allow interactive searching on Internet-scale, but the techniques are proving robust and effective for smaller collections. In this article, we show three exemplar systems which demonstrate the state of the art in interactive, content-based retrieval of video shots, and these three are just three of the more than 20 systems developed for the 2007 iteration of the annual TRECVid benchmarking activity. The contribution of our article is to show that retrieving from video using content-based methods is now viable, that it works, and that there are many systems which now do this, such as the three outlined herein. These systems, and others can provide effective search on hundreds of hours of video content and are samples of the kind of content-based search functionality we can expect to see on larger video archives when issues of scale are addressed. © 2008 Wiley Periodicals, Inc. Int J Imaging Syst Technol, 18, 195–201, 2008

[1]  Alan F. Smeaton,et al.  Using score distributions for query-time fusion in multimediaretrieval , 2006, MIR '06.

[2]  Thomas S. Huang,et al.  Relevance feedback in content-based image retrieval: some recent advances , 2002, Inf. Sci..

[3]  Marcel Worring,et al.  Query on demand video browsing , 2007, ACM Multimedia.

[4]  Jenny Benois-Pineau,et al.  The Argos Campaign: Evaluation of Video Analysis Tools , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[5]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[6]  Xuelong Li,et al.  Which Components are Important for Interactive Image Searching? , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Georges Quénot,et al.  Active learning for multimedia , 2007, ACM Multimedia.

[8]  Marcel Worring,et al.  A Learned Lexicon-Driven Paradigm for Interactive Video Retrieval , 2007, IEEE Transactions on Multimedia.

[9]  Sheng Tang,et al.  TRECVID 2007 Search Tasks by NUS-ICT , 2007, TRECVID.

[10]  Chong-Wah Ngo,et al.  Towards optimal bag-of-features for object categorization and semantic video retrieval , 2007, CIVR '07.

[11]  Noel E. O'Connor,et al.  Using Dempster-Shafer Theory to Fuse Multiple Information Sources in Region-Based Segmentation , 2007, 2007 IEEE International Conference on Image Processing.

[12]  Dennis Koelma,et al.  The MediaMill TRECVID 2008 Semantic Video Search Engine , 2008, TRECVID.

[13]  Yiannis Kompatsiaris,et al.  K-Space at TRECvid 2006 , 2006, TRECVID.

[14]  N. Lazarevic-McManus,et al.  Performance evaluation in visual surveillance using the F-measure , 2006, VSSN '06.