Interactive Searching and Browsing of Video Archives: Using Text and Using Image Matching

Over the last number of decades much research work has been done in the general area of video and audio analysis. Initially the applications driving this included capturing video in digital form and then being able to store, transmit and render it, which involved a large effort to develop compression and encoding standards. The technology needed to do all this is now easily available and cheap, with applications of digital video processing now commonplace, ranging from CCTV (Closed Circuit TV) for security, to home capture of broadcast TV on home DVRs for personal viewing. One consequence of the development in technology for creating, storing and distributing digital video is that there has been a huge increase in the volume of digital video, and this in turn has created a need for techniques to allow effective management of this video, and by that we mean content management. In the BBC, for example, the archives department receives approximately 500,000 queries per year and has over 350,000 hours of content in its library. Having huge archives of video information is hardly any benefit if we have no effective means of being able to locate video clips which are of relevance to whatever our information needs may be. In this chapter we report our work on developing two specific retrieval and browsing tools for digital video information. Both of these are based on an analysis of the captured video for the purpose of automatically structuring into shots or higher level semantic units like TV news stories. Some also include analysis of the video for the automatic detection of features such as the presence or absence of faces. Both include some elements of searching, where a user specifies a query or information need, and browsing, where a user is allowed to browse through sets of retrieved video shots. We support the presentation of these tools with illustrations of actual video retrieval systems developed and working on hundreds of hours of video content.

[1]  Alan F. Smeaton,et al.  TRECVid 2006 Experiments at Dublin City University , 2012, TRECVID.

[2]  Shih-Fu Chang,et al.  Story boundary detection in large broadcast news video archives: techniques, experience and trends , 2004, MULTIMEDIA '04.

[3]  Alan F. Smeaton,et al.  TRECVID 2004 Experiments in Dublin City University , 2004, TRECVID.

[4]  Paul Over,et al.  TRECVID: evaluating the effectiveness of information retrieval tasks on digital video , 2004, MULTIMEDIA '04.

[5]  Paul Over,et al.  TRECVID 2004 - An Overview , 2004, TRECVID.

[6]  Alan F. Smeaton,et al.  User evaluation outside the lab: the trial of Físchlár-News , 2005 .

[7]  Earl Rennison,et al.  Galaxy of news: an approach to visualizing and understanding expansive news landscapes , 1994, UIST '94.

[8]  Ching-Yung Lin,et al.  Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets , 2003, TRECVID.

[9]  Alan F. Smeaton,et al.  A generic news story segmentation system and its evaluation , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Alan F. Smeaton,et al.  Using video objects and relevance feedback in video retrieval , 2005, SPIE Optics East.

[11]  Alan F. Smeaton,et al.  The físchlár digital video system: a digital library of broadcast TV programmes , 2001, JCDL '01.

[12]  Rémi Ronfard Reading movies: an integrated DVD player for browsing movies and their scripts , 2004, MULTIMEDIA '04.

[13]  Alan F. Smeaton,et al.  Improving the Quality of the Personalized Electronic Program Guide , 2004, User Modeling and User-Adapted Interaction.

[14]  Mika Rautiainen,et al.  Temporal color correlograms for video retrieval , 2002, Object recognition supported by user interaction for service robots.

[15]  Alan F. Smeaton,et al.  The Físchlár-News-Stories System: Personalised Access to an Archive of TV News , 2004, RIAO.

[16]  James Allan,et al.  Monitoring the News: a TDT demonstration system , 2001, HLT.

[17]  Russell Swan,et al.  TimeMine: visualizing automatically constructed timelines. , 2000, SIGIR 2000.

[18]  Noel E. O'Connor,et al.  Region-based segmentation of images using syntactic visual features , 2005 .

[19]  Noel E. O'Connor,et al.  Evaluating and combining digital video shot boundary detection algorithms , 2000 .

[20]  Tobun Dorbin Ng,et al.  Collages as dynamic summaries for news video , 2002, MULTIMEDIA '02.

[21]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[22]  Noel Murphy,et al.  Automatic TV advertisement detection from MPEG bitstream , 2002, Pattern Recognit..

[23]  Alan F. Smeaton,et al.  User-interface to a CCTV video search system , 2005 .

[24]  Alan F. Smeaton,et al.  Indexing, browsing, and searching of digital video , 2005, Annu. Rev. Inf. Sci. Technol..

[25]  Shin'ichi Satoh,et al.  Topic Threading for Structuring a Large-Scale News Video Archive , 2004, CIVR.

[26]  Alan F. Smeaton,et al.  A Comparison of Score, Rank and Probability-Based Fusion Methods for Video Shot Retrieval , 2005, CIVR.

[27]  Wei-Hao Lin,et al.  Confounded Expectations: Informedia at TRECVID 2004 , 2004, TRECVID.