Multi-query interactive image and video retrieval -: theory and practice

We propose a new interactive image and video retrieval system called multi-query interactive retrieval, which is designed to jointly optimize the retrieval performance on multiple query topics. The proposed system employs a learning-based hybrid retrieval approach, which can automatically switch between tagging and browsing interface based on user labeling efficiency. To formalize the retrieval process, we use two formal annotation models to track and estimate the retrieval time for each method. Based on the parameters of these models, the system integrates the tagging-based and browsing-based methods in order to minimize overall retrieval time across the full set of query topics. This hybrid multi-topic retrieval approach is demonstrated to be highly effective on two large-scale video collections.

[1]  John R. Smith,et al.  A web-based system for collaborative annotation of large image and video collections: an evaluation and user study , 2005, MULTIMEDIA '05.

[2]  Bijan Parsia,et al.  PhotoStuff-An Image Annotation Tool for the Semantic Web , 2005 .

[3]  Tat-Seng Chua,et al.  TRECVID 2005 by NUS PRIS , 2005, TRECVID.

[4]  Paul Over,et al.  TRECVID 2006 Overview , 2006, TRECVID.

[5]  Alan F. Smeaton,et al.  The físchlár digital video system: a digital library of broadcast TV programmes , 2001, JCDL '01.

[6]  Rong Yan,et al.  A learning-based hybrid tagging and browsing approach for efficient manual image annotation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[8]  Rong Yan,et al.  Extreme video retrieval: joint maximization of human and computer performance , 2006, MM '06.

[9]  John Adcock,et al.  FXPAL Experiments for TRECVID 2004 , 2004, TRECVID.

[10]  John R. Smith,et al.  IBM Research TRECVID-2009 Video Retrieval System , 2009, TRECVID.

[11]  Yihong Gong,et al.  Lessons Learned from Building a Terabyte Digital Video Library , 1999, Computer.

[12]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[13]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[14]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[15]  Andrew Zisserman,et al.  Oxford TRECVid 2007 - Notebook paper , 2007 .

[16]  Marcel Worring,et al.  The MediaMill TRECVID 2004 Semantic Viedo Search Engine , 2004, TRECVID.

[17]  Rong Yan,et al.  A review of text and image retrieval approaches for broadcast news video , 2007, Information Retrieval.