Exploiting redundancy in cross-channel video retrieval

Video producers, in telling a news story, tend to repeat important visual and speech material multiple times in adjacent shots, thus creating a certain level of redundancy. We describe this phenomenon, and use it to develop a framework to incorporate redundancy for cross-channel retrieval of visual items using speech. Testing our models in a series of retrieval experiments, we find that incorporating the fact that information occurs redundantly into cross-channel retrieval leads to significant improvements in retrieval performance.

[1]  Marcel Worring,et al.  Adding Semantics to Detectors for Video Retrieval , 2007, IEEE Transactions on Multimedia.

[2]  Dong Xu,et al.  Columbia University TRECVID-2006 Video Search and High-Level Feature Extraction , 2006, TRECVID.

[3]  Tao Tao,et al.  Language Model Information Retrieval with Document Expansion , 2006, NAACL.

[4]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[5]  Stephen W. Smoliar,et al.  Content based video indexing and retrieval , 1994, IEEE MultiMedia.

[6]  John R. Smith,et al.  IBM Research TRECVID-2009 Video Retrieval System , 2009, TRECVID.

[7]  Marcel Worring,et al.  The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[8]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[9]  Maarten de Rijke,et al.  The value of stories for speech-based video search , 2007, CIVR '07.

[10]  Jun Yang,et al.  Exploring temporal consistency for video analysis and retrieval , 2006, MIR '06.

[11]  Rong Yan,et al.  Multi-Lingual Broadcast News Retrieval , 2006, TRECVID.

[12]  Jun Yang,et al.  Finding Person X: Correlating Names with Visual Appearances , 2004, CIVR.

[13]  Sheng Tang,et al.  TRECVID 2006 by NUS-I2R , 2006, TRECVID.

[14]  Dennis Koelma,et al.  The MediaMill TRECVID 2008 Semantic Video Search Engine , 2008, TRECVID.

[15]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[16]  Amit Singhal,et al.  Document expansion for speech retrieval , 1999, SIGIR '99.

[17]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[18]  Cor J. Veenman,et al.  The influence of cross-validation on video classification performance , 2006, MM '06.

[19]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.