Video search reranking via information bottleneck principle

We propose a novel and generic video/image reranking algorithm, IB reranking, which reorders the results of a text-only search by discovering the salient visual patterns of relevant and irrelevant shots from the approximate relevance provided by the text results. Based on the rigorous Information Bottleneck (IB) principle, IB reranking finds the optimal clustering of images that preserves the maximal mutual information between the search relevance and the high-dimensional low-level visual features of the images in the text search results. Evaluation on the TRECVID 2003-2005 data sets shows significant improvement over the text search baseline, with relative gains in average performance of up to 23%. The method requires no image search examples from the user, yet is competitive with state-of-the-art example-based approaches. It is also highly generic and performs comparably to sophisticated models that are heavily tuned for specific query classes, such as named persons. Our experimental analysis also confirms that the proposed reranking method works well when sufficient recurrent visual patterns exist in the search results, as is often the case in multi-source news videos.
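To make the idea concrete, the sketch below is a minimal, illustrative rendering of IB-style reranking, not the authors' implementation: it smooths pseudo-relevance labels (taken from the top text-search results) over the visual feature space, greedily clusters shots with an agglomerative Information Bottleneck criterion that preserves mutual information with relevance, and then orders clusters by their estimated relevance while keeping the original text ranking within each cluster. All names and parameters here (smoothed_relevance, agglomerative_ib, ib_rerank, the Gaussian bandwidth, the pseudo-label cutoff, and the fixed cluster count) are assumptions of this sketch rather than details taken from the paper.

```python
import numpy as np

def smoothed_relevance(features, pseudo_labels, bandwidth=1.0):
    """Estimate p(relevant | shot) by Gaussian-kernel smoothing of pseudo
    relevance labels (from text-search ranks) over the visual feature space,
    so that visually similar shots receive similar relevance estimates."""
    d2 = ((features[:, None, :] - features[None, :, :]) ** 2).sum(axis=-1)
    k = np.exp(-d2 / (2.0 * bandwidth ** 2))
    p_rel = (k @ pseudo_labels) / k.sum(axis=1)
    return np.clip(p_rel, 1e-6, 1.0 - 1e-6)   # keep strictly inside (0, 1)

def agglomerative_ib(p_xy, n_clusters):
    """Greedy agglomerative Information Bottleneck: repeatedly merge the pair
    of clusters whose merge loses the least relevance information I(C; Y).
    p_xy is the (n_shots, 2) joint distribution over shots X and relevance Y."""
    def kl(a, b):
        m = a > 0
        return float(np.sum(a[m] * np.log(a[m] / b[m])))

    p_c = p_xy.sum(axis=1)                  # cluster priors, initially p(x)
    p_y_c = p_xy / p_c[:, None]             # p(y | cluster), one cluster per shot
    clusters = [[i] for i in range(len(p_c))]

    while len(clusters) > n_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                w = p_c[i] + p_c[j]
                pi_i, pi_j = p_c[i] / w, p_c[j] / w
                merged = pi_i * p_y_c[i] + pi_j * p_y_c[j]
                # information loss of this merge = weighted Jensen-Shannon divergence
                cost = w * (pi_i * kl(p_y_c[i], merged) + pi_j * kl(p_y_c[j], merged))
                if best is None or cost < best[0]:
                    best = (cost, i, j, merged, w)
        _, i, j, merged, w = best
        p_y_c[i], p_c[i] = merged, w        # merge cluster j into cluster i
        clusters[i].extend(clusters[j])
        p_y_c = np.delete(p_y_c, j, axis=0)
        p_c = np.delete(p_c, j)
        del clusters[j]
    return clusters, p_y_c

def ib_rerank(features, text_scores, n_pseudo_pos=20, n_clusters=5):
    """Rerank text-search results: cluster shots with IB, order clusters by
    estimated relevance p(y = relevant | cluster), and keep the original
    text-search order inside each cluster."""
    features = np.asarray(features, dtype=float)
    scores = np.asarray(text_scores, dtype=float)
    n = len(scores)

    pseudo = np.zeros(n)
    pseudo[np.argsort(-scores)[:n_pseudo_pos]] = 1.0   # top text hits = pseudo-relevant

    p_rel = smoothed_relevance(features, pseudo)
    p_xy = np.stack([1.0 - p_rel, p_rel], axis=1) / n  # joint p(x, y), uniform p(x)

    clusters, p_y_c = agglomerative_ib(p_xy, n_clusters)
    reranked = []
    for c in np.argsort(-p_y_c[:, 1]):                 # most relevant cluster first
        reranked.extend(sorted(clusters[c], key=lambda s: -scores[s]))
    return reranked                                     # shot indices, best first
```

A real system would tune the kernel bandwidth and the number of clusters to the feature scale and result-set size; the fixed values above are placeholders for illustration only.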
