This paper is concerned with the problem of improving the performance of text search baseline in video retrieval, specifically for the search tasks in TRECVID. Given a query in plain text, we first implement syntactic segmentation and semantic expansion of the query, then identify the underlying "targeted objects" which should appear in the retrieved video shots, and scale up the weights of the video shots retrieved by the query terms that represent these targeted objects. We name the approaches as "object-sensitive query analysis" for video search. Specifically, we propose a set of methods to identify the specific terms representing the "targeted objects" in a video search query, and a modified object-centric BM25 algorithm to emphasize the impact of these specific object-terms. In practice, we place the process of object-sensitive query analysis before the text search stage, and verify the effectiveness of the proposed approaches with the TRECVID 2005 and 2006 datasets. The experimental results indicate that the proposed object-sensitive approaches to query analysis bring significant improvement upon the raw text search baseline of video search.
[1]
Stephen E. Robertson,et al.
Relevance weighting of search terms
,
1976,
J. Am. Soc. Inf. Sci..
[2]
Alexander G. Hauptmann,et al.
LSCOM Lexicon Definitions and Annotations (Version 1.0)
,
2006
.
[3]
Shih-Fu Chang,et al.
Video search reranking via information bottleneck principle
,
2006,
MM '06.
[4]
Stephen E. Robertson,et al.
Overview of the Okapi projects
,
1997,
J. Documentation.
[5]
Pinar Duygulu Sahin,et al.
Joint visual-text modeling for automatic retrieval of multimedia documents
,
2005,
ACM Multimedia.
[6]
Alexander G. Hauptmann,et al.
Successful approaches in the TREC video retrieval evaluations
,
2004,
MULTIMEDIA '04.
[7]
Alan F. Smeaton,et al.
A Comparison of Score, Rank and Probability-Based Fusion Methods for Video Shot Retrieval
,
2005,
CIVR.
[8]
Dong Xu,et al.
Columbia University TRECVID-2006 Video Search and High-Level Feature Extraction
,
2006,
TRECVID.
[9]
Rong Yan,et al.
Extreme video retrieval: joint maximization of human and computer performance
,
2006,
MM '06.