Video search in concept subspace: a text-like paradigm

Though both the quantity and quality of semantic concept detectors for video are continuously improving, it remains unclear how to exploit the detected concepts as semantic indices in video search for a specific query. In this paper, we tackle this problem and propose a video search framework that operates like text-document search. Building on well-founded text-search principles, the framework first selects a few concepts related to a given query, using a tf-idf-like scheme, called c-tf-idf, to measure how informative each concept is for that query. The selected concepts form a concept subspace, within which search is conducted by either a Vector Model or a Language Model. Further, two algorithms, Linear Summation and Random Walk through Concept-Link, are explored to combine the concept search results with other baseline search results in a reranking scheme. The framework is both effective and efficient. Using a lexicon of 311 concepts from the LSCOM concept ontology, experiments on the TRECVID 2006 search data set show that, used alone, search within the concept subspace achieves state-of-the-art concept-search results; used to rerank the baseline results, it improves the top 20 automatic search runs in TRECVID 2006 by approximately 20% on average, and the most significant one by approximately 50%, all within 180 milliseconds on an ordinary PC.
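To make the pipeline concrete, the following is a minimal sketch of the two text-like steps the abstract describes: a tf-idf-style informativeness score used to pick a small concept subspace for a query, and a Linear Summation fusion of concept-search and baseline scores. The exact c-tf-idf formula and fusion weights are not given here, so the smoothed idf, the `concept_texts` lexicon, and the `alpha` parameter are illustrative assumptions, not the paper's definitions.

```python
import math
from collections import Counter

def c_tf_idf(query_terms, concept_texts):
    """Score each concept's informativeness to the query with a
    tf-idf-style weight (illustrative stand-in for the paper's c-tf-idf)."""
    n = len(concept_texts)
    # document frequency of each query term over the concept descriptions
    df = {t: sum(1 for words in concept_texts.values() if t in words)
          for t in query_terms}
    scores = {}
    for concept, words in concept_texts.items():
        tf = Counter(words)
        scores[concept] = sum(
            tf[t] * math.log((n + 1) / (df[t] + 1))  # smoothed idf (assumed)
            for t in query_terms)
    return scores

def concept_subspace(query_terms, concept_texts, k=3):
    """Keep the top-k scoring concepts as the query's concept subspace."""
    scores = c_tf_idf(query_terms, concept_texts)
    return sorted(scores, key=scores.get, reverse=True)[:k]

def rerank(baseline_scores, concept_scores, alpha=0.5):
    """Linear Summation reranking: fuse baseline and concept-search
    scores per shot and return shots in fused-score order."""
    shots = set(baseline_scores) | set(concept_scores)
    fused = {s: alpha * baseline_scores.get(s, 0.0)
                + (1 - alpha) * concept_scores.get(s, 0.0)
             for s in shots}
    return sorted(fused, key=fused.get, reverse=True)
```

For a query such as "soccer stadium", `concept_subspace` would retain concepts whose textual descriptions share the query's terms, and `rerank` would then boost shots that score well under those concepts' detectors; the Random Walk through Concept-Link alternative is not sketched here.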
