论文信息 - Reusing annotation labor for concept selection

Reusing annotation labor for concept selection

Describing shots through the occurrence of semantic concepts is the first step towards modeling the content of a video semantically. An important challenge is to automatically select the right concepts for a given information need. For example, systems should be able to decide whether the concept "Outdoor" should be included into a search for "Street Basketball". In this paper we provide an innovative method to automatically select concepts for an information need. To achieve this, we provide an estimation for the occurrence probability of a concept in relevant shots, which helps us to quantify the helpfulness of a concept. Our method reuses existing training data which is annotated with concept occurrences to build a text collection. Searching in this collection with a text retrieval system and knowing about the concept occurrences allows us to come up with a good estimate for this probability. We evaluate our method against a concept selection benchmark and search runs on both the TRECVID 2005 and 2007 collections. These experiments show that the estimation consistently improves retrieval effectiveness.

Djoerd Hiemstra | Arjen P. de Vries | Robin Aly

[1] Stephen E. Robertson,et al. A probabilistic model of information retrieval: development and comparative experiments - Part 2 , 2000, Inf. Process. Manag..

[2] Chong-Wah Ngo,et al. Towards optimal bag-of-features for object categorization and semantic video retrieval , 2007, CIVR '07.

[3] Marcel Worring,et al. The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[4] Bo Zhang,et al. Using High-Level Semantic Features in Video Retrieval , 2006, CIVR.

[5] Rong Yan,et al. Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News , 2007, IEEE Transactions on Multimedia.

[6] Djoerd Hiemstra,et al. The Effectiveness of Concept Based Search for Video Retrieval , 2007, LWA.

[7] Norbert Fuhr,et al. Probabilistic Models in Information Retrieval , 1992, Comput. J..

[8] W. Bruce Croft,et al. Using Probabilistic Models of Document Retrieval without Relevance Information , 1979, J. Documentation.

[9] John R. Smith,et al. Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[10] Djoerd Hiemstra,et al. Using Query Profiles for Clarification , 2006, ECIR.

[11] Djoerd Hiemstra,et al. A probabilistic ranking framework using unobservable binary events for video search , 2008, CIVR '08.