论文信息 - Quaero at TRECVID 2011: Semantic Indexing and Multimedia Event Detection

Quaero at TRECVID 2011: Semantic Indexing and Multimedia Event Detection

The Quaero group is a consortium of French and German organizations working on Multimedia Indexing and Retrieval. LIG and KIT participated to the semantic indexing task and LIG participated to the organization of this task. LIG also participated to the multimedia event detection task. This paper describes these participations. For the semantic indexing task, our approach uses a six-stages processing pipelines for computing scores for the likelihood of a video shot to contain a target concept. These scores are then used for producing a ranked list of images or shots that are the most likely to contain the target concept. The pipeline is composed of the following steps: descriptor extraction, descriptor optimization, classifi cation, fusion of descriptor variants, higher-level fusion, and re-ranking. We used a number of diff erent descriptors and a hierarchical fusion strategy. We also used conceptual feedback by adding a vector of classi fication score to the pool of descriptors. The best Quaero run has a Mean Inferred Average Precision of 0.1529, which ranked us 3rd out of 19 participants. We participated to the multimedia event detection task with a system derived from the generic one we have for general purpose concept indexing in videos considering the target events as concepts. Detection scores on videos are produced from the scores on shots.

[1] Nicolas Ballas,et al. IRIM at TRECVID 2013: Semantic indexing and multimedia instance search , 2013 .

[2] Georges Quénot,et al. Re-ranking by local re-scoring for video indexing and retrieval , 2011, CIKM '11.

[3] Stéphane Ayache,et al. Video Corpus Annotation Using Active Learning , 2008, ECIR.

[4] Stéphane Ayache,et al. Using Topic Concepts for Semantic Video Shots Classification , 2006, CIVR.

[5] Georges Quénot,et al. CLIPS at TRECVID : Shot Boundary Detection and Feature Detection , 2003, TRECVID.

[6] Koen E. A. van de Sande,et al. A comparison of color features for visual concept classification , 2008, CIVR '08.

[7] Paul Over,et al. Evaluation campaigns and TRECVid , 2006, MIR '06.

[8] Stéphane Ayache,et al. IRIM at TRECVID 2010: High Level Feature Extraction and Instance Search , 2010 .

[9] Emine Yilmaz,et al. A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[10] John R. Smith,et al. Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[11] Georges Quénot,et al. Evaluations of multi-learner approaches for concept indexing in video documents , 2010, RIAO.