论文信息 - TRECVID: evaluating the effectiveness of information retrieval tasks on digital video

TRECVID: evaluating the effectiveness of information retrieval tasks on digital video

Paul Over

Alan F. Smeaton

Wessel Kraaij

A. Smeaton

P. Over

Wessel Kraaij

Abstract:TRECVID is an annual exercise which encourages research in information retrieval from digital video by providing a large video test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results. TRECVID benchmarking covers both interactive and manual searching by end users, as well as the benchmarking of some supporting technologies including shot boundary detection, extraction of some semantic features, and the automatic segmentation of TV news broadcasts into non-overlapping news stories. TRECVID has a broad range of over 40 participating groups from across the world and as it is now (2004) in its 4th annual cycle it is opportune to stand back and look at the lessons we have learned from the cumulative activity. In this paper we shall present a brief and high-level overview of the TRECVID activity covering the data, the benchmarked tasks, the overall results obtained by groups to date and an overview of the approaches taken by selective groups in some tasks. While progress from one year to the next cannot be measured directly because of the changing nature of the video data we have been using, we shall present a summary of the lessons we have learned from TRECVID and include some pointers on what we feel are the most important of these lessons.

参考文献

[1] John S. Boreczky,et al. Comparison of video shot boundary detection techniques , 1996, J. Electronic Imaging.

[2] Ralph M. Ford. Quantitative comparison of shot boundary detection metrics , 1998, Electronic Imaging.

[3] Thierry Pun,et al. Toward a fair benchmark for image browsers , 2000, SPIE Optics East.

[4] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[5] Paul Over,et al. TREC video retrieval evaluation: a case study and status report , 2004 .

[6] Chantal Soulé-Dupuis,et al. Coupling approaches, coupling media and coupling languages for information retrieval , 2004 .

引用

The Open Video Digital Library: A Möbius strip of research and practice

J. Assoc. Inf. Sci. Technol.

2006

Multi-modal surrogates for retrieving and making sense of videos: is synchronization between the multiple modalities optimal?

2010

Video tapestries with continuous temporal zoom

SIGGRAPH 2010

2010

Term Selection and Query Operations for Video Retrieval

ECIR

2007

The challenge problem for automated detection of 101 semantic concepts in multimedia

MM '06

2006

The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing

IEEE Transactions on Pattern Analysis and Machine Intelligence

2006

Asymmetric Learning and Dissimilarity Spaces for Content-Based Retrieval

CIVR

2006

A two-level queueing system for interactive browsing and searching of video content

Multimedia Systems

2006

Using Segmented Objects in Ostensive Video Shot Retrieval

Adaptive Multimedia Retrieval

2005

A crowdsourcing framework for the production and use of film and television data

New Rev. Hypermedia Multim.

2011

TRECVID: evaluating the effectiveness of information retrieval tasks on digital video

The Open Video Digital Library: A Möbius strip of research and practice

Multi-modal surrogates for retrieving and making sense of videos: is synchronization between the multiple modalities optimal?

Video tapestries with continuous temporal zoom

Term Selection and Query Operations for Video Retrieval

The challenge problem for automated detection of 101 semantic concepts in multimedia

Image retrieval: Ideas, influences, and trends of the new age

Model-shared subspace boosting for multi-label classification

Practical Application of Near Duplicate Detection for Image Database

Visual Learning of Socio-Video Semantics

Medical Visual Information Retrieval: State of the Art and Challenges Ahead

Semantic indexing and retrieval of video

Efficient and Robust Methods for Audio and Video Signal Analysis

The trecvid 2007 BBC rushes summarization evaluation pilot

Graph Partition Model for Robust Temporal Data Segmentation

The Semantic Pathfinder for Generic News Video Indexing

The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing

Asymmetric Learning and Dissimilarity Spaces for Content-Based Retrieval

A two-level queueing system for interactive browsing and searching of video content

Using Segmented Objects in Ostensive Video Shot Retrieval

A crowdsourcing framework for the production and use of film and television data