相关论文

TRECVID: evaluating the effectiveness of information retrieval tasks on digital video

Abstract:TRECVID is an annual exercise which encourages research in information retrieval from digital video by providing a large video test collection, uniform scoring procedures, and a forum for organizations interested in comparing their results. TRECVID benchmarking covers both interactive and manual searching by end users, as well as the benchmarking of some supporting technologies including shot boundary detection, extraction of some semantic features, and the automatic segmentation of TV news broadcasts into non-overlapping news stories. TRECVID has a broad range of over 40 participating groups from across the world and as it is now (2004) in its 4th annual cycle it is opportune to stand back and look at the lessons we have learned from the cumulative activity. In this paper we shall present a brief and high-level overview of the TRECVID activity covering the data, the benchmarked tasks, the overall results obtained by groups to date and an overview of the approaches taken by selective groups in some tasks. While progress from one year to the next cannot be measured directly because of the changing nature of the video data we have been using, we shall present a summary of the lessons we have learned from TRECVID and include some pointers on what we feel are the most important of these lessons.

引用
The Open Video Digital Library: A Möbius strip of research and practice
J. Assoc. Inf. Sci. Technol.
2006
Multi-modal surrogates for retrieving and making sense of videos: is synchronization between the multiple modalities optimal?
2010
Video tapestries with continuous temporal zoom
SIGGRAPH 2010
2010
Term Selection and Query Operations for Video Retrieval
ECIR
2007
The challenge problem for automated detection of 101 semantic concepts in multimedia
MM '06
2006
Image retrieval: Ideas, influences, and trends of the new age
CSUR
2008
Model-shared subspace boosting for multi-label classification
KDD '07
2007
Practical Application of Near Duplicate Detection for Image Database
MCSS
2014
Visual Learning of Socio-Video Semantics
2015
Medical Visual Information Retrieval: State of the Art and Challenges Ahead
2007 IEEE International Conference on Multimedia and Expo
2007
Semantic indexing and retrieval of video
MM '06
2006
Efficient and Robust Methods for Audio and Video Signal Analysis
2018
The trecvid 2007 BBC rushes summarization evaluation pilot
TVS '07
2007
Graph Partition Model for Robust Temporal Data Segmentation
PAKDD
2005
The Semantic Pathfinder for Generic News Video Indexing
2006 IEEE International Conference on Multimedia and Expo
2006
The Semantic Pathfinder: Using an Authoring Metaphor for Generic Multimedia Indexing
IEEE Transactions on Pattern Analysis and Machine Intelligence
2006
Asymmetric Learning and Dissimilarity Spaces for Content-Based Retrieval
CIVR
2006
A two-level queueing system for interactive browsing and searching of video content
Multimedia Systems
2006
Using Segmented Objects in Ostensive Video Shot Retrieval
Adaptive Multimedia Retrieval
2005
A crowdsourcing framework for the production and use of film and television data
New Rev. Hypermedia Multim.
2011