TRECVID 2004 was the fourth running of a TRECstyle video retrieval evaluation, the goal of which remains to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. Over time this effort should yield a better understanding of how systems can effectively accomplish such retrieval and how one can reliably benchmark their performance. TRECVID is funded by the Advanced Research and Development Activity (ARDA) and the National Institute of Standards and Technology (NIST). The evaluation used as test data about 70 hours of US broadcast news video in MPEG-1 format that had been collected for TDT-2 by the Linguistic Data Consortium in 1998. 33 teams from various research organizations — 7 from Asia/Australia, 17 from Europe, and 9 from the Americas — participated in one or more of four tasks: shot boundary determination, story segmentation, feature extraction, and search (manual or interactive). Results were scored by NIST using manually created truth data for shot boundary determination and story segmentation. Feature extraction and search submissions were evaluated based on partial manual judgments of the pooled submissions. This paper is an introduction to, and an overview of, the evaluation framework (the tasks, data, and measures), the results, and the approaches taken by the participating groups. For detailed information about the approaches and results, the reader should see the online proceedings on the TRECVID website (www-nlpir.nist.gov/projects/trecvid).
[1]
Jean-Luc Gauvain,et al.
The LIMSI Broadcast News transcription system
,
2002,
Speech Commun..
[2]
Peter G. B. Enser,et al.
Retrieval of Archival Moving Imagery - CBIR Outside the Frame?
,
2002,
CIVR.
[3]
Ching-Yung Lin,et al.
Video Collaborative Annotation Forum: Establishing Ground-Truth Labels on Large Multimedia Datasets
,
2003,
TRECVID.
[4]
John S. Boreczky,et al.
Comparison of video shot boundary detection techniques
,
1996,
J. Electronic Imaging.
[5]
Charles A. Bouman,et al.
Storage and Retrieval for Image and Video Databases VII
,
1998
.
[6]
Ralph M. Ford.
Quantitative comparison of shot boundary detection metrics
,
1998,
Electronic Imaging.
[7]
Sara Shatford,et al.
Analyzing the Subject of a Picture: A Theoretical Approach
,
1986
.
[8]
Philippe Aigrain,et al.
The automatic real-time analysis of film editing and transition effects and its applications
,
1994,
Comput. Graph..
[9]
Noel E. O'Connor,et al.
Evaluating and combining digital video shot boundary detection algorithms
,
2000
.
[10]
Stéphane Marchand-Maillet,et al.
Towards a Standard Protocol for the Evaluation of Video-to-Shots Segmentation Algorithms
,
1999
.