The TREC Video Retrieval Evaluation (TRECVID) 2007 represents the seventh running of a TREC-style (trec.nist.gov) video retrieval evaluation, the goal of which remains to promote progress in content-based retrieval from digital video via open, metrics-based evaluation. Over time this effort should yield a better understanding of how systems can effectively accomplish such retrieval and how one can reliably benchmark their performance. TRECVID 2007 was funded by the US National Institute of Standards and Technology (NIST) and the Intelligence Advanced Research Projects Activity (IARPA). 54 teams (see Table 1 at the end of the paper) from various research organizations — 17 from Asia, 23 from Europe, 12 from the Americas, and 2 from Australia — participated in one or more of four tasks: shot boundary determination, high-level feature extraction, search (fully automatic, manually assisted, or interactive) or pre-production video (rushes) summarization. See Figure 1 for an overview of TRECVID’s evolution. In 2007 TRECVID began what sets out to be a 3-year cycle using new data sources, related to the broadcast news used in 2003-2006 but significantly different. Data for the search and feature tasks was about 100 hours of (MPEG-1) news magazine, science news, news reports, documentaries, educational programming, and archival video almost entirely in Dutch from the Netherlands Institute for Sound and Vision. About 6 additional hours of Sound and Vision data was used for the shot boundary task. The BBC Archive provided about 50 hours of “rushes” pre-production video material with natural sound, errors, etc. from several BBC dramatic series for use in the summarization task.
[1]
B. Manly.
Randomization, Bootstrap and Monte Carlo Methods in Biology
,
2018
.
[2]
Paul Over,et al.
The trecvid 2007 BBC rushes summarization evaluation pilot
,
2007,
TVS '07.
[3]
Andrew P. Robinson,et al.
Randomization, Bootstrap and Monte Carlo Methods in Biology
,
2007
.
[4]
Stéphane Ayache,et al.
TRECVID 2007: Collaborative Annotation using Active Learning
,
2007,
TRECVID.
[5]
Emine Yilmaz,et al.
Estimating average precision with incomplete and imperfect judgments
,
2006,
CIKM '06.
[6]
Peter G. B. Enser,et al.
Retrieval of Archival Moving Imagery - CBIR Outside the Frame?
,
2002,
CIVR.
[7]
Franciska de Jong,et al.
Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition
,
2007,
SAMT.
[8]
A. I.,et al.
Neural Field Continuum Limits and the Structure–Function Partitioning of Cognitive–Emotional Brain Networks
,
2023,
Biology.
[9]
Paul Over,et al.
TRECVID 2006 Overview
,
2006,
TRECVID.
[10]
Sara Shatford,et al.
Analyzing the Subject of a Picture: A Theoretical Approach
,
1986
.
[11]
V. Rich.
Personal communication
,
1989,
Nature.