TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval
Abstract: The TREC Video Retrieval Evaluation (TRECVID) 2019 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in research and development of content-based exploitation and retrieval of information from digital video via open, metrics-based evaluation. Over the last nineteen years this effort has yielded a better understanding of how systems can effectively accomplish such processing and how one can reliably benchmark their performance. TRECVID has been funded by NIST (National Institute of Standards and Technology) and other US government agencies. In addition, many organizations and individuals worldwide contribute significant time and effort. TRECVID 2019 represented a continuation of four tasks from TRECVID 2018. In total, 27 teams from various research organizations worldwide completed one or more of the following four tasks:
1. Ad-hoc Video Search (AVS)
2. Instance Search (INS)
3. Activities in Extended Video (ActEV)
4. Video to Text Description (VTT)
This paper is an introduction to the evaluation framework, tasks, data, and measures used in the workshop.
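The measures themselves are defined in the paper; as a rough illustration of the kind of metrics-based scoring used for ranked retrieval tasks such as AVS, the sketch below computes plain average precision for a single query. The ranked shot list and relevance set are hypothetical, and the official AVS evaluation actually reports extended inferred average precision computed over sampled judgments, not this exact formula.

```python
def average_precision(ranked_shots, relevant):
    """Plain average precision for one query: the mean of precision@k
    taken at every rank k where a relevant shot is returned."""
    hits = 0
    precision_sum = 0.0
    for k, shot_id in enumerate(ranked_shots, start=1):
        if shot_id in relevant:
            hits += 1
            precision_sum += hits / k
    return precision_sum / len(relevant) if relevant else 0.0

# Hypothetical example: a system's ranked result list and the judged-relevant set.
ranked = ["shot1_3", "shot7_1", "shot2_9", "shot5_4"]
relevant = {"shot1_3", "shot5_4"}
print(average_precision(ranked, relevant))  # (1/1 + 2/4) / 2 = 0.75
```

Averaging this value over all evaluated queries gives a mean average precision style summary of a run, which is the general shape of the AVS ranking measure.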