相关论文

Abstract:The TREC Video Retrieval Evaluation (TRECVID) 2019 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in research and development of content-based exploitation and retrieval of information from digital video via open, metrics-based evaluation. Over the last nineteen years this effort has yielded a better understanding of how systems can effectively accomplish such processing and how one can reliably benchmark their performance. TRECVID has been funded by NIST (National Institute of Standards and Technology) and other US government agencies. In addition, many organizations and individuals worldwide contribute significant time and effort. TRECVID 2019 represented a continuation of four tasks from TRECVID 2018. In total, 27 teams from various research organizations worldwide completed one or more of the following four tasks: 1. Ad-hoc Video Search (AVS) 2. Instance Search (INS) 3. Activities in Extended Video (ActEV) 4. Video to Text Description (VTT) This paper is an introduction to the evaluation framework, tasks, data, and measures used in the workshop.

参考文献

[1]  Alvin F. Martin,et al.  The DET curve in assessment of detection task performance , 1997, EUROSPEECH.

[2]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[3]  Alon Lavie,et al.  METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[4]  Emine Yilmaz,et al.  Estimating average precision with incomplete and imperfect judgments , 2006, CIKM '06.

[5]  Emine Yilmaz,et al.  A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[6]  Larry S. Davis,et al.  AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video , 2011, AVSS.

[7]  Jonathan Weese,et al.  UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems , 2013, *SEMEVAL.

[8]  Timothy Baldwin,et al.  Can machine translation systems be evaluated by the crowd alone , 2015, Natural Language Engineering.

[9]  C. Lawrence Zitnick,et al.  CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  Georges Quénot,et al.  TRECVid Semantic Indexing of Video: A 6-year Retrospective , 2016 .

[12]  Jonathan G. Fiscus,et al.  TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking , 2016, TRECVID.

[13]  Philipp Koehn,et al.  Findings of the 2017 Conference on Machine Translation (WMT17) , 2017, WMT.

[14]  B. Manly Randomization, Bootstrap and Monte Carlo Methods in Biology , 2018 .

[15]  George Awad,et al.  Evaluation of automatic video captioning using direct assessment , 2017, PloS one.

[16]  Xirong Li,et al.  Dual Encoding for Zero-Example Video Retrieval , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  George Awad,et al.  V3C - a Research Video Collection , 2018, MMM.

引用
Joint Analysis and Prediction of Human Actions and Paths in Video
ArXiv
2020
MMVG-INF-Etrol@TRECVID 2019: Activities in Extended Video
TRECVID
2019
Inf@TRECVID 2019: Instance Search Task
TRECVID
2019
An Annotated Video Dataset for Computing Video Memorability
Data in Brief
2021
PicSOM and EURECOM Experiments in TRECVID 2019
TRECVID
2019
Hybrid Sequence Encoder for Text Based Video Retrieval
TRECVID
2019
ITI-CERTH participation in TRECVID 2018
TRECVID
2017
Kindai University and Kobe University at TRECVID 2019 AVS Task
TRECVID
2019
VIRET tool keyword search at TRECVID 2019 AVS task
TRECVID
2019
2020 Sequestered Data Evaluation for Known Activities in Extended Video: Summary and Results
2021 IEEE Winter Conference on Applications of Computer Vision Workshops (WACVW)
2021
Overview of The MediaEval 2021 Predicting Media Memorability Task
ArXiv
2021
Overview of MediaEval 2020 Predicting Media Memorability Task: What Makes a Video Memorable?
ArXiv
2020
EURECOM at TRECVid AVS 2019
TRECVID
2019
FDU Participation in TRECVID 2019 VTT Task
TRECVID
2019
Interactive Search vs. Automatic Search
ACM Trans. Multim. Comput. Commun. Appl.
2021
Face, Body, Voice: Video Person-Clustering with Multiple Modalities
2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
2021
An Automatic Caption Generation for video clip with reducing frames in order to shorten processing time
TRECVID
2019
University of Applied Sciences Mittweida and Chemnitz University of Technology at TRECVID ActEv 2019
TRECVID
2018
Waseda_Meisei_SoftBank at TRECVID 2019: Ad-hoc Video Search
TRECVID
2019
Investigating Memorability of Dynamic Media
ArXiv
2020