论文信息 - TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking

Abstract:HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés. TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking George Awad, Jonathan Fiscus, David Joy, Martial Michel, Alan Smeaton, Wessel Kraaij, Maria Eskevich, Robin Aly, Roeland Ordelman, Marc Ritter, et al.

参考文献

[1] Maria Eskevich,et al. Adapting Binary Information Retrieval Evaluation Metrics for Segment-based Retrieval Tasks , 2013, ArXiv.

[2] Paul Over,et al. The TRECVid 2008 Event Detection evaluation , 2009, 2009 Workshop on Applications of Computer Vision (WACV).

[3] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[4] David A. Shamma,et al. YFCC100M , 2015, Commun. ACM.

[5] C. Lawrence Zitnick,et al. CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[8] Jonathan Weese,et al. UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems , 2013, *SEMEVAL.

[9] i-LIDS Team,et al. Imagery Library for Intelligent Detection Systems (i-LIDS); A Standard for Testing Video Based Detection Systems , 2006, Proceedings 40th Annual 2006 International Carnahan Conference on Security Technology.

[10] M. Koit,et al. Human Language Technologies - The Baltic Perspective: Proceedings of the Fifth International Conference Baltic HLT 2012 - Volume 247 Frontiers in Artificial Intelligence and Applications , 2012 .

[11] Emine Yilmaz,et al. Estimating average precision with incomplete and imperfect judgments , 2006, CIKM '06.

[12] Martha Larson,et al. Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation , 2017, MMM.

[13] Lori Lamel. Multilingual Speech Processing Activities in Quaero: Application to Multimedia Search in Unstructured Data , 2012, Baltic HLT.

[14] Tao Mei,et al. MSR-VTT: A Large Video Description Dataset for Bridging Video and Language , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Emine Yilmaz,et al. A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[16] Georges Quénot,et al. TRECVid Semantic Indexing of Video: A 6-year Retrospective , 2016 .

[17] Paul Over,et al. Creating HAVIC: Heterogeneous Audio Visual Internet Collection , 2012, LREC.

[18] Martha Larson,et al. Blip10000: a social video dataset containing SPUG content for tagging and retrieval , 2013, MMSys.

[19] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[20] Thomas Sikora,et al. Feature-based video key frame extraction for low quality video sequences , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.

[21] Gareth J. F. Jones,et al. Evaluating Search and Hyperlinking: An Example of the Design, Test, Refine Cycle for Metric Development , 2015, MediaEval.

引用

Deep Learning Based Imbalanced Data Classification and Information Retrieval for Multimedia Big Data

2018

Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection

ICMR

2017

A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking

IEEE MultiMedia

2018

Temporal localization of audio events for conflict monitoring in social media

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

2017

Binary convolutional neural network features off-the-shelf for image to video linking in endoscopic multimedia databases

Multimedia Tools and Applications

2018

YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

2017

Shenzhen Institutes of Advanced Technology, CAS, China at TRECVID INS 2016

TRECVID

2016

Waseda at TRECVID 2016: Ad-hoc Video Search

TRECVID

2016

Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

MMM

2017

A semantic-based video scene segmentation using a deep neural network

J. Inf. Sci.

2019

A Study on Multimodal Video Hyperlinking with Visual Aggregation

2018 IEEE International Conference on Multimedia and Expo (ICME)

2018

Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking

IEEE Transactions on Multimedia

2020

An improved hybridized deep structured model for accurate video event recognition

2020

Video Description

ACM Comput. Surv.

2018

VidCEP: Complex Event Processing Framework to Detect Spatiotemporal Patterns in Video Streams

2019 IEEE International Conference on Big Data (Big Data)

2019

Enabling GPU-Enhanced Computer Vision and Machine Learning Research Using Containers

ISC Workshops

2019

TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking

Deep Learning Based Imbalanced Data Classification and Information Retrieval for Multimedia Big Data

Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection

A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking

Temporal localization of audio events for conflict monitoring in social media

Binary convolutional neural network features off-the-shelf for image to video linking in endoscopic multimedia databases

UEC at TRECVID 2016 AVS task

Informedia @ TRECVID 2017

Informedia @ TRECVID 2016

Evaluation of automatic video captioning using direct assessment

YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video

Shenzhen Institutes of Advanced Technology, CAS, China at TRECVID INS 2016

Waseda at TRECVID 2016: Ad-hoc Video Search

Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation

A semantic-based video scene segmentation using a deep neural network

A Study on Multimodal Video Hyperlinking with Visual Aggregation

Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking

An improved hybridized deep structured model for accurate video event recognition

Video Description

VidCEP: Complex Event Processing Framework to Detect Spatiotemporal Patterns in Video Streams

Enabling GPU-Enhanced Computer Vision and Machine Learning Research Using Containers