相关论文

Abstract:HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés. TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking George Awad, Jonathan Fiscus, David Joy, Martial Michel, Alan Smeaton, Wessel Kraaij, Maria Eskevich, Robin Aly, Roeland Ordelman, Marc Ritter, et al.

参考文献

[1]  Maria Eskevich,et al.  Adapting Binary Information Retrieval Evaluation Metrics for Segment-based Retrieval Tasks , 2013, ArXiv.

[2]  Paul Over,et al.  The TRECVid 2008 Event Detection evaluation , 2009, 2009 Workshop on Applications of Computer Vision (WACV).

[3]  Alon Lavie,et al.  METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[4]  David A. Shamma,et al.  YFCC100M , 2015, Commun. ACM.

[5]  C. Lawrence Zitnick,et al.  CIDEr: Consensus-based image description evaluation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  Georges Quénot,et al.  TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[8]  Jonathan Weese,et al.  UMBC_EBIQUITY-CORE: Semantic Textual Similarity Systems , 2013, *SEMEVAL.

[9]  i-LIDS Team,et al.  Imagery Library for Intelligent Detection Systems (i-LIDS); A Standard for Testing Video Based Detection Systems , 2006, Proceedings 40th Annual 2006 International Carnahan Conference on Security Technology.

[10]  M. Koit,et al.  Human Language Technologies - The Baltic Perspective: Proceedings of the Fifth International Conference Baltic HLT 2012 - Volume 247 Frontiers in Artificial Intelligence and Applications , 2012 .

[11]  Emine Yilmaz,et al.  Estimating average precision with incomplete and imperfect judgments , 2006, CIKM '06.

[12]  Martha Larson,et al.  Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation , 2017, MMM.

[13]  Lori Lamel Multilingual Speech Processing Activities in Quaero: Application to Multimedia Search in Unstructured Data , 2012, Baltic HLT.

[14]  Tao Mei,et al.  MSR-VTT: A Large Video Description Dataset for Bridging Video and Language , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Emine Yilmaz,et al.  A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[16]  Georges Quénot,et al.  TRECVid Semantic Indexing of Video: A 6-year Retrospective , 2016 .

[17]  Paul Over,et al.  Creating HAVIC: Heterogeneous Audio Visual Internet Collection , 2012, LREC.

[18]  Martha Larson,et al.  Blip10000: a social video dataset containing SPUG content for tagging and retrieval , 2013, MMSys.

[19]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[20]  Thomas Sikora,et al.  Feature-based video key frame extraction for low quality video sequences , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.

[21]  Gareth J. F. Jones,et al.  Evaluating Search and Hyperlinking: An Example of the Design, Test, Refine Cycle for Metric Development , 2015, MediaEval.

引用
Deep Learning Based Imbalanced Data Classification and Information Retrieval for Multimedia Big Data
2018
Concept Language Models and Event-based Concept Number Selection for Zero-example Event Detection
ICMR
2017
A Crossmodal Approach to Multimodal Fusion in Video Hyperlinking
IEEE MultiMedia
2018
Temporal localization of audio events for conflict monitoring in social media
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
2017
Binary convolutional neural network features off-the-shelf for image to video linking in endoscopic multimedia databases
Multimedia Tools and Applications
2018
UEC at TRECVID 2016 AVS task
TRECVID
2016
Informedia @ TRECVID 2017
TRECVID
2017
Informedia @ TRECVID 2016
TRECVID
2016
Evaluation of automatic video captioning using direct assessment
PloS one
2017
YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
2017
Shenzhen Institutes of Advanced Technology, CAS, China at TRECVID INS 2016
TRECVID
2016
Waseda at TRECVID 2016: Ad-hoc Video Search
TRECVID
2016
Multimodal Video-to-Video Linking: Turning to the Crowd for Insight and Evaluation
MMM
2017
A semantic-based video scene segmentation using a deep neural network
J. Inf. Sci.
2019
A Study on Multimodal Video Hyperlinking with Visual Aggregation
2018 IEEE International Conference on Multimedia and Expo (ICME)
2018
Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking
IEEE Transactions on Multimedia
2020
An improved hybridized deep structured model for accurate video event recognition
2020
Video Description
ACM Comput. Surv.
2018
VidCEP: Complex Event Processing Framework to Detect Spatiotemporal Patterns in Video Streams
2019 IEEE International Conference on Big Data (Big Data)
2019
Enabling GPU-Enhanced Computer Vision and Machine Learning Research Using Containers
ISC Workshops
2019