A Fine Granularity Object-Level Representation for Event Detection and Recounting
[1] Florian Metze,et al. Beyond audio and video retrieval: towards multimedia summarization , 2012, ICMR.
[2] Yi Yang,et al. Complex Event Detection using Semantic Saliency and Nearly-Isotonic SVM , 2015, ICML.
[3] Chong-Wah Ngo,et al. Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts , 2016, ICMR.
[4] Masoud Mazloom,et al. Conceptlets: Selective Semantics for Classifying Video Events , 2014, IEEE Transactions on Multimedia.
[5] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.
[6] Ramakant Nevatia,et al. DECK: Discovering Event Composition Knowledge from Web Images for Zero-Shot Event Detection and Recounting in Videos , 2017, AAAI.
[7] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[8] Olivier Buisson,et al. Content-Based Copy Retrieval Using Distortion-Based Probabilistic Similarity Search , 2007, IEEE Transactions on Multimedia.
[9] Wei Liu,et al. BUPT-MCPRL at TRECVID 2012 , 2010, TRECVID.
[10] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[11] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.
[12] Dennis Koelma,et al. The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection , 2016, ICMR.
[13] Yi Yang,et al. A discriminative CNN video representation for event detection , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Mubarak Shah,et al. Recognizing Complex Events Using Large Margin Joint Low-Level Event Model , 2012, ECCV.
[15] Ming-Syan Chen,et al. Video Event Detection by Inferring Temporal Instance Labels , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[16] Alexander J. Smola,et al. Stacked Attention Networks for Image Question Answering , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Ramakant Nevatia,et al. DISCOVER: Discovering Important Segments for Classification of Video Events and Recounting , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[18] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[19] Ramakant Nevatia,et al. ISOMER: Informative Segment Observations for Multimedia Event Recounting , 2014, ICMR.
[20] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.
[21] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[22] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[23] Cordelia Schmid,et al. Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[24] Dong Liu,et al. EventNet: A Large Scale Structured Concept Library for Complex Event Detection in Video , 2015, ACM Multimedia.
[25] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[26] Chong-Wah Ngo,et al. [Invited Paper] Object Pooling for Multimedia Event Detection and Evidence Localization , 2016 .
[27] Yunde Jia,et al. Multimedia event detection via deep spatial-temporal neural networks , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).
[28] Chong-Wah Ngo,et al. On the use of commonsense ontology for multimedia event recounting , 2015, International Journal of Multimedia Information Retrieval.
[29] Yi Yang,et al. DevNet: A Deep Event Network for multimedia event detection and evidence recounting , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Andrea Vedaldi,et al. Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.
[31] Michael R. Lyu,et al. Bridging the Semantic Gap Between Image Contents and Tags , 2010, IEEE Transactions on Multimedia.
[32] Chong-Wah Ngo,et al. VIREO-TNO @ TRECVID 2014: Multimedia Event Detection and Recounting (MED and MER) , 2014, TRECVID.
[33] Dong Liu,et al. Encoding Concept Prototypes for Video Event Detection and Summarization , 2015, ICMR.
[34] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[35] A. Smeaton,et al. TRECVID 2013 -- An Overview of the Goals, Tasks, Data, Evaluation Mechanisms, and Metrics , 2011 .
[36] Koen E. A. van de Sande,et al. Selective Search for Object Recognition , 2013, International Journal of Computer Vision.
[37] Philip H. S. Torr,et al. BING: Binarized normed gradients for objectness estimation at 300fps , 2019, Computational Visual Media.
[38] Cees Snoek,et al. What do 15,000 object categories tell us about classifying and localizing actions? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Cees Snoek,et al. Bag-of-Fragments: Selecting and Encoding Video Fragments for Event Detection and Recounting , 2015, ICMR.
[40] Rong Yan,et al. Can High-Level Concepts Fill the Semantic Gap in Video Retrieval? A Case Study With Broadcast News , 2007, IEEE Transactions on Multimedia.
[41] Gang Hua,et al. Semantic Model Vectors for Complex Video Event Recognition , 2012, IEEE Transactions on Multimedia.
[42] William I. Grosky,et al. Narrowing the semantic gap - improved text-based web document retrieval using visual features , 2002, IEEE Transactions on Multimedia.
[43] Teruko Mitamura,et al. Zero-Example Event Search using MultiModal Pseudo Relevance Feedback , 2014, ICMR.
[44] Nitish Srivastava,et al. Exploiting Image-trained CNN Architectures for Unconstrained Video Classification , 2015, BMVC.
[45] Yi Yang,et al. You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Yongdong Zhang,et al. Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base , 2015, IEEE Transactions on Multimedia.
[47] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[48] Josef Sivic,et al. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[49] Cordelia Schmid,et al. Deep Convolutional Matching , 2015, ArXiv.
[50] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[51] Tao Mei,et al. Super Fast Event Recognition in Internet Videos , 2015, IEEE Transactions on Multimedia.
[52] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[53] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[54] Hui Cheng,et al. Video event recognition using concept attributes , 2013, 2013 IEEE Workshop on Applications of Computer Vision (WACV).
[55] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[56] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.
[57] Larry S. Davis,et al. VRFP: On-the-Fly Video Retrieval Using Web Images and Fast Fisher Vector Products , 2015, IEEE Transactions on Multimedia.
[58] Chong-Wah Ngo,et al. Video Event Detection Using Motion Relativity and Feature Selection , 2014, IEEE Transactions on Multimedia.
[59] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[60] Cees Snoek,et al. VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events , 2014, ACM Multimedia.
[61] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[62] Nicu Sebe,et al. The Many Shades of Negativity , 2017, IEEE Transactions on Multimedia.
[63] Nanning Zheng,et al. ER3: A Unified Framework for Event Retrieval, Recognition and Recounting , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[64] Mubarak Shah,et al. High-level event recognition in unconstrained videos , 2013, International Journal of Multimedia Information Retrieval.
[65] Svetlana Lazebnik,et al. Multi-scale Orderless Pooling of Deep Convolutional Activation Features , 2014, ECCV.
[66] D. T. Lee,et al. Video Event Detection via Multi-modality Deep Learning , 2014, 2014 22nd International Conference on Pattern Recognition.
[67] Yi Yang,et al. Multi-Class Active Learning by Uncertainty Sampling with Diversity Maximization , 2015, International Journal of Computer Vision.
[68] Kavita Bala,et al. Learning visual similarity for product design with convolutional neural networks , 2015, ACM Transactions on Graphics.
[69] Thomas Deselaers,et al. Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[70] Yi Yang,et al. Image Classification by Cross-Media Active Learning With Privileged Information , 2016, IEEE Transactions on Multimedia.