LAMV: Learning to Align and Match Videos with Kernelized Temporal Layers
暂无分享,去创建一个
Matthijs Douze | Rita Cucchiara | Hervé Jégou | Lorenzo Baraldi | H. Jégou | L. Baraldi | R. Cucchiara | Matthijs Douze
[1] Ivan Laptev,et al. Learnable pooling with Context Gating for video classification , 2017, ArXiv.
[2] Cordelia Schmid,et al. Event Retrieval in Large Video Collections with Circulant Temporal Encoding , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[3] Rui Caseiro,et al. High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[4] David A. Shamma,et al. YFCC100M , 2015, Commun. ACM.
[5] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[6] Jiajun Wang,et al. Partial Copy Detection in Videos: A Benchmark and an Evaluation of Popular Methods , 2016, IEEE Transactions on Big Data.
[7] Basura Fernando,et al. Unsupervised Human Action Detection by Action Matching , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[8] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[9] Luc Van Gool,et al. Deep Temporal Linear Encoding Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Ronan Sicre,et al. Particular object retrieval with integral max-pooling of CNN activations , 2015, ICLR.
[11] Hervé Jégou,et al. Orientation Covariant Aggregation of Local Descriptors with Embeddings , 2014, ECCV.
[12] Tinne Tuytelaars,et al. Modeling video evolution for action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Shin'ichi Satoh,et al. Temporal Matching Kernel with Explicit Feature Maps , 2015, ACM Multimedia.
[14] Nanning Zheng,et al. ER3: A Unified Framework for Event Retrieval, Recognition and Recounting , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[15] Ondrej Chum. Low Dimensional Explicit Feature Maps , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[16] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[17] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[18] Hervé Jégou,et al. Kernel Local Descriptors with Implicit Rotation Matching , 2015, ICMR.
[19] Rui Caseiro,et al. Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.
[20] Andrew Zisserman,et al. Efficient additive kernels via explicit feature maps , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[21] Albert Gordo,et al. Deep Image Retrieval: Learning Global Representations for Image Search , 2016, ECCV.
[22] Christopher Joseph Pal,et al. Describing Videos by Exploiting Temporal Structure , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[23] Cordelia Schmid,et al. Stable Hyper-pooling and Query Expansion for Event Detection , 2013, 2013 IEEE International Conference on Computer Vision.
[24] Cordelia Schmid,et al. Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.
[25] Benjamin Recht,et al. Random Features for Large-Scale Kernel Machines , 2007, NIPS.
[26] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Tinne Tuytelaars,et al. Rank Pooling for Action Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[28] Xiaogang Wang,et al. Object Detection from Video Tubelets with Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Cordelia Schmid,et al. Circulant Temporal Encoding for Video Retrieval and Temporal Alignment , 2015, International Journal of Computer Vision.
[30] Jiajun Wang,et al. VCDB: A Large-Scale Database for Partial Copy Detection in Videos , 2014, ECCV.
[31] Apostol Natsev,et al. YouTube-8M: A Large-Scale Video Classification Benchmark , 2016, ArXiv.
[32] Antonis A. Argyros,et al. Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[33] Ross B. Girshick,et al. Mask R-CNN , 2017, 1703.06870.