Informedia@TrecVID 2014: MED and MER
暂无分享,去创建一个
Shiguang Shan | Florian Metze | Alexander Hauptmann | Bhiksha Raj | Rita Singh | Pinar Duygulu Sahin | Chuang Gan | Huan Li | Deyu Meng | Xiaojun Chang | Yi Yang | Teruko Mitamura | Ming Lin | Anurag Kumar | Yajie Miao | Yang Cai | Lu Jiang | Shicheng Xu | Shoou-I Yu | Susanne Burger | Zhigang Ma | Lara J. Martin | Xingzhong Du | Zhongwen Xu | Zexi Mao | Xuanchong Li | Lara Martin | Zhenzhong Lan | Nikolas Wolfe | Richard Stern | S. Shan | P. D. Sahin | Zhongwen Xu | Shoou-I Yu | Florian Metze | Chuang Gan | Alexander Hauptmann | Xiaojun Chang | Yi Yang | Lu Jiang | Deyu Meng | B. Raj | Y. Miao | R. Stern | Rita Singh | Anurag Kumar | T. Mitamura | Huan Li | Zhenzhong Lan | Nikolas Wolfe | Zexi Mao | Xingzhong Du | Yang Cai | Ming Lin | Zhigang Ma | Xuanchong Li | Susanne Burger | Shicheng Xu
[1] Cordelia Schmid,et al. Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.
[2] Teruko Mitamura,et al. Zero-Example Event Search using MultiModal Pseudo Relevance Feedback , 2014, ICMR.
[3] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.
[4] A. Waibel,et al. A one-pass decoder based on polymorphic linguistic context assignment , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..
[5] Daphne Koller,et al. Self-Paced Learning for Latent Variable Models , 2010, NIPS.
[6] Carlos Busso,et al. IEMOCAP: interactive emotional dyadic motion capture database , 2008, Lang. Resour. Evaluation.
[7] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[8] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[9] Björn W. Schuller,et al. Recent developments in openSMILE, the munich open-source multimedia feature extractor , 2013, ACM Multimedia.
[10] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.
[11] Deyu Meng,et al. Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search , 2014, ACM Multimedia.
[12] Douglas E. Sturim,et al. Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.
[13] Alexander G. Hauptmann,et al. Temporal Extension of Scale Pyramid and Spatial Pyramid Matching for Action Recognition , 2014, ArXiv.
[14] Alexander G. Hauptmann,et al. Leveraging high-level and low-level features for multimedia event detection , 2012, ACM Multimedia.
[15] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[16] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[17] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..
[18] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[19] Yi Yang,et al. Resource Constrained Multimedia Event Detection , 2014, MMM.
[20] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[21] Bhiksha Raj,et al. Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification , 2011, INTERSPEECH.
[22] John Platt,et al. Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .
[23] Alexander G. Hauptmann,et al. MoSIFT: Recognizing Human Actions in Surveillance Videos , 2009 .
[24] Bhiksha Raj,et al. Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[26] Ming Yang,et al. Surveillance Event Detection , 2008, TRECVID.
[27] Alexander G. Hauptmann,et al. Instructional Videos for Unsupervised Harvesting and Learning of Action Examples , 2014, ACM Multimedia.
[28] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[29] Andrew Zisserman,et al. The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.
[30] John D. Lafferty,et al. A study of smoothing methods for language models applied to Ad Hoc information retrieval , 2001, SIGIR '01.
[31] Koen E. A. van de Sande,et al. Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[32] Frédéric Jurie,et al. Modeling spatial layout with fisher vectors for image categorization , 2011, 2011 International Conference on Computer Vision.
[33] Cordelia Schmid,et al. Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[34] Florian Metze,et al. CMU-Informedia @ TRECVID 2013 Multimedia Event Detection , 2013 .
[35] Andrew Zisserman,et al. Efficient Additive Kernels via Explicit Feature Maps , 2012, IEEE Trans. Pattern Anal. Mach. Intell..
[36] Wei Liu,et al. Multimedia classification and event detection using double fusion , 2013, Multimedia Tools and Applications.
[37] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.
[38] Yi Yang,et al. A discriminative CNN video representation for event detection , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Shiguang Shan,et al. Self-Paced Learning with Diversity , 2014, NIPS.