Towards Efficient Learning of Optimal Spatial Bag-of-Words Representations
暂无分享,去创建一个
Deyu Meng | Alexander G. Hauptmann | Wei Tong | Lu Jiang | Alexander Hauptmann | Lu Jiang | Deyu Meng | Wei Tong
[1] Georges Quénot,et al. TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.
[2] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .
[3] Yihong Gong,et al. Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.
[4] Yihong Gong,et al. Linear spatial pyramid matching using sparse coding for image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[5] Lei Bao,et al. Informedia@TRECVID 2011: Surveillance Event Detection , 2011, TRECVID.
[6] Trevor Darrell,et al. Beyond spatial pyramids: Receptive field learning for pooled image features , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[7] Ming Yang,et al. Surveillance Event Detection , 2008, TRECVID.
[8] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[9] Alexei A. Efros,et al. Unbiased look at dataset bias , 2011, CVPR 2011.
[10] Cordelia Schmid,et al. Discriminative spatial saliency for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[11] Thomas S. Huang,et al. Efficient Highly Over-Complete Sparse Coding Using a Mixture Model , 2010, ECCV.
[12] Jorma Laaksonen,et al. Spatial extensions to bag of visual words , 2009, CIVR '09.
[13] Yi Yang,et al. E-LAMP: integration of innovative ideas for multimedia event detection , 2013, Machine Vision and Applications.
[14] Cor J. Veenman,et al. Visual Word Ambiguity , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[15] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[16] Yihong Gong,et al. Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[17] Gaurav Sharma,et al. Learning discriminative spatial representation for image classification , 2011, BMVC.
[18] A. Smeaton,et al. TRECVID 2013 -- An Overview of the Goals, Tasks, Data, Evaluation Mechanisms, and Metrics | NIST , 2011 .
[19] Alexander G. Hauptmann,et al. MoSIFT: Recognizing Human Actions in Surveillance Videos , 2009 .
[20] Jean Ponce,et al. A Theoretical Analysis of Feature Pooling in Visual Recognition , 2010, ICML.
[21] Marcel Worring,et al. Fusing concept detection and geo context for visual search , 2012, ICMR.
[22] Koen E. A. van de Sande,et al. Recommendations for video event recognition using concept vocabularies , 2013, ICMR.
[23] Alexander G. Hauptmann,et al. Leveraging high-level and low-level features for multimedia event detection , 2012, ACM Multimedia.
[24] Cordelia Schmid,et al. Learning Object Representations for Visual Object Class Recognition , 2007, ICCV 2007.
[25] Bingbing Ni,et al. Geometric ℓp-norm feature pooling for image classification , 2011, CVPR 2011.
[26] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.
[27] Cordelia Schmid,et al. Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.
[28] H. Sharp. Cardinality of finite topologies , 1968 .
[29] M. C. Er,et al. A Fast Algorithm for Generating Set Partitions , 1988, Comput. J..
[30] Chong-Wah Ngo,et al. VIREO-TNO @ TRECVID 2014: Multimedia Event Detection and Recounting (MED and MER) , 2014, TRECVID.
[31] Jean Ponce,et al. Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[32] Richard M. Stern,et al. Informedia e-lamp @ TRECVID 2012 multimedia event detection and recounting MED and MER , 2012 .