Multimodal feature fusion for robust event detection in web videos
Shuang Wu | Unsang Park | Stavros Tsakalidis | Xiaodan Zhuang | Pradeep Natarajan | Rohit Prasad | Premkumar Natarajan | Shiv Naga Prasad Vitaladevuni
[1] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[2] Jean Ponce,et al. Learning mid-level features for recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[3] David J. Field,et al. Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.
[4] Hao Su,et al. Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification , 2010, NIPS.
[5] Pradeep Natarajan,et al. Efficient Orthogonal Matching Pursuit using sparse random projections for scene and video classification , 2011, 2011 International Conference on Computer Vision.
[6] Ivan Laptev,et al. Improving bag-of-features action recognition with non-local cues , 2010, BMVC.
[7] Arun Ross,et al. Score normalization in multimodal biometric systems , 2005, Pattern Recognit..
[8] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .
[9] Ian H. Witten,et al. The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.
[10] Koen E. A. van de Sande,et al. Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[11] Yang Song,et al. Taxonomic classification for web-based videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[12] Ananth Sankar. Bayesian model combination (BAYCOM) for improved recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[13] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[14] John C. Platt,et al. Fast training of support vector machines using sequential minimal optimization , 1999, Advances in Kernel Methods.
[15] Ivan Laptev,et al. On Space-Time Interest Points , 2005, International Journal of Computer Vision.
[16] Cordelia Schmid,et al. Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[17] Petr Motlícek,et al. Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction , 2010, EURASIP J. Audio Speech Music. Process..
[18] Baoxin Li,et al. YouTubeCat: Learning to categorize wild web videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[19] Daniel P. W. Ellis,et al. Soundtrack classification by transient events , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[20] Martin F. Porter,et al. An algorithm for suffix stripping , 1997, Program.
[21] Bernd Girod,et al. Compressed Histogram of Gradients: A Low-Bitrate Descriptor , 2011, International Journal of Computer Vision.
[22] Jiebo Luo,et al. Large-scale multimodal semantic concept detection for consumer video , 2007, MIR '07.
[23] Luc Van Gool,et al. Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..
[24] Gabriela Csurka,et al. Visual categorization with bags of keypoints , 2004, ECCV 2004.
[25] Pietro Perona,et al. A walk through the web’s video clips , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.
[26] Jiebo Luo,et al. Recognizing realistic actions from videos “in the wild” , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[27] Larry S. Davis,et al. Objects in Action: An Approach for Combining Action Understanding and Object Perception , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
[28] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .
[29] Cordelia Schmid,et al. Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.
[30] Luc Van Gool,et al. SURF: Speeded Up Robust Features , 2006, ECCV.
[31] Anderson Rocha,et al. Robust Fusion: Extreme Value Theory for Recognition Score Normalization , 2010, ECCV.
[32] Bernd Girod,et al. CHoG: Compressed histogram of gradients: A low bit-rate feature descriptor , 2009, CVPR.
[33] S. V. N. Vishwanathan,et al. Multiple Kernel Learning and the SMO Algorithm , 2010, NIPS.
[34] Luciano Sbaiz,et al. Finding meaning on YouTube: Tag recommendation and category discovery , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[35] Cor J. Veenman,et al. Visual Word Ambiguity , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.