Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification
Jun Wang | Xiangyang Xue | Yu-Gang Jiang | Zuxuan Wu | Jian Pu
[1] Yoshua Bengio,et al. Multi-Task Learning for Stock Selection , 1996, NIPS.
[2] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.
[3] John R. Smith,et al. Multimedia semantic indexing using model vectors , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings.
[4] Michael I. Jordan,et al. Multiple kernel learning, conic duality, and the SMO algorithm , 2004, ICML.
[5] Antonio Torralba,et al. Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.
[6] Cees G. M. Snoek,et al. Early versus late fusion in semantic video analysis , 2005, MULTIMEDIA '05.
[7] Massimiliano Pontil,et al. Convex multi-task feature learning , 2008, Machine Learning.
[8] Tao Mei,et al. Correlative multi-label video annotation , 2007, ACM Multimedia.
[9] Andrea Vedaldi,et al. Objects in Context , 2007, 2007 IEEE 11th International Conference on Computer Vision.
[10] Cordelia Schmid,et al. Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[11] Jean-Philippe Vert,et al. Clustered Multi-Task Learning: A Convex Formulation , 2008, NIPS.
[12] Subhransu Maji,et al. Classification using intersection kernel support vector machines is efficient , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[13] T. Stanford,et al. Multisensory integration: current issues from the perspective of the single neuron , 2008, Nature Reviews Neuroscience.
[14] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[15] Jiebo Luo,et al. Heterogeneous feature machines for visual recognition , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[16] Rong Yan,et al. Large-scale multimedia semantic concept modeling using robust subspace bagging and MapReduce , 2009, LS-MMRM '09.
[17] Andrew Zisserman,et al. Multiple kernels for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[18] Shih-Fu Chang,et al. Short-term audio-visual atoms for generic video concept classification , 2009, ACM Multimedia.
[19] Jieping Ye,et al. Multi-Task Feature Learning Via Efficient ℓ2,1-Norm Minimization , 2009, UAI.
[20] Xiangyang Xue,et al. A novel audio fingerprinting method robust to time scale modification and pitch shifting , 2010, ACM Multimedia.
[21] Dit-Yan Yeung,et al. A Convex Formulation for Learning Task Relationships in Multi-Task Learning , 2010, UAI.
[22] Mohan S. Kankanhalli,et al. Multimodal fusion for multimedia analysis: a survey , 2010, Multimedia Systems.
[23] Alexander C. Loui,et al. Audio-visual grouplet: temporal audio-visual interactions for general video concept classification , 2011, ACM Multimedia.
[24] Jiayu Zhou,et al. Integrating low-rank and group-sparse structures for robust multi-task learning , 2011, KDD.
[25] Juhan Nam,et al. Multimodal Deep Learning , 2011, ICML.
[26] Shih-Fu Chang,et al. Consumer video understanding: a benchmark database and an evaluation of human and machine performance , 2011, ICMR.
[27] Jiayu Zhou,et al. A multi-task learning formulation for predicting disease progression , 2011, KDD.
[28] Kristen Grauman,et al. Learning with Whom to Share in Multi-task Feature Learning , 2011, ICML.
[29] G. DeAngelis,et al. A Normalization Model of Multisensory Integration , 2011, Nature Neuroscience.
[30] Hongliang Fei,et al. Structured Feature Selection and Task Relationship Inference for Multi-task Learning , 2011, ICDM.
[31] Chong-Wah Ngo,et al. Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation , 2012, IEEE Transactions on Image Processing.
[32] Nitish Srivastava,et al. Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res.
[33] Daoqiang Zhang,et al. Multi-modal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer's disease , 2012, NeuroImage.
[34] Dong Liu,et al. BBNVISER: BBN VISER TRECVID 2012 Multimedia Event Detection and Multimedia Event Recounting Systems , 2012, TRECVID.
[35] Dong Liu,et al. Robust late fusion with rank minimization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[36] Shuang Wu,et al. Multimodal feature fusion for robust event detection in web videos , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[37] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[38] Cristian Sminchisescu,et al. Dynamic Eye Movement Datasets and Learnt Saliency Models for Visual Action Recognition , 2012, ECCV.
[39] Yung-Yu Chuang,et al. Cross-Domain Multicue Fusion for Concept-Based Video Indexing , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[40] Andrew Zisserman,et al. Three things everyone should know to improve object retrieval , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[41] Cordelia Schmid,et al. Action and Event Recognition with Fisher Vectors on a Compact Feature Set , 2013, 2013 IEEE International Conference on Computer Vision.
[42] Xiangyang Xue,et al. Multiple Task Learning Using Iteratively Reweighted Least Square , 2013, IJCAI.
[43] Samy Bengio,et al. Using Web Co-occurrence Statistics for Improving Image Categorization , 2013, ArXiv.
[44] Dong Liu,et al. Discovering joint audio–visual codewords for video event detection , 2013, Machine Vision and Applications.
[45] Florian Metze,et al. CMU-Informedia @ TRECVID 2013 Multimedia Event Detection , 2013.
[46] Andrew Zisserman,et al. Deep Fisher Networks for Large-Scale Image Classification , 2013, NIPS.
[47] Daniel P. W. Ellis,et al. Subband autocorrelation features for video soundtrack classification , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[48] Dong Liu,et al. Sample-Specific Late Fusion for Visual Category Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[49] Patrick Bouthemy,et al. Better Exploiting Motion for Better Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[50] Cordelia Schmid,et al. The AXES submissions at TRECVID 2013 , 2013, TRECVID.
[51] Thomas Mensink,et al. Image Classification with the Fisher Vector: Theory and Practice , 2013, International Journal of Computer Vision.
[52] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[53] Nicu Sebe,et al. Feature Weighting via Optimal Thresholding for Video Analysis , 2013, 2013 IEEE International Conference on Computer Vision.
[54] Xirong Li,et al. Few-Example Video Event Retrieval using Tag Propagation , 2014, ICMR.