Unsupervised Hierarchical Dynamic Parsing and Encoding for Action Recognition
暂无分享,去创建一个
Jiahuan Zhou | Xiaoqing Ding | Bing Su | Ying Wu | Ying Wu | Jiahuan Zhou | Bing Su | Xiaoqing Ding
[1] Gang Hua,et al. Order-Preserving Wasserstein Distance for Sequence Matching , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Ying Wu,et al. Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[3] Fernando De la Torre,et al. Maximum Margin Temporal Clustering , 2012, AISTATS.
[4] Lei Wang,et al. In defense of soft-assignment coding , 2011, 2011 International Conference on Computer Vision.
[5] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[6] Fernando De la Torre,et al. Unsupervised discovery of facial events , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[7] Philip H. S. Torr,et al. Learning discriminative space-time actions from weakly labelled videos , 2012, BMVC.
[8] Tinne Tuytelaars,et al. Modeling video evolution for action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Bing Su,et al. Discriminative Transformation for Multi-Dimensional Temporal Sequences. , 2017, IEEE transactions on image processing : a publication of the IEEE Signal Processing Society.
[10] Sergio Escalera,et al. ChaLearn multi-modal gesture recognition 2013: grand challenge and workshop summary , 2013, ICMI '13.
[11] Philip H. S. Torr,et al. Learning Discriminative Space–Time Action Parts from Weakly Labelled Videos , 2013, International Journal of Computer Vision.
[12] Hanqing Lu,et al. Fusing multi-modal features for gesture recognition , 2013, ICMI '13.
[13] Benjamin Z. Yao,et al. Learning deformable action templates from cluttered videos , 2009, 2009 IEEE 12th International Conference on Computer Vision.
[14] William Brendel,et al. Learning spatiotemporal graphs of human activities , 2011, 2011 International Conference on Computer Vision.
[15] Deva Ramanan,et al. Parsing Videos of Actions with Segmental Grammars , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[16] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.
[17] Cordelia Schmid,et al. Action recognition by dense trajectories , 2011, CVPR 2011.
[18] Limin Wang,et al. Action recognition with trajectory-pooled deep-convolutional descriptors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[20] Limin Wang,et al. MoFAP: A Multi-level Representation for Action Recognition , 2015, International Journal of Computer Vision.
[21] Cordelia Schmid,et al. Actom sequence models for efficient action detection , 2011, CVPR 2011.
[22] Ying Wu,et al. Cross-View Action Modeling, Learning, and Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[23] Adriana Kovashka,et al. Learning a hierarchy of discriminative space-time neighborhood features for human action recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[24] Zhuowen Tu,et al. Action Recognition with Actons , 2013, 2013 IEEE International Conference on Computer Vision.
[25] Xiaodong Yang,et al. Action Recognition Using Super Sparse Coding Vector with Spatio-temporal Awareness , 2014, ECCV.
[26] Sergio Escalera,et al. Multi-modal gesture recognition challenge 2013: dataset and results , 2013, ICMI '13.
[27] Limin Wang,et al. Multi-view Super Vector for Action Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[28] Andrew Zisserman,et al. Improving Human Action Recognition Using Score Distribution and Ranking , 2014, ACCV.
[29] Thomas S. Huang,et al. Image Classification Using Super-Vector Coding of Local Image Descriptors , 2010, ECCV.
[30] Stefano Soatto,et al. Dynamic Textures , 2003, International Journal of Computer Vision.
[31] Patrick Bouthemy,et al. Better Exploiting Motion for Better Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[32] Xiaoqing Ding,et al. Discriminative Dimensionality Reduction for Multi-Dimensional Sequences , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[33] Yu Qiao,et al. Action Recognition with Stacked Fisher Vectors , 2014, ECCV.
[34] Yihong Gong,et al. Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[35] Yun Fu,et al. Modeling Complex Temporal Composition of Actionlets for Activity Prediction , 2012, ECCV.
[36] Yale Song,et al. Action Recognition by Hierarchical Sequence Summarization , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[37] Alex Pentland,et al. Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[38] Ivan Laptev,et al. On Space-Time Interest Points , 2005, International Journal of Computer Vision.
[39] Limin Wang,et al. Latent Hierarchical Model of Temporal Structure for Complex Activity Classification , 2014, IEEE Transactions on Image Processing.
[40] Ying Wu,et al. Heteroscedastic max-min distance analysis , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[41] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[42] Hao Wang,et al. Hierarchical Dynamic Parsing and Encoding for Action Recognition , 2016, ECCV.
[43] Fei-Fei Li,et al. Learning latent temporal structure for complex event detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[44] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[45] Chunfeng Yuan,et al. Human Action Recognition Based on Context-Dependent Graph Kernels , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[46] Barbara Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[47] Shaogang Gong,et al. Beyond Tracking: Modelling Activity and Understanding Behaviour , 2006, International Journal of Computer Vision.
[48] R. Vidal,et al. Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[49] Cordelia Schmid,et al. A Spatio-Temporal Descriptor Based on 3D-Gradients , 2008, BMVC.
[50] Luc Van Gool,et al. Gesture Recognition Portfolios for Personalization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[51] Yihong Gong,et al. Linear spatial pyramid matching using sparse coding for image classification , 2009, CVPR.
[52] Juan Carlos Niebles,et al. Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.
[53] Andrew Zisserman,et al. Domain-Adaptive Discriminative One-Shot Learning of Gestures , 2014, ECCV.
[54] Stephen J. Maybank,et al. Learning Human Actions by Combining Global Dynamics and Local Appearance , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[55] Yang Wang,et al. Hidden Part Models for Human Action Recognition: Probabilistic versus Max Margin , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[56] Junsong Yuan,et al. Learning Actionlet Ensemble for 3D Human Action Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[57] 乔宇. Motionlets: Mid-Level 3D Parts for Human Motion Recognition , 2013 .
[58] Yunde Jia,et al. Parsing video events with goal inference and intent prediction , 2011, 2011 International Conference on Computer Vision.
[59] Ying Wu,et al. Action recognition with multiscale spatio-temporal contexts , 2011, CVPR 2011.
[60] Cordelia Schmid,et al. A Robust and Efficient Video Representation for Action Recognition , 2015, International Journal of Computer Vision.
[61] Cordelia Schmid,et al. Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[62] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[63] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[64] Ying Wu,et al. Learning Maximum Margin Temporal Warping for Action Recognition , 2013, 2013 IEEE International Conference on Computer Vision.
[65] Xiaoqing Ding,et al. Linear Sequence Discriminant Analysis: A Model-Based Dimensionality Reduction Method for Vector Sequences , 2013, 2013 IEEE International Conference on Computer Vision.
[66] Larry S. Davis,et al. Representing Videos Using Mid-level Discriminative Patches , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[67] Chih-Jen Lin,et al. LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..
[68] Iasonas Kokkinos,et al. Discovering discriminative action parts from mid-level video representations , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[69] Cristian Sminchisescu,et al. Conditional Random Fields for Contextual Human Motion Recognition , 2005, ICCV.
[70] Luc Van Gool,et al. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.
[71] Jintao Li,et al. Hierarchical spatio-temporal context modeling for action recognition , 2009, CVPR.
[72] Guo-Jun Qi,et al. Differential Recurrent Neural Networks for Action Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[73] Cordelia Schmid,et al. Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[74] Ramakant Nevatia,et al. Large-scale event detection using semi-hidden Markov models , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[75] Cordelia Schmid,et al. P-CNN: Pose-Based CNN Features for Action Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[76] Nitish Srivastava,et al. Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.
[77] C. Schmid,et al. Recognizing activities with cluster-trees of tracklets , 2012, BMVC.
[78] Lin Sun,et al. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[79] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.