Pose-Appearance Relational Modeling for Video Action Recognition
暂无分享,去创建一个
Liang Wang | Zhenan Sun | Wei Wang | Meng Cui | Kunbo Zhang
[1] F. S. Abas,et al. Robust video content analysis schemes for human action recognition , 2021, Science progress.
[2] Marcus Rohrbach,et al. SMART Frame Selection for Action Recognition , 2020, AAAI.
[3] Mattia Segu,et al. Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps , 2020, ArXiv.
[4] Shlok Kumar Mishra,et al. Pose and Joint-Aware Action Recognition , 2020, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
[5] Luigi Cinque,et al. 2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs , 2020, IEEE Transactions on Multimedia.
[6] Yuxin Peng,et al. Video Captioning With Object-Aware Spatio-Temporal Correlation and Aggregation , 2020, IEEE Transactions on Image Processing.
[7] Zhiyong Wang,et al. Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Dapeng Wu,et al. Global and Local Knowledge-Aware Attention Network for Action Recognition , 2020, IEEE Transactions on Neural Networks and Learning Systems.
[9] Mario Sznaier,et al. Dynamic Motion Representation for Human Action Recognition , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).
[10] Lukasz Kaiser,et al. Reformer: The Efficient Transformer , 2020, ICLR.
[11] Philippe Weinzaepfel,et al. Mimetics: Towards Understanding Human Actions Out of Context , 2019, International Journal of Computer Vision.
[12] Chen Gao,et al. Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition , 2019, NeurIPS.
[13] Fei Wang,et al. SAST: Learning Semantic Action-Aware Spatial-Temporal Features for Efficient Action Recognition , 2019, IEEE Access.
[14] Wei Wu,et al. STM: SpatioTemporal and Motion Encoding for Action Recognition , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[15] Du Q. Huynh,et al. Hallucinating IDT Descriptors and I3D Optical Flow Features for Action Recognition With CNNs , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[16] Yali Wang,et al. PA3D: Pose-Action 3D Machine for Video Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Yansong Tang,et al. Learning Semantics-Preserving Attention and Contextual Interaction for Group Activity Recognition , 2019, IEEE Transactions on Image Processing.
[18] Alexander Wong,et al. STAR-Net: Action Recognition using Spatio-Temporal Activation Reprojection , 2019, 2019 16th Conference on Computer and Robot Vision (CRV).
[19] Ling-Yu Duan,et al. Unified Spatio-Temporal Attention Networks for Action Recognition in Videos , 2019, IEEE Transactions on Multimedia.
[20] Chuang Gan,et al. TSM: Temporal Shift Module for Efficient Video Understanding , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[21] Xiao Liu,et al. StNet: Local and Global Spatial-Temporal Modeling for Action Recognition , 2018, AAAI.
[22] Xiaoyan Sun,et al. Temporal–Spatial Mapping for Action Recognition , 2018, IEEE Transactions on Circuits and Systems for Video Technology.
[23] Yi Li,et al. RESOUND: Towards Action Recognition Without Representation Bias , 2018, ECCV.
[24] Taiki Sekii,et al. Pose Proposal Networks , 2018, ECCV.
[25] Cordelia Schmid,et al. PoTion: Pose MoTion Representation for Action Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[26] Dahua Lin,et al. Recognize Actions by Disentangling Components of Dynamics , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[27] Lei Shi,et al. Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Thomas Brox,et al. ECO: Efficient Convolutional Network for Online Video Understanding , 2018, ECCV.
[29] Ling Shao,et al. Deep Action Parsing in Videos With Large-Scale Synthesized Data , 2018, IEEE Transactions on Image Processing.
[30] Jun Zhang,et al. Attend It Again: Recurrent Attention Convolutional Neural Network for Action Recognition , 2018 .
[31] David Picard,et al. 2D/3D Pose Estimation and Action Recognition Using Multitask Deep Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[32] Dahua Lin,et al. Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition , 2018, AAAI.
[33] Yutaka Satoh,et al. Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet? , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[34] Limin Wang,et al. Appearance-and-Relation Networks for Video Classification , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[35] Luc Van Gool,et al. Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification , 2017, ArXiv.
[36] Gang Yu,et al. Cascaded Pyramid Network for Multi-person Pose Estimation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[37] Yu Qiao,et al. RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[38] Cordelia Schmid,et al. LCR-Net: Localization-Classification-Regression for Human Pose , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Fabio Viola,et al. The Kinetics Human Action Video Dataset , 2017, ArXiv.
[41] Ghassan Al-Regib,et al. TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition , 2017, Signal Process. Image Commun..
[42] Sridha Sridharan,et al. Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).
[43] Pichao Wang,et al. Joint Distance Maps Based Action Recognition With Convolutional Neural Networks , 2017, IEEE Signal Processing Letters.
[44] Thomas Brox,et al. FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Yaser Sheikh,et al. Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Tiejun Huang,et al. Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN , 2016, IEEE Transactions on Multimedia.
[47] Luc Van Gool,et al. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.
[48] Gang Wang,et al. Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.
[49] Gang Wang,et al. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[50] Mirella Lapata,et al. Long Short-Term Memory-Networks for Machine Reading , 2016, EMNLP.
[51] Ruslan Salakhutdinov,et al. Action Recognition using Visual Attention , 2015, NIPS 2015.
[52] Yong Du,et al. Hierarchical recurrent neural network for skeleton based action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[53] Thomas Brox,et al. FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[54] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[55] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[56] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[57] Cordelia Schmid,et al. Towards Understanding Action Recognition , 2013, 2013 IEEE International Conference on Computer Vision.
[58] Weiyu Zhang,et al. From Actemes to Action: A Strongly-Supervised Representation for Detailed Action Understanding , 2013, 2013 IEEE International Conference on Computer Vision.
[59] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[60] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[61] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[62] Cordelia Schmid,et al. Action recognition by dense trajectories , 2011, CVPR 2011.
[63] Jiebo Luo,et al. Recognizing realistic actions from videos “in the wild” , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[64] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[65] Horst Bischof,et al. A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.
[66] I. Laptev,et al. On Space-Time Interest Points , 2005, ICCV 2005.
[67] B. Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[68] S. Hochreiter,et al. Long Short-Term Memory , 1997, Neural Computation.
[69] Xiangjiu Che,et al. R-STAN: Residual Spatial-Temporal Attention Network for Action Recognition , 2019, IEEE Access.