Deep Learning for Action and Gesture Recognition in Image Sequences: A Survey
Sergio Escalera | Xavier Baró | Hugo Jair Escalante | Shohreh Kasaei | Maryam Asadi-Aghbolaghi | Albert Clapés | Isabelle Guyon | Víctor Ponce-López | Marco Bellantonio
[1] Xiaogang Wang,et al. Multi-source Deep Learning for Human Pose Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[2] Ricardo Chavarriaga,et al. Detecting anomalies to improve classification performance in opportunistic sensor networks , 2011, 2011 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops).
[3] Yu Qiao,et al. Action Recognition with Stacked Fisher Vectors , 2014, ECCV.
[4] Richard P. Wildes,et al. Spatiotemporal Residual Networks for Video Action Recognition , 2016, NIPS.
[5] Christian Wolf,et al. Multi-scale Deep Learning for Gesture Detection and Localization , 2014, ECCV Workshops.
[6] Shmuel Peleg,et al. Compact CNN for indexing egocentric videos , 2015, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).
[7] Jonathan Tompson,et al. MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation , 2014, ACCV.
[8] Junji Yamato,et al. Recognizing human action in time-sequential images using hidden Markov model , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[9] Alberto Del Bimbo,et al. L1-regularized Logistic Regression Stacking and Transductive CRF Smoothing for Action Recognition in Video , 2013 .
[10] Guoyuan Liang,et al. A vision-based robotic grasping system using deep learning for 3D object recognition and pose estimation , 2013, 2013 IEEE International Conference on Robotics and Biomimetics (ROBIO).
[11] C. Lawrence Zitnick,et al. Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.
[12] Cees G. M. Snoek,et al. University of Amsterdam at THUMOS Challenge 2014 , 2014 .
[13] Wen Gao,et al. Deep Alternative Neural Network: Exploring Contexts as Early as Possible for Action Recognition , 2016, NIPS.
[14] Christian Wolf,et al. Sequential Deep Learning for Human Action Recognition , 2011, HBU.
[15] Isabelle Guyon,et al. Principal motion components for gesture recognition using a single-example , 2013, ArXiv.
[16] Yiannis Kompatsiaris,et al. Moving camera human activity localization and recognition with motionplanes and multiple homographies , 2015, 2015 IEEE International Conference on Image Processing (ICIP).
[17] Qinjun Xu,et al. Learning semantic context feature-tree for action recognition via nearest neighbor fusion , 2016, Neurocomputing.
[18] Matthew J. Hausknecht,et al. Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[19] Truong Q. Nguyen,et al. Real-time sign language fingerspelling recognition using convolutional neural networks from depth map , 2015, 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).
[20] Mohamed R. Amer,et al. Monte Carlo Tree Search for Scheduling Activity Recognition , 2013, 2013 IEEE International Conference on Computer Vision.
[21] Antoni B. Chan,et al. Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[22] James A. Reggia,et al. Robust human action recognition via long short-term memory , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).
[23] Limin Wang,et al. Action and Gesture Temporal Spotting with Super Vector Representation , 2014, ECCV Workshops.
[24] Andrea Vedaldi,et al. Dynamic Image Networks for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Larry S. Davis,et al. AVSS 2011 demo session: A large-scale benchmark dataset for event recognition in surveillance video , 2011, AVSS.
[26] Greg Mori,et al. A Hierarchical Deep Temporal Model for Group Activity Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Sander Dieleman,et al. Beyond Temporal Pooling: Recurrence and Temporal Convolutions for Gesture Recognition in Video , 2015, International Journal of Computer Vision.
[28] Houqiang Li,et al. Sign Language Recognition using 3D convolutional neural networks , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).
[29] Bernard Ghanem,et al. ActivityNet: A large-scale video benchmark for human activity understanding , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Richard P. Wildes,et al. Spatiotemporal Multiplier Networks for Video Action Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[31] Zhe Wang,et al. Exploring Fisher vector and deep networks for action spotting , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[32] Ling Shao,et al. Deep Dynamic Neural Networks for Multimodal Gesture Segmentation and Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[33] Yun Yang,et al. Action recognition from depth sequences using weighted fusion of 2D and 3D auto-correlation of gradients features , 2016, Multimedia Tools and Applications.
[34] Lior Wolf,et al. RNN Fisher Vectors for Action Recognition and Image Annotation , 2015, ECCV.
[35] Stefan Wermter,et al. Gesture Recognition with a Convolutional Long Short-Term Memory Recurrent Neural Network , 2016, ESANN.
[36] Hairong Qi,et al. Group Sparsity and Geometry Constrained Dictionary Learning for Action Recognition from Depth Maps , 2013, 2013 IEEE International Conference on Computer Vision.
[37] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[38] Gang Sun,et al. A Key Volume Mining Deep Framework for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Patrick Bouthemy,et al. Optical flow modeling and computation: A survey , 2015, Comput. Vis. Image Underst..
[40] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[41] Jonghyun Choi,et al. ActionFlowNet: Learning Motion Representation for Action Recognition , 2016, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).
[42] Xin Xu,et al. Large-scale gesture recognition with a fusion of RGB-D data based on the C3D model , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[43] Ali Farhadi,et al. Actions ~ Transformations , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[44] Bingbing Ni,et al. Progressively Parsing Interactional Objects for Fine Grained Action Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Cordelia Schmid,et al. Multi-region Two-Stream R-CNN for Action Detection , 2016, ECCV.
[46] Hossein Mousavi Hondori,et al. A Review on Technical and Clinical Impact of Microsoft Kinect on Physical Therapy and Rehabilitation , 2014, Journal of medical engineering.
[47] Wei Wang,et al. How scenes imply actions in realistic videos? , 2016, 2016 IEEE International Conference on Image Processing (ICIP).
[48] Antoni B. Chan,et al. Maximum-Margin Structured Learning with Deep Networks for 3D Human Pose Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[49] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[50] Michael Jones,et al. An improved deep learning architecture for person re-identification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[51] Gonen Eren,et al. Evaluation of video activity localizations integrating quality and quantity measurements , 2014, Comput. Vis. Image Underst..
[52] Koichi Shinoda,et al. Cross-view human action recognition from depth maps using spectral graph sequences , 2017, Comput. Vis. Image Underst..
[53] C. V. Jawahar,et al. First Person Action Recognition Using Deep Learned Descriptors , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[54] Nitish Srivastava,et al. Initialization Strategies of Spatio-Temporal Convolutional Neural Networks , 2015, ArXiv.
[55] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.
[56] Ajmal Mian,et al. Learning a Deep Model for Human Action Recognition from Novel Viewpoints , 2016 .
[57] Juan Song,et al. Large-scale Isolated Gesture Recognition using pyramidal 3D convolutional networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[58] Yang Wang,et al. Improving Human Action Recognition by Non-action Classification , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[59] Bingbing Ni,et al. Temporal Action Localization with Pyramid of Score Distribution Features , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[60] Xiangji Huang,et al. Deep learning for healthcare decision making with EMRs , 2014, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).
[61] Nicu Sebe,et al. Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos , 2017, MMM.
[62] Suman Saha,et al. Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos , 2016, BMVC.
[63] Cordelia Schmid,et al. Encoding Feature Maps of CNNs for Action Recognition , 2015 .
[64] Zhe Wang,et al. Towards Good Practices for Very Deep Two-Stream ConvNets , 2015, ArXiv.
[65] Koen E. A. van de Sande,et al. Selective Search for Object Recognition , 2013, International Journal of Computer Vision.
[66] Hua Han,et al. Sequentially Supervised Long Short-Term Memory for Gesture Recognition , 2016, Cognitive Computation.
[67] Vijay John,et al. Deep Learning-Based Fast Hand Gesture Recognition Using Representative Frames , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).
[68] Mohan S. Kankanhalli,et al. Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[69] Lei Chen,et al. Deep Structured Models For Group Activity Recognition , 2015, BMVC.
[70] Ricardo Chavarriaga,et al. Benchmarking classification techniques using the Opportunity human activity dataset , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.
[71] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.
[72] Alexander C. Berg,et al. Combining multiple sources of knowledge in deep CNNs for action recognition , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).
[73] Ming Shao,et al. A Multi-stream Bi-directional Recurrent Neural Network for Fine-Grained Action Detection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[74] Lin Sun,et al. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[75] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.
[76] Alberto Montes Gómez. Temporal activity detection in untrimmed videos with recurrent neural networks , 2016 .
[77] J. Scharcanski,et al. Computer Vision Techniques for the Diagnosis of Skin Cancer , 2013 .
[78] Gang Wang,et al. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[79] Christian Wolf,et al. Action Classification in Soccer Videos with Long Short-Term Memory Recurrent Neural Networks , 2010, ICANN.
[80] Li Fei-Fei,et al. End-to-End Learning of Action Detection from Frame Glimpses in Videos , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[81] Christian Wolf,et al. ModDrop: Adaptive Multi-Modal Gesture Recognition , 2014, IEEE Trans. Pattern Anal. Mach. Intell..
[82] Isabelle Guyon,et al. Principal motion components for one-shot gesture recognition , 2017, Pattern Analysis and Applications.
[83] Yutaka Satoh,et al. Human Action Recognition Without Human , 2016, ECCV Workshops.
[84] Dimitris Samaras,et al. Action Detection with Improved Dense Trajectories and Sliding Window , 2014, ECCV Workshops.
[85] Anupam Agrawal,et al. A survey on activity recognition and behavior understanding in video surveillance , 2012, The Visual Computer.
[86] Sergio Escalera,et al. ChaLearn Joint Contest on Multimedia Challenges Beyond Visual Analysis: An overview , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[87] Luc Van Gool,et al. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.
[88] Haroon Idrees,et al. Action Localization in Videos through Context Walk , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[89] Prakash Ishwar,et al. Two-Stream CNNs for Gesture-Based Verification and Identification: Learning User Style , 2016, CVPR 2016.
[90] Yu-Gang Jiang,et al. Harnessing Object and Scene Semantics for Large-Scale Video Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[91] Nuno Vasconcelos,et al. VLAD3: Encoding Dynamics of Deep Features for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[92] Juergen Gall,et al. Temporal Action Detection Using a Statistical Language Model , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[93] Hugo Jair Escalante,et al. Late fusion of heterogeneous methods for multimedia image retrieval , 2008, MIR '08.
[94] Yong Pei,et al. Integrating multi-stage depth-induced contextual information for human action recognition and localization , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).
[95] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[96] Yingli Tian,et al. Embedding Sequential Information into Spatiotemporal Features for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[97] Yu Qiao,et al. Hybrid Super Vector with Improved Dense Trajectories for Action Recognition , 2013 .
[98] Limin Wang,et al. Action recognition with trajectory-pooled deep-convolutional descriptors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[99] Behrooz Mahasseni,et al. Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[100] Ajmal Mian,et al. 3D Action Recognition from Novel Viewpoints , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[101] Hugo Jair Escalante,et al. A naïve Bayes baseline for early gesture recognition , 2016, Pattern Recognit. Lett..
[102] Sepp Hochreiter,et al. Untersuchungen zu dynamischen neuronalen Netzen , 1991 .
[103] Geoffrey Zweig,et al. An introduction to computational networks and the computational network toolkit (invited talk) , 2014, INTERSPEECH.
[104] Jitendra Malik,et al. Finding action tubes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[105] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..
[106] Yuanjun Xiong,et al. CUHK & SIAT Submission for THUMOS 15 Action Recognition Challenge , 2015 .
[107] Deva Ramanan,et al. Video Annotation and Tracking with Active Learning , 2011, NIPS.
[108] Luc Van Gool,et al. DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[109] Alexei A. Efros,et al. Guest Editorial: Big Data , 2016, International Journal of Computer Vision.
[110] Song Han,et al. Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.
[111] Xiangyang Ji,et al. Action Recognition with Joint Attention on Multi-Level Deep Features , 2016, ArXiv.
[112] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[113] Cordelia Schmid,et al. A Robust and Efficient Video Representation for Action Recognition , 2015, International Journal of Computer Vision.
[114] John Salvatier,et al. Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.
[115] Jonathan Tompson,et al. Learning Human Pose Estimation Features with Convolutional Networks , 2013, ICLR.
[116] Nitish Srivastava,et al. Exploiting Image-trained CNN Architectures for Unconstrained Video Classification , 2015, BMVC.
[117] Barbara Lewandowska-Tomaszczyk,et al. Affective Robotics: Modelling and Testing Cultural Prototypes , 2014, Cognitive Computation.
[118] Bernard Ghanem,et al. DAPs: Deep Action Proposals for Action Understanding , 2016, ECCV.
[119] Pavlo Molchanov,et al. Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[120] Hideki Nakayama,et al. Multimodal Gesture Recognition Using Multi-stream Recurrent Neural Network , 2015, PSIVT.
[121] Jun Wan,et al. Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition , 2016, ArXiv.
[122] Ruzena Bajcsy,et al. Bio-inspired Dynamic 3D Discriminative Skeletal Features for Human Action Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[123] Daniel Roggen,et al. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition , 2016, Sensors.
[124] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.
[125] Bingbing Ni,et al. Interaction part mining: A mid-level approach for fine-grained action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[126] Wei Chen,et al. Action Detection by Implicit Intentional Motion Clustering , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[127] Yong Du,et al. Hierarchical recurrent neural network for skeleton based action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[128] Zhi Liu,et al. 3D-based Deep Convolutional Neural Network for action recognition with depth sequences , 2016, Image Vis. Comput..
[129] Andrew Zisserman,et al. Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[130] Hanqing Lu,et al. Fusing multi-modal features for gesture recognition , 2013, ICMI '13.
[131] Cees Snoek,et al. Spot On: Action Localization from Pointly-Supervised Proposals , 2016, ECCV.
[132] Cordelia Schmid,et al. Long-Term Temporal Convolutions for Action Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[133] Cees Snoek,et al. Objects2action: Classifying and Localizing Actions without Any Video Example , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[134] Gang Wang,et al. Spatio-Temporal LSTM with Trust Gates for 3D Human Action Recognition , 2016, ECCV.
[135] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[136] Yi Yang,et al. UTS-CMU at THUMOS 2015 , 2015 .
[137] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[138] Hermann Ney,et al. Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[139] Samy Bengio,et al. Torch: a modular machine learning software library , 2002 .
[140] Ling Shao,et al. Kernelized Multiview Projection for Robust Action Recognition , 2016, International Journal of Computer Vision.
[141] Mohan M. Trivedi,et al. Hand Gesture Recognition in Real Time for Automotive Interfaces: A Multimodal Vision-Based Approach and Evaluations , 2014, IEEE Transactions on Intelligent Transportation Systems.
[142] Rama Chellappa,et al. Statistical analysis on Stiefel and Grassmann manifolds with applications in computer vision , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[143] Bowen Zhang,et al. Real-Time Action Recognition with Enhanced Motion Vector CNNs , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[144] Nicholas Rhinehart,et al. Learning Action Maps of Large Environments via First-Person Vision , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[145] Immanuel Bayer,et al. A multi modal approach to gesture recognition from audio and video data , 2013, ICMI '13.
[146] Giulio Paci,et al. A Multi-scale Approach to Gesture Detection and Recognition , 2013, 2013 IEEE International Conference on Computer Vision Workshops.
[147] Greg Mori,et al. Structure Inference Machines: Recurrent Neural Networks for Analyzing Relations in Group Activity Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[148] Tao Mei,et al. MSR Asia MSM at THUMOS Challenge 2015 , 2015 .
[149] Christian Wolf,et al. Hand Segmentation with Structured Convolutional Learning , 2014, ACCV.
[150] Benjamin Schrauwen,et al. Sign Language Recognition Using Convolutional Neural Networks , 2014, ECCV Workshops.
[151] Pichao Wang,et al. Large-scale Isolated Gesture Recognition using Convolutional Neural Networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[152] Xilin Chen,et al. Two streams Recurrent Neural Networks for Large-Scale Continuous Gesture Recognition , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[153] Jakub Konecný,et al. One-shot-learning gesture recognition using HOG-HOF features , 2014, J. Mach. Learn. Res..
[154] Pavlo Molchanov,et al. Hand gesture recognition with 3D convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[155] Gang Wang,et al. Deep Multimodal Feature Analysis for Action Recognition in RGB+D Videos , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[156] Jürgen Schmidhuber,et al. Learning Precise Timing with LSTM Recurrent Networks , 2003, J. Mach. Learn. Res..
[157] Song-Chun Zhu,et al. Visual Persuasion: Inferring Communicative Intents of Images , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[158] Yonghong Song,et al. Hand gesture recognition using view projection from point cloud , 2016, 2016 IEEE International Conference on Image Processing (ICIP).
[159] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[160] Ricardo Chavarriaga,et al. Ensemble creation and reconfiguration for activity recognition: An information theoretic approach , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.
[161] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[162] Mubarak Shah,et al. Automatic action annotation in weakly labeled videos , 2016, Comput. Vis. Image Underst..
[163] Guo-Jun Qi,et al. Differential Recurrent Neural Networks for Action Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[164] Apostol Natsev,et al. YouTube-8M: A Large-Scale Video Classification Benchmark , 2016, ArXiv.
[165] Pichao Wang,et al. Large-scale Continuous Gesture Recognition Using Convolutional Neural Networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[166] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.
[167] Sergio Escalera,et al. Deep learning based super-resolution for improved action recognition , 2015, 2015 International Conference on Image Processing Theory, Tools and Applications (IPTA).
[168] Ken Perlin,et al. Real-Time Continuous Pose Recovery of Human Hands Using Convolutional Networks , 2014, ACM Trans. Graph..
[169] Mohamed S. Kamel,et al. A semi-supervised temporal clustering method for facial emotion analysis , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).
[170] Shih-Fu Chang,et al. Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[171] Xun Xu,et al. Multi-Task Zero-Shot Action Recognition with Prioritised Data Augmentation , 2016, ECCV.
[172] Luc Van Gool,et al. Two-Stream SR-CNNs for Action Recognition in Videos , 2016, BMVC.
[173] Hsien-I Lin,et al. Human hand gesture recognition using a convolution neural network , 2014, 2014 IEEE International Conference on Automation Science and Engineering (CASE).
[174] Anthony G. Cohn,et al. Weakly supervised activity analysis with spatio-temporal localisation , 2016, Neurocomputing.
[175] Jing Zhang,et al. Action Recognition From Depth Maps Using Deep Convolutional Neural Networks , 2016, IEEE Transactions on Human-Machine Systems.
[176] Geoffrey E. Hinton,et al. Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..
[177] Christopher Joseph Pal,et al. Delving Deeper into Convolutional Networks for Learning Video Representations , 2015, ICLR.
[178] Tinne Tuytelaars,et al. Rank Pooling for Action Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[179] Cordelia Schmid,et al. Learning to Track for Spatio-Temporal Action Localization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[180] Cordelia Schmid,et al. P-CNN: Pose-Based CNN Features for Action Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[181] Oscar Koller,et al. Using Convolutional 3D Neural Networks for User-independent continuous gesture recognition , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).
[182] Masayuki Inaba,et al. Design and implementation of a system that generates assembly programs from visual recognition of human action sequences , 1990, IEEE International Workshop on Intelligent Robots and Systems, Towards a New Frontier of Applications.
[183] Adnan Khashman,et al. Deep learning in vision-based static hand gesture recognition , 2017, Neural Computing and Applications.
[184] Cees Snoek,et al. What do 15,000 object categories tell us about classifying and localizing actions? , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).