Video Activity Recognition: State-of-the-Art
暂无分享,去创建一个
Basilio Sierra | Igor Rodriguez Rodriguez | José María Martínez-Otzeta | Itsaso Rodríguez-Moreno | Ekaitz Jauregi | B. Sierra | J. M. Martínez-Otzeta | E. Jauregi | I. Rodriguez | Itsaso Rodríguez-Moreno
[1] Jean-Christophe Nebel,et al. Common-sense reasoning for human action recognition , 2013, Pattern Recognit. Lett..
[2] S. Santhosh Kumar,et al. Human activity recognition using optical flow based feature set , 2016, 2016 IEEE International Carnahan Conference on Security Technology (ICCST).
[3] Larry S. Davis,et al. Temporal Difference Networks for Video Action Recognition , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).
[4] Sonia Sehgal. Human Activity Recognition Using BPNN Classifier on HOG Features , 2018, 2018 International Conference on Intelligent Circuits and Systems (ICICS).
[5] Jake K. Aggarwal,et al. View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.
[6] Christian Thurau,et al. Behavior Histograms for Action Recognition and Human Detection , 2007, Workshop on Human Motion.
[7] Anupam Agrawal,et al. Vision based hand gesture recognition for human computer interaction: a survey , 2012, Artificial Intelligence Review.
[8] Feng Niu,et al. HMM-Based Segmentation and Recognition of Human Activities from Video Sequences , 2005, 2005 IEEE International Conference on Multimedia and Expo.
[9] S. P. Lloyd,et al. Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.
[10] Serge J. Belongie,et al. Behavior recognition via sparse spatio-temporal features , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.
[11] Yann LeCun,et al. Convolutional Learning of Spatio-temporal Features , 2010, ECCV.
[12] Anthony Widjaja,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.
[13] Anders Grunnet-Jepsen,et al. Intel RealSense Stereoscopic Depth Cameras , 2017, CVPR 2017.
[14] Gregory D. Hager,et al. Histograms of oriented optical flow and Binet-Cauchy kernels on nonlinear dynamical systems for the recognition of human actions , 2009, CVPR.
[15] Jean-Christophe Nebel,et al. Episodic Reasoning for Vision-Based Human Action Recognition , 2014, TheScientificWorldJournal.
[16] James McCormick. Sloth , 1996, BMJ.
[17] Lin Sun,et al. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[18] Jiebo Luo,et al. Recognizing realistic actions from videos “in the wild” , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[19] Nicola Bellotto,et al. Social activity recognition based on probabilistic merging of skeleton features with proximity priors from RGB-D data , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[20] Ying Wu,et al. Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[21] Zhe Wang,et al. Towards Good Practices for Very Deep Two-Stream ConvNets , 2015, ArXiv.
[22] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[23] Guillaume Bouchard,et al. Hierarchical part-based visual object categorization , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[24] Zhengyou Zhang,et al. Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..
[25] Supavadee Aramvith,et al. Human action recognition using direction histograms of optical flow , 2011, 2011 11th International Symposium on Communications & Information Technologies (ISCIT).
[26] Seth J. Teller,et al. Particle Video: Long-Range Motion Estimation Using Point Trajectories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[27] Wojciech Zaremba,et al. Learning to Execute , 2014, ArXiv.
[28] Cordelia Schmid,et al. Actions in context , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[29] Anoop Cherian,et al. Video Representation Learning Using Discriminative Pooling , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[30] Mykola Pechenizkiy,et al. A survey on using domain and contextual knowledge for human activity recognition in video streams , 2016, Expert Syst. Appl..
[31] Greg Mori,et al. Social roles in hierarchical models for human activity recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[32] Emilio Maggio,et al. Video Tracking - Theory and Practice , 2011 .
[33] Vladimir Vapnik,et al. An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.
[34] Nello Cristianini,et al. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .
[35] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[36] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[37] Jitendra Malik,et al. Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[38] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[39] Tinne Tuytelaars,et al. Modeling video evolution for action recognition , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Christopher G. Harris,et al. A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.
[41] Andrea Vedaldi,et al. Dynamic Image Networks for Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[42] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[43] Janusz Konrad,et al. Action Recognition Using Sparse Representation on Covariance Manifolds of Optical Flow , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.
[44] Cordelia Schmid,et al. AVA: A Video Dataset of Spatio-Temporally Localized Atomic Visual Actions , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[45] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[46] Du Tran,et al. Human Activity Recognition with Metric Learning , 2008, ECCV.
[47] Jing Tian,et al. Action recognition using multi-directional projected depth motion maps , 2018, Journal of Ambient Intelligence and Humanized Computing.
[48] Pinar Duygulu Sahin,et al. Pose sentences: A new representation for action recognition using sequence of pose words , 2008, 2008 19th International Conference on Pattern Recognition.
[49] Tiziana D'Orazio,et al. Human activity recognition for automatic visual surveillance of wide areas , 2004, VSSN '04.
[50] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[51] Luc Van Gool,et al. Action snippets: How many frames does human action recognition require? , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[52] Leonid Sigal,et al. Poselet Key-Framing: A Model for Human Activity Recognition , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[53] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.
[54] Shipra Aggarwal,et al. Motion detection, tracking and classification for automated Video Surveillance , 2016, 2016 IEEE 1st International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES).
[55] Matti Pietikäinen,et al. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..
[56] Serhan Cosar,et al. Social Activity Recognition on Continuous RGB-D Video Sequences , 2020, Int. J. Soc. Robotics.
[57] Emanuele Frontoni,et al. HMM-based Activity Recognition with a Ceiling RGB-D Camera , 2017, ICPRAM.
[58] Antonio Fernández-Caballero,et al. A survey of video datasets for human action and activity recognition , 2013, Comput. Vis. Image Underst..
[59] Juan José Pantrigo,et al. Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition , 2018, Pattern Recognit..
[60] Yifeng He,et al. Human action recognition via multiview discriminative analysis of canonical correlations , 2016, 2016 IEEE International Conference on Image Processing (ICIP).
[61] Barbara Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[62] Stephen J. Maybank,et al. Activity recognition using a supervised non-parametric hierarchical HMM , 2016, Neurocomputing.
[63] Tieniu Tan,et al. A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[64] Bhiksha Raj,et al. Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[65] Kai-Kuang Ma,et al. Sum-of-gradient based fast intra coding in 3D-HEVC for depth map sequence (SOG-FDIC) , 2017, J. Vis. Commun. Image Represent..
[66] David R. Bull,et al. GMM-based efficient foreground detection with adaptive region update , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).
[67] Barbara Caputo,et al. Recognition with local features: the kernel recipe , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[68] Hong Liu,et al. Robust 3D Action Recognition Through Sampling Local Appearances and Global Distributions , 2018, IEEE Transactions on Multimedia.
[69] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[70] Nasser Kehtarnavaz,et al. Real-time human action recognition based on depth motion maps , 2013, Journal of Real-Time Image Processing.
[71] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[72] Chong-Wah Ngo,et al. Motion-Based Video Representation for Scene Change Detection , 2004, International Journal of Computer Vision.
[73] Paul J. M. Havinga,et al. Activity Recognition Using Inertial Sensing for Healthcare, Wellbeing and Sports Applications: A Survey , 2010, ARCS Workshops.
[74] Ronen Basri,et al. Actions as space-time shapes , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.
[75] Antonio Torralba,et al. LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.
[76] Nicu Sebe,et al. Deep appearance and motion learning for egocentric activity recognition , 2018, Neurocomputing.
[77] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.
[78] Jenq-Neng Hwang,et al. A Review on Video-Based Human Activity Recognition , 2013, Comput..
[79] Yongxiong Wang,et al. A self-adaptive weighted affinity propagation clustering for key frames extraction on human action recognition , 2015, J. Vis. Commun. Image Represent..
[80] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[81] Peyman Milanfar,et al. Action Recognition from One Example , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[82] Ying Wu,et al. Robust 3D Action Recognition with Random Occupancy Patterns , 2012, ECCV.
[83] N. C. Chauhan,et al. Deep neural network model for group activity recognition using contextual relationship , 2019, Engineering Science and Technology, an International Journal.
[84] J. Aggarwal,et al. Recognizing human action from a far field of view , 2009, 2009 Workshop on Motion and Video Computing (WMVC).
[85] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[86] Bharti Bansal,et al. Gesture Recognition: A Survey , 2016 .
[87] Cordelia Schmid,et al. Learning Object Representations for Visual Object Class Recognition , 2007, ICCV 2007.
[88] Matthew J. Hausknecht,et al. Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[89] Carlo Tomasi,et al. Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.
[90] JoAnne K. Gronley,et al. Use of cluster analysis for gait pattern classification of patients in the early and late recovery phases following stroke. , 2003, Gait & posture.
[91] Andrew Zisserman,et al. Representing shape with a spatial pyramid kernel , 2007, CIVR '07.
[92] Christopher Hunt,et al. Notes on the OpenSURF Library , 2009 .
[93] Shih-Chia Huang,et al. An Advanced Motion Detection Algorithm With Video Quality Analysis for Video Surveillance Systems , 2011, IEEE Transactions on Circuits and Systems for Video Technology.
[94] Stefan Carlsson,et al. CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.
[95] Ivan Laptev,et al. On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[96] Lihong Zheng,et al. A Survey on Human Action Recognition Using Depth Sensors , 2015, 2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA).
[97] Mehrtash Tafazzoli Harandi,et al. Going deeper into action recognition: A survey , 2016, Image Vis. Comput..
[98] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).
[99] Yun Fu,et al. Human Action Recognition and Prediction: A Survey , 2018, International Journal of Computer Vision.
[100] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.
[101] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.
[102] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[103] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[104] Cordelia Schmid,et al. Weakly Supervised Learning of Interactions between Humans and Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[105] Cordelia Schmid,et al. Long-Term Temporal Convolutions for Action Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[106] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[107] Xiaodong Yang,et al. Recognizing actions using depth motion maps-based histograms of oriented gradients , 2012, ACM Multimedia.
[108] E. Trucco,et al. Video Tracking: A Concise Survey , 2006, IEEE Journal of Oceanic Engineering.
[109] Susanne Westphal,et al. The “Something Something” Video Database for Learning and Evaluating Visual Common Sense , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[110] Ferda Nur Alpaslan,et al. Video Action Recognition Using an Optical Flow Based Representation , 2022 .
[111] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[112] Wanqing Li,et al. Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.
[113] S. Gong,et al. Recognising action as clouds of space-time interest points , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[114] Anupam Agrawal,et al. A survey on activity recognition and behavior understanding in video surveillance , 2012, The Visual Computer.
[115] Luc Van Gool,et al. Temporal Segment Networks: Towards Good Practices for Deep Action Recognition , 2016, ECCV.
[116] Lior Wolf,et al. Kernel principal angles for classification machines with applications to image sequence interpretation , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..
[117] Z. Liu,et al. A real time system for dynamic hand gesture recognition with a depth sensor , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).
[118] Juan Carlos Niebles,et al. Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.
[119] James W. Davis,et al. An appearance-based representation of action , 1996, Proceedings of 13th International Conference on Pattern Recognition.
[120] Camille Couprie,et al. Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[121] Koby Crammer,et al. On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..
[122] J.K. Aggarwal,et al. Human activity analysis , 2011, ACM Comput. Surv..
[123] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.
[124] Sung Wook Baik,et al. Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features , 2018, IEEE Access.
[125] Radha Poovendran,et al. Human activity recognition for video surveillance , 2008, 2008 IEEE International Symposium on Circuits and Systems.
[126] Xiaoshuai Sun,et al. Two-Stream 3-D convNet Fusion for Action Recognition in Videos With Arbitrary Size and Length , 2018, IEEE Transactions on Multimedia.
[127] Andrew Zisserman,et al. Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[128] Juan Carlos Niebles,et al. Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words , 2006, BMVC.
[129] Vinod Nair,et al. Automated Visual Surveillance Using Hidden Markov Models , 2002 .
[130] James J. Little,et al. Simultaneous Tracking and Action Recognition using the PCA-HOG Descriptor , 2006, The 3rd Canadian Conference on Computer and Robot Vision (CRV'06).
[131] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[132] Limin Wang,et al. Action recognition with trajectory-pooled deep-convolutional descriptors , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[133] Alexander J. Smola,et al. Binet-Cauchy Kernels on Dynamical Systems and its Application to the Analysis of Dynamic Scenes , 2007, International Journal of Computer Vision.
[134] Kris M. Kitani,et al. Going Deeper into First-Person Activity Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[135] Jason J. Corso,et al. Action bank: A high-level representation of activity in video , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[136] Bernard Ghanem,et al. ActivityNet: A large-scale video benchmark for human activity understanding , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[137] Gunnar Farnebäck,et al. Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.
[138] Fernando Pérez-Cruz,et al. Supervised-PCA and SVM classifiers for object detection in infrared images , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..
[139] Juan Carlos Niebles,et al. A Hierarchical Model of Shape and Appearance for Human Action Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.
[140] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[141] Ali Farhadi,et al. Actions ~ Transformations , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[142] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[143] James W. Davis,et al. The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..
[144] Marjorie Skubic,et al. VicoVR-Based Wireless Daily Activity Recognition and Assessment System for Stroke Rehabilitation , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).
[145] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[146] Jürgen Schmidhuber,et al. Bidirectional LSTM Networks for Improved Phoneme Classification and Recognition , 2005, ICANN.
[147] Zicheng Liu,et al. HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[148] Chee Kheong Siew,et al. Extreme learning machine: Theory and applications , 2006, Neurocomputing.
[149] Cordelia Schmid,et al. Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[150] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[151] X. Li. HMM based action recognition using oriented histograms of optical flow field , 2007 .
[152] Soharab Hossain Shaikh,et al. A comprehensive survey of human action recognition with spatio-temporal interest point (STIP) detector , 2015, The Visual Computer.
[153] Cordelia Schmid,et al. Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).