ActivityNet: A large-scale video benchmark for human activity understanding
暂无分享,去创建一个
Bernard Ghanem | Juan Carlos Niebles | Fabian Caba Heilbron | Victor Escorcia | Victor Escorcia | Bernard Ghanem
[1] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[2] Yiannis Aloimonos,et al. Towards a Sensorimotor WordNet SM : Closing the Semantic Gap , 2006 .
[3] Juan Carlos Niebles,et al. Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.
[4] Juan Carlos Niebles,et al. Collecting and Annotating Human Activities in Web Videos , 2014, ICMR.
[5] Juan Carlos Niebles,et al. Discriminative Hierarchical Modeling of Spatio-temporally Composable Human Activities , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[6] Bernt Schiele,et al. A database for fine grained activity detection of cooking activities , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[7] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[8] Indriyati Atmosukarto,et al. Action Recognition Using Discriminative Structured Trajectory Groups , 2015, 2015 IEEE Winter Conference on Applications of Computer Vision.
[9] Nanning Zheng,et al. Concurrent Action Detection with Structural Prediction , 2013, 2013 IEEE International Conference on Computer Vision.
[10] Mubarak Shah,et al. Action MACH a spatio-temporal Maximum Average Correlation Height filter for action recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[11] Cordelia Schmid,et al. Actions in context , 2009, CVPR.
[12] Yiannis Aloimonos,et al. The minimalist grammar of action , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.
[13] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[14] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.
[15] Deva Ramanan,et al. Efficiently Scaling up Crowdsourced Video Annotation , 2012, International Journal of Computer Vision.
[16] Yu Qiao,et al. Action Recognition with Stacked Fisher Vectors , 2014, ECCV.
[17] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[18] Cordelia Schmid,et al. The LEAR submission at Thumos 2014 , 2014 .
[19] Jay Stewart,et al. The American Time Use Survey , 2005 .
[20] Krista A. Ehinger,et al. SUN Database: Exploring a Large Collection of Scene Categories , 2014, International Journal of Computer Vision.
[21] Bernard Ghanem,et al. Camera Motion and Surrounding Scene Appearance as Context for Action Recognition , 2014, ACCV.
[22] Heng Wang. LEAR-INRIA submission for the THUMOS workshop , 2013 .
[23] Cees G. M. Snoek,et al. University of Amsterdam at THUMOS Challenge 2014 , 2014 .
[24] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[25] Cordelia Schmid,et al. Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
[26] Indriyati Atmosukarto,et al. Trajectory-based Fisher kernel representation for action recognition in videos , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).
[27] Ronen Basri,et al. Actions as Space-Time Shapes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[28] Juan Carlos Niebles,et al. Spatio-temporal Human-Object Interactions for Action Recognition in Videos , 2013, 2013 IEEE International Conference on Computer Vision Workshops.
[29] Antonio Torralba,et al. Ieee Transactions on Pattern Analysis and Machine Intelligence 1 80 Million Tiny Images: a Large Dataset for Non-parametric Object and Scene Recognition , 2022 .
[30] Barbara Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[31] Nazli Ikizler-Cinbis,et al. Object, Scene and Actions: Combining Multiple Features for Human Action Recognition , 2010, ECCV.
[32] Bernt Schiele,et al. 2D Human Pose Estimation: New Benchmark and State of the Art Analysis , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[33] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[34] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.
[35] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.
[36] David R Bassett,et al. 2011 Compendium of Physical Activities: a second update of codes and MET values. , 2011, Medicine and science in sports and exercise.
[37] Fei-Fei Li,et al. Combining the Right Features for Complex Event Recognition , 2013, 2013 IEEE International Conference on Computer Vision.
[38] Bernt Schiele,et al. Fine-Grained Activity Recognition with Holistic and Pose Based Features , 2014, GCPR.
[39] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.