DIY Human Action Dataset Generation
暂无分享,去创建一个
Vivek Pradeep | Mehran Khodabandeh | Hamid Reza Vaezi Joze | Ilya Zharkov | V. Pradeep | Ilya Zharkov | Mehran Khodabandeh
[1] Greg Mori,et al. Discovering human interactions in videos with limited data labeling , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[2] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Marc'Aurelio Ranzato,et al. Transformation-Based Models of Video Sequences , 2017, ArXiv.
[4] Daan Wierstra,et al. Stochastic Back-propagation and Variational Inference in Deep Latent Gaussian Models , 2014, ArXiv.
[5] Gang Wang,et al. NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.
[7] Ronald Poppe,et al. A survey on vision-based human action recognition , 2010, Image Vis. Comput..
[8] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.
[9] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[10] Yaser Sheikh,et al. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[11] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[12] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.
[13] Jan Kautz,et al. MoCoGAN: Decomposing Motion and Content for Video Generation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[14] Sergio Escalera,et al. Keep it accurate and diverse: Enhancing action recognition performance by ensemble learning , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[15] Shunta Saito,et al. Temporal Generative Adversarial Nets with Singular Value Clipping , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[16] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[17] Rob Fergus,et al. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks , 2015, NIPS.
[18] Martial Hebert,et al. The Pose Knows: Video Forecasting by Generating Pose Futures , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[19] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[20] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[21] Jake K. Aggarwal,et al. View invariant human action recognition using histograms of 3D joints , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.
[22] Jiajun Wu,et al. Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks , 2016, NIPS.
[23] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[24] Martin Szummer,et al. Temporal texture modeling , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.
[25] Cordelia Schmid,et al. Action recognition by dense trajectories , 2011, CVPR 2011.
[26] Jan Kautz,et al. Unsupervised Image-to-Image Translation Networks , 2017, NIPS.
[27] Stefano Soatto,et al. Dynamic Textures , 2003, International Journal of Computer Vision.
[28] Cordelia Schmid,et al. Learning from Synthetic Humans , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Cordelia Schmid,et al. Actions in context , 2009, CVPR.
[30] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[31] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[32] Yann LeCun,et al. Deep multi-scale video prediction beyond mean square error , 2015, ICLR.
[33] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[34] Andrew Zisserman,et al. Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Seunghoon Hong,et al. Decomposing Motion and Content for Natural Video Sequence Prediction , 2017, ICLR.
[36] Antonio Torralba,et al. Generating Videos with Scene Dynamics , 2016, NIPS.
[37] Patrice Y. Simard,et al. Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..
[38] Nitish Srivastava,et al. Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.
[39] Xi Zhang,et al. Learning Classifiers from Synthetic Data Using a Multichannel Autoencoder , 2015, ArXiv.
[40] Dimitris Samaras,et al. Two-person interaction detection using body-pose features and multiple instance learning , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.
[41] Fabio Viola,et al. The Kinetics Human Action Video Dataset , 2017, ArXiv.
[42] Barbara Caputo,et al. Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..
[43] Ali Farhadi,et al. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding , 2016, ECCV.
[44] Matthew J. Hausknecht,et al. Beyond short snippets: Deep networks for video classification , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[45] Patrick Pérez,et al. View-Independent Action Recognition from Temporal Self-Similarities , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[46] Yoshua Bengio,et al. Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[47] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[48] Johan Karlsson,et al. Automatic target recognition using discrimination based on optimal transport , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[49] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.
[50] Masaki Saito,et al. Temporal Generative Adversarial Nets , 2016, ArXiv.
[51] Wanqing Li,et al. Action recognition based on a bag of 3D points , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.
[52] Ira Kemelmacher-Shlizerman,et al. The MegaFace Benchmark: 1 Million Faces for Recognition at Scale , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[53] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[54] Shin'ichi Satoh,et al. Active Learning for Structured Prediction from Partially Labeled Data , 2017, ArXiv.
[55] Antonio Manuel López Peña,et al. Procedural Generation of Videos to Train Deep Action Recognition Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[56] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[57] Mark D. McDonnell,et al. Understanding Data Augmentation for Classification: When to Warp? , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).
[58] Kensuke Yokoi,et al. APAC: Augmented PAttern Classification with Neural Networks , 2015, ArXiv.
[59] Ming-Yu Liu,et al. Coupled Generative Adversarial Networks , 2016, NIPS.
[60] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[61] Silvio Savarese,et al. Single-source Attention Path Prediction Multi-source Attention Predicted Observed , 2018 .
[62] Richard S. Zemel,et al. Generative Moment Matching Networks , 2015, ICML.
[63] Sergey Levine,et al. Unsupervised Learning for Physical Interaction through Video Prediction , 2016, NIPS.
[64] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.
[65] Cordelia Schmid,et al. Actions in context , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[66] Silvio Savarese,et al. Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[67] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.
[68] Jaakko Lehtinen,et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.
[69] Dimitris N. Metaxas,et al. StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[70] Marc Levoy,et al. Fast texture synthesis using tree-structured vector quantization , 2000, SIGGRAPH.
[71] Alex Graves,et al. Video Pixel Networks , 2016, ICML.
[72] Mubarak Shah,et al. A 3-dimensional sift descriptor and its application to action recognition , 2007, ACM Multimedia.