Conquering the CNN Over-Parameterization Dilemma: A Volterra Filtering Approach for Action Recognition
[1] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.
[2] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[3] Xiang Zhang,et al. OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.
[4] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.
[5] Jiebo Luo,et al. Recognizing realistic actions from videos “in the wild” , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[6] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[7] Yang Gao,et al. Compact Bilinear Pooling , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[9] Fei-Fei Li,et al. ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.
[10] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[11] Yun Fu,et al. Human Action Recognition and Prediction: A Survey , 2018, International Journal of Computer Vision.
[12] Liyi Dai,et al. Cross-Modality Distillation: A Case for Conditional Generative Adversarial Networks , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Cordelia Schmid,et al. Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.
[14] Benjamin Pfaff. Theory Of Functionals And Of Integral And Integro Differential Equations , 2016 .
[15] Subhransu Maji,et al. Bilinear CNNs for Fine-grained Visual Recognition , 2015 .
[16] Yann LeCun,et al. Learning Hierarchical Features for Scene Labeling , 2013 .
[17] Hamid Krim,et al. Human Activity Modeling as Brownian Motion on Shape Manifold , 2011, SSVM.
[18] Y. Fu. Human Action Recognition and Prediction: A Survey , 2020 .
[19] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.
[20] Thomas Serre,et al. HMDB: A large video database for human motion recognition , 2011, 2011 International Conference on Computer Vision.
[21] Christian Wolf,et al. Sequential Deep Learning for Human Action Recognition , 2011, HBU.
[22] Ming Yang,et al. 3D Convolutional Neural Networks for Human Action Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[24] M. Schetzen. The Volterra and Wiener Theories of Nonlinear Systems , 1980 .
[25] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[26] Stanislaw Osowski,et al. Multilayer neural network structure as Volterra filter , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.
[27] King-Sun Fu,et al. IEEE Transactions on Pattern Analysis and Machine Intelligence Publication Information , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[28] Fei-Fei Li,et al. Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[29] Trevor Darrell,et al. Learning with Side Information through Modality Hallucination , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[30] Zhi-Quan Luo,et al. Decision Level Fusion: An Event Driven Approach , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).
[31] Zhi-Quan Luo,et al. Event Driven Fusion , 2019, 1904.11520.
[32] Fabio Viola,et al. The Kinetics Human Action Video Dataset , 2017, ArXiv.
[33] Hanspeter Pfister,et al. Trainable Convolution Filters and Their Application to Face Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[34] Horst Bischof,et al. A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.
[35] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[36] Luc Van Gool,et al. Deep Temporal Linear Encoding Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Mubarak Shah,et al. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild , 2012, ArXiv.
[38] Andrew Zisserman,et al. Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Juan Carlos Niebles,et al. Modeling Temporal Structure of Decomposable Motion Segments for Activity Classification , 2010, ECCV.
[40] Camille Couprie,et al. Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[41] Petros Daras,et al. Non-linear Convolution Filters for CNN-Based Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).