Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph
暂无分享,去创建一个
Ali Farhadi | Ruslan Salakhutdinov | Louis-Philippe Morency | Yao-Hung Tsai | Yao-Hung Hubert Tsai | Santosh Kumar Divvala | R. Salakhutdinov | S. Divvala | Ali Farhadi | Louis-Philippe Morency
[1] Cordelia Schmid,et al. Action Recognition with Improved Trajectories , 2013, 2013 IEEE International Conference on Computer Vision.
[2] Jian Yang,et al. Context-Dependent Diffusion Network for Visual Relationship Detection , 2018, ACM Multimedia.
[3] Abhinav Gupta,et al. ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Bo Dai,et al. Detecting Visual Relationships with Deep Relational Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[6] Nenghai Yu,et al. Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition , 2018, ECCV.
[7] Nebojsa Jojic,et al. Discovering Order in Unordered Datasets: Generative Markov Networks , 2017, ArXiv.
[8] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[9] Ali Farhadi,et al. Recognition using visual phrases , 2011, CVPR 2011.
[10] Nir Friedman,et al. Probabilistic Graphical Models - Principles and Techniques , 2009 .
[11] Tat-Seng Chua,et al. Video Visual Relation Detection , 2017, ACM Multimedia.
[12] Yejin Choi,et al. Neural Motifs: Scene Graph Parsing with Global Context , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[13] Vladlen Koltun,et al. Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.
[14] Ali Farhadi,et al. Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding , 2016, ECCV.
[15] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[16] Trevor Darrell,et al. Hidden Conditional Random Fields , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[17] Andrew Zisserman,et al. Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[18] Michael Felsberg,et al. Accurate Scale Estimation for Robust Visual Tracking , 2014, BMVC.
[19] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.
[20] Xilin Chen,et al. Visual Relationship Detection With Deep Structural Ranking , 2018, AAAI.
[21] Ben Taskar,et al. Discriminative Probabilistic Models for Relational Data , 2002, UAI.
[22] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.
[23] Jean-Michel Morel,et al. A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).
[24] Vibhav Vineet,et al. Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[25] Silvio Savarese,et al. Structural-RNN: Deep Learning on Spatio-Temporal Graphs , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Geoffrey E. Hinton,et al. Factored conditional restricted Boltzmann Machines for modeling motion style , 2009, ICML '09.
[27] Eric P. Xing,et al. Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[28] Shih-Fu Chang,et al. Visual Translation Embedding Network for Visual Relation Detection , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Andrew McCallum,et al. FACTORIE: Probabilistic Programming via Imperatively Defined Factor Graphs , 2009, NIPS.
[30] Shuqiang Jiang,et al. Deep Structured Learning for Visual Relationship Detection , 2018, AAAI.
[31] Susanne Westphal,et al. The “Something Something” Video Database for Learning and Evaluating Visual Common Sense , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[32] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..
[33] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[34] Abhinav Gupta,et al. Videos as Space-Time Region Graphs , 2018, ECCV.
[35] Michael S. Bernstein,et al. Visual Relationship Detection with Language Priors , 2016, ECCV.
[36] Ali Farhadi,et al. Asynchronous Temporal Fields for Action Recognition , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[37] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[38] Ali Farhadi,et al. VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[39] Patrick Seemann,et al. Matrix Factorization Techniques for Recommender Systems , 2014 .
[40] Raquel Urtasun,et al. Fully Connected Deep Structured Networks , 2015, ArXiv.
[41] Miguel Á. Carreira-Perpiñán,et al. Multiscale conditional random fields for image labeling , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..
[42] Bolei Zhou,et al. Temporal Relational Reasoning in Videos , 2017, ECCV.
[43] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[44] Fabio Viola,et al. The Kinetics Human Action Video Dataset , 2017, ArXiv.