Watch, Reason and Code: Learning to Represent Videos Using Program
暂无分享,去创建一个
Chuang Gan | Wenbing Huang | Wenwu Zhu | Anton van den Hengel | Yiwei Zhang | Qi Wu | Xuguang Duan | Chuang Gan | Wenwu Zhu | Qi Wu | Wenbing Huang | Yiwei Zhang | Xuguang Duan | A. Hengel
[1] Wei Xu,et al. Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[3] Yale Song,et al. To Click or Not To Click: Automatic Selection of Beautiful Thumbnails from Videos , 2016, CIKM.
[4] Matthew J. Hausknecht,et al. Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis , 2018, ICLR.
[5] Tomas Mikolov,et al. Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets , 2015, NIPS.
[6] Lukasz Kaiser,et al. Neural GPUs Learn Algorithms , 2015, ICLR.
[7] Yi Yang,et al. DevNet: A Deep Event Network for multimedia event detection and evidence recounting , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Zhou Su,et al. Weakly Supervised Dense Video Captioning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Michael S. Ryoo,et al. Human activity prediction: Early recognition of ongoing activities from streaming videos , 2011, 2011 International Conference on Computer Vision.
[10] Chuang Gan,et al. Weakly Supervised Dense Event Captioning in Videos , 2018, NeurIPS.
[11] Zhe Gan,et al. Semantic Compositional Networks for Visual Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Tao Mei,et al. MSR-VTT: A Large Video Description Dataset for Bridging Video and Language , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Juan Carlos Niebles,et al. Dense-Captioning Events in Videos , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[14] Eric Medvet,et al. Automatic Synthesis of Regular Expressions from Examples , 2014, Computer.
[15] Zhe Gan,et al. StyleNet: Generating Attractive Visual Captions with Styles , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[16] Wojciech Jaskowski,et al. ViZDoom: A Doom-based AI research platform for visual reinforcement learning , 2016, 2016 IEEE Conference on Computational Intelligence and Games (CIG).
[17] Tao Mei,et al. Video Captioning with Transferred Semantic Attributes , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[18] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.
[19] John R. Hershey,et al. Attention-Based Multimodal Fusion for Video Description , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[20] Andrew Zisserman,et al. Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.
[21] Tao Mei,et al. To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression , 2018, AAAI.
[22] Richard E. Pattis,et al. Karel the Robot: A Gentle Introduction to the Art of Programming , 1994 .
[23] Matthew J. Hausknecht,et al. Neural Program Meta-Induction , 2017, NIPS.
[24] Hyeonwoo Noh,et al. Neural Program Synthesis from Diverse Demonstration Videos , 2018, ICML.
[25] Marcin Andrychowicz,et al. Neural Random Access Machines , 2015, ERCIM News.
[26] Mubarak Shah,et al. Human Action Recognition in Videos Using Kinematic Features and Multiple Instance Learning , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[27] Razvan Pascanu,et al. A simple neural network module for relational reasoning , 2017, NIPS.
[28] Leonidas J. Guibas,et al. Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[29] Trevor Darrell,et al. Sequence to Sequence -- Video to Text , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[30] Chuang Gan,et al. Video Captioning with Multi-Faceted Attention , 2016, TACL.
[31] Armando Solar-Lezama,et al. Learning to Infer Graphics Programs from Hand-Drawn Images , 2017, NeurIPS.
[32] Nenghai Yu,et al. Deliberation Networks: Sequence Generation Beyond One-Pass Decoding , 2017, NIPS.
[33] Sebastian Nowozin,et al. DeepCoder: Learning to Write Programs , 2016, ICLR.
[34] Lihong Li,et al. Neuro-Symbolic Program Synthesis , 2016, ICLR.