To Create What You Tell: Generating Videos from Captions
暂无分享,去创建一个
Tao Mei | Houqiang Li | Ting Yao | Zhaofan Qiu | Yingwei Pan | Tao Mei | Ting Yao | Yingwei Pan | Zhaofan Qiu | Houqiang Li
[1] Quoc V. Le,et al. Semi-supervised Sequence Learning , 2015, NIPS.
[2] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
[3] Graham W. Taylor,et al. Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[4] Dit-Yan Yeung,et al. Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.
[5] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[6] Nitish Srivastava,et al. Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.
[7] William B. Dolan,et al. Collecting Highly Parallel Data for Paraphrase Evaluation , 2011, ACL.
[8] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[9] Tao Mei,et al. Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.
[11] Hui Jiang,et al. Generating images with recurrent adversarial networks , 2016, ArXiv.
[12] Hossein Mobahi,et al. Deep learning from temporal coherence in video , 2009, ICML '09.
[13] Tao Mei,et al. Video Captioning with Transferred Semantic Attributes , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[14] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[15] Vineeth N. Balasubramanian,et al. Sync-DRAW: Automatic Video Generation using Deep Recurrent Attentive Architectures , 2016, ACM Multimedia.
[16] Jonathon Shlens,et al. Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.
[17] Tao Mei,et al. Boosting Image Captioning with Attributes , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[18] John Salvatier,et al. Theano: A Python framework for fast computation of mathematical expressions , 2016, ArXiv.
[19] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.
[20] Masaki Saito,et al. Temporal Generative Adversarial Nets , 2016, ArXiv.
[21] Tao Mei,et al. Jointly Modeling Embedding and Translation to Bridge Video and Language , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[23] Ruslan Salakhutdinov,et al. Generating Images from Captions with Attention , 2015, ICLR.
[24] Antonio Torralba,et al. Generating Videos with Scene Dynamics , 2016, NIPS.
[25] Trevor Darrell,et al. YouTube2Text: Recognizing and Describing Arbitrary Activities Using Semantic Hierarchies and Zero-Shot Recognition , 2013, 2013 IEEE International Conference on Computer Vision.
[26] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.
[27] Simon Haykin,et al. GradientBased Learning Applied to Document Recognition , 2001 .
[28] Tao Mei,et al. MSR-VTT: A Large Video Description Dataset for Bridging Video and Language , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[29] Lorenzo Torresani,et al. Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).
[30] Shunta Saito,et al. Temporal Generative Adversarial Nets with Singular Value Clipping , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[31] Rob Fergus,et al. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks , 2015, NIPS.