Miguel P. Eckstein | Scott T. Grafton | Tsu-Jui Fu | Xin Eric Wang | William Yang Wang