Cross-Modal Contrastive Learning for Text-to-Image Generation
暂无分享,去创建一个
[1] Jiaolong Yang,et al. Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[2] Liam Paninski,et al. Estimation of Entropy and Mutual Information , 2003, Neural Computation.
[3] Yoshua Bengio,et al. Mutual Information Neural Estimation , 2018, ICML.
[4] Yuichi Yoshida,et al. Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.
[5] Seunghoon Hong,et al. Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[6] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[7] Oriol Vinyals,et al. Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.
[8] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.
[9] Thomas Fevens,et al. Dual Adversarial Inference for Text-to-Image Synthesis , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[10] Dustin Tran,et al. Hierarchical Implicit Models and Likelihood-Free Variational Inference , 2017, NIPS.
[11] Han Zhang,et al. Self-Attention Generative Adversarial Networks , 2018, ICML.
[12] Wei Chen,et al. DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-To-Image Synthesis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Ting Chen,et al. On Self Modulation for Generative Adversarial Networks , 2018, ICLR.
[14] Cordelia Schmid,et al. What makes for good views for contrastive learning , 2020, NeurIPS.
[15] Joon Son Chung,et al. Perfect Match: Improved Cross-modal Embeddings for Audio-visual Synchronisation , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.
[17] Stefan Wermter,et al. Semantic Object Accuracy for Generative Text-to-Image Synthesis , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[18] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.
[19] Dacheng Tao,et al. Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge , 2019, NeurIPS.
[20] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.
[21] Honglak Lee,et al. Text-to-Image Generation Grounded by Fine-Grained User Attention , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).
[22] Andrew Zisserman,et al. Automated Flower Classification over a Large Number of Classes , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.
[23] Jon Gauthier. Conditional generative adversarial nets for convolutional face generation , 2015 .
[24] Nenghai Yu,et al. Semantics Disentangling for Text-To-Image Generation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[25] Yoshua Bengio,et al. Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[26] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.
[27] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .
[28] Thomas Lukasiewicz,et al. Controllable Text-to-Image Generation , 2019, NeurIPS.
[29] Stella X. Yu,et al. Unsupervised Feature Learning via Non-parametric Instance Discrimination , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[30] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.
[31] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[32] Jeff Donahue,et al. Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.
[33] Ngai-Man Cheung,et al. InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning , 2020, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).
[34] Jing Zhang,et al. MirrorGAN: Learning Text-To-Image Generation by Redescription , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[35] Wenjie Pei,et al. CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis , 2019, ArXiv.
[36] Lin Yang,et al. Photographic Text-to-Image Synthesis with a Hierarchically-Nested Adversarial Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[37] Sergio Gomez Colmenarejo,et al. Parallel Multiscale Autoregressive Density Estimation , 2017, ICML.
[38] Nuno Vasconcelos,et al. Audio-Visual Instance Discrimination with Cross-Modal Agreement , 2020, ArXiv.
[39] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[40] Rishi Sharma,et al. A Note on the Inception Score , 2018, ArXiv.
[41] Jaesik Park,et al. ContraGAN: Contrastive Learning for Conditional Image Generation , 2020, NeurIPS.
[42] Stefano Ermon,et al. Understanding the Limitations of Variational Mutual Information Estimators , 2020, ICLR.
[43] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[44] Jason Baldridge,et al. Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO , 2020, EACL.
[45] Alexei A. Efros,et al. Contrastive Learning for Unpaired Image-to-Image Translation , 2020, ECCV.
[46] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.
[47] Han Zhang,et al. Improving GANs Using Optimal Transport , 2018, ICLR.
[48] Kaiming He,et al. Improved Baselines with Momentum Contrastive Learning , 2020, ArXiv.
[49] Jordi Pont-Tuset,et al. Connecting Vision and Language with Localized Narratives , 2019, ECCV.
[50] Xin Li,et al. Semantics-Enhanced Adversarial Nets for Text-to-Image Synthesis , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).
[51] Sameer Singh,et al. Image Augmentations for GAN Training , 2020, ArXiv.
[52] Radu Soricut,et al. Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning , 2018, ACL.
[53] Hui Chen,et al. Geometry-Contrastive GAN for Facial Expression Transfer. , 2018, 1802.01822.
[54] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.
[55] Honglak Lee,et al. Consistency Regularization for Generative Adversarial Networks , 2020, ICLR.
[56] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[57] Lei Zhang,et al. Object-Driven Text-To-Image Synthesis via Adversarial Training , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[58] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[59] Stefan Wermter,et al. Generating Multiple Objects at Spatially Distinct Locations , 2019, ICLR.
[60] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.
[61] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.