Hierarchical Text-Conditional Image Generation with CLIP Latents
暂无分享,去创建一个
[1] Jonathan Ho. Classifier-Free Diffusion Guidance , 2022, ArXiv.
[2] Yaniv Taigman,et al. Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors , 2022, ECCV.
[3] Zili Yi,et al. CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP , 2022, ArXiv.
[4] Dmytro Okhonko,et al. CM3: A Causal Masked Multimodal Model of the Internet , 2022, ArXiv.
[5] Bo Zhang,et al. Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models , 2022, ICLR.
[6] Saining Xie,et al. SLIP: Self-supervision meets Language-Image Pre-training , 2021, ECCV.
[7] Prafulla Dhariwal,et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models , 2021, ICML.
[8] B. Ommer,et al. High-Resolution Image Synthesis with Latent Diffusion Models , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Pascal Vincent,et al. High Fidelity Visualization of What Your Self-Supervised Representation Knows About , 2021, Trans. Mach. Learn. Res..
[10] Supasorn Suwajanakorn,et al. Diffusion Autoencoders: Toward a Meaningful and Decodable Representation , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[11] Fang Wen,et al. Vector Quantized Diffusion Model for Text-to-Image Synthesis , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Ruiyi Zhang,et al. Towards Language-Free Training for Text-to-Image Generation , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[13] David P. Kreil,et al. CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP , 2021, NeurIPS.
[14] Daniel Cohen-Or,et al. StyleGAN-NADA , 2021, ACM Trans. Graph..
[15] Kurt Keutzer,et al. How Much Can CLIP Benefit Vision-and-Language Tasks? , 2021, ICLR.
[16] Rajshekhar Sunderraman,et al. Improving Text-to-Image Synthesis Using Contrastive Learning , 2021, BMVC.
[17] Jan Kautz,et al. Score-based Generative Modeling in Latent Space , 2021, NeurIPS.
[18] David J. Fleet,et al. Cascaded Diffusion Models for High Fidelity Image Generation , 2021, J. Mach. Learn. Res..
[19] Chang Zhou,et al. CogView: Mastering Text-to-Image Generation via Transformers , 2021, NeurIPS.
[20] Prafulla Dhariwal,et al. Diffusion Models Beat GANs on Image Synthesis , 2021, NeurIPS.
[21] David J. Fleet,et al. Image Super-Resolution via Iterative Refinement , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[22] Daniel Cohen-Or,et al. StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[23] Luc Van Gool,et al. Designing a Practical Degradation Model for Deep Blind Image Super-Resolution , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[24] Alec Radford,et al. Multimodal Neurons in Artificial Neural Networks , 2021 .
[25] Ilya Sutskever,et al. Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.
[26] Alec Radford,et al. Zero-Shot Text-to-Image Generation , 2021, ICML.
[27] Prafulla Dhariwal,et al. Improved Denoising Diffusion Probabilistic Models , 2021, ICML.
[28] Gigliola Vaglini,et al. Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search , 2021, IMPROVE.
[29] Ming-Hsuan Yang,et al. GAN Inversion: A Survey , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[30] Jing Yu Koh,et al. Cross-Modal Contrastive Learning for Text-to-Image Generation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[31] B. Ommer,et al. Taming Transformers for High-Resolution Image Synthesis , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[32] Rewon Child,et al. Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images , 2020, ICLR.
[33] S. Gelly,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.
[34] Jiaming Song,et al. Denoising Diffusion Implicit Models , 2020, ICLR.
[35] Ariel Kleiner,et al. Sharpness-Aware Minimization for Efficiently Improving Generalization , 2020, ICLR.
[36] Christopher D. Manning,et al. Contrastive Learning of Medical Visual Representations from Paired Images and Text , 2020, MLHC.
[37] Nicu Sebe,et al. DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis , 2020, ArXiv.
[38] Julien Perez,et al. Learning Visual Representations with Caption Annotations , 2020, ECCV.
[39] J. Kautz,et al. NVAE: A Deep Hierarchical Variational Autoencoder , 2020, NeurIPS.
[40] Pieter Abbeel,et al. Denoising Diffusion Probabilistic Models , 2020, NeurIPS.
[41] Stefano Ermon,et al. Improved Techniques for Training Score-Based Generative Models , 2020, NeurIPS.
[42] Justin Johnson,et al. VirTex: Learning Visual Representations from Textual Annotations , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[43] Tom B. Brown,et al. Language Models are Few-Shot Learners , 2020, NeurIPS.
[44] Ali Razavi,et al. Generating Diverse High-Fidelity Images with VQ-VAE-2 , 2019, NeurIPS.
[45] Wei Chen,et al. DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-To-Image Synthesis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[46] Zhe Gan,et al. AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[47] Frank Hutter,et al. Decoupled Weight Decay Regularization , 2017, ICLR.
[48] Oriol Vinyals,et al. Neural Discrete Representation Learning , 2017, NIPS.
[49] Sepp Hochreiter,et al. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium , 2017, NIPS.
[50] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[51] Alexei A. Efros,et al. Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.
[52] Surya Ganguli,et al. Deep Unsupervised Learning using Nonequilibrium Thermodynamics , 2015, ICML.
[53] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[54] Aaron C. Courville,et al. Generative adversarial networks , 2014, Commun. ACM.
[55] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.
[56] Naila Murray,et al. AVA: A large-scale database for aesthetic visual analysis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[57] Rosalie Kerr,et al. The Big Sleep , 1990, Science.
[58] Karl Pearson F.R.S.. LIII. On lines and planes of closest fit to systems of points in space , 1901 .