Video-P2P: Video Editing with Cross-attention Control
暂无分享,去创建一个
[1] Patrick Esser,et al. Structure and Content-Guided Video Synthesis with Diffusion Models , 2023, 2023 IEEE/CVF International Conference on Computer Vision (ICCV).
[2] Y. Matias,et al. Dreamix: Video Diffusion Models are General Video Editors , 2023, ArXiv.
[3] Mike Zheng Shou,et al. Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation , 2022, 2023 IEEE/CVF International Conference on Computer Vision (ICCV).
[4] Nupur Kumari,et al. Multi-Concept Customization of Text-to-Image Diffusion , 2022, ArXiv.
[5] Akash Gokul,et al. EDICT: Exact Diffusion Inversion via Coupled Transformations , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[6] M. Irani,et al. SinFusion: Training Diffusion Models on a Single Image or Video , 2022, ArXiv.
[7] D. Cohen-Or,et al. Null-text Inversion for Editing Real Images using Guided Diffusion Models , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Alexei A. Efros,et al. InstructPix2Pix: Learning to Follow Image Editing Instructions , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[9] M. Irani,et al. Imagic: Text-Based Real Image Editing with Diffusion Models , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[10] David J. Fleet,et al. Imagen Video: High Definition Video Generation with Diffusion Models , 2022, ArXiv.
[11] D. Erhan,et al. Phenaki: Variable Length Video Generation From Open Domain Textual Description , 2022, ICLR.
[12] Yaniv Taigman,et al. Make-A-Video: Text-to-Video Generation without Text-Video Data , 2022, ICLR.
[13] Yuanzhen Li,et al. DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[14] J. Tenenbaum,et al. Prompt-to-Prompt Image Editing with Cross Attention Control , 2022, ICLR.
[15] Amit H. Bermano,et al. An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion , 2022, ICLR.
[16] Jonathan Ho. Classifier-Free Diffusion Guidance , 2022, ArXiv.
[17] Jing Yu Koh,et al. Scaling Autoregressive Models for Content-Rich Text-to-Image Generation , 2022, Trans. Mach. Learn. Res..
[18] Dani Lischinski,et al. Blended Latent Diffusion , 2022, ACM Trans. Graph..
[19] Wendi Zheng,et al. CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers , 2022, ICLR.
[20] Jie Tang,et al. CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers , 2022, NeurIPS.
[21] Prafulla Dhariwal,et al. Hierarchical Text-Conditional Image Generation with CLIP Latents , 2022, ArXiv.
[22] David J. Fleet,et al. Video Diffusion Models , 2022, NeurIPS.
[23] Tali Dekel,et al. Text2LIVE: Text-Driven Layered Image and Video Editing , 2022, ECCV.
[24] Yaniv Taigman,et al. Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors , 2022, ECCV.
[25] Prafulla Dhariwal,et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models , 2021, ICML.
[26] B. Ommer,et al. High-Resolution Image Synthesis with Latent Diffusion Models , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[27] Daniel Cohen-Or,et al. StyleGAN-NADA , 2021, ACM Trans. Graph..
[28] Guillermo Sapiro,et al. GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions , 2021, ArXiv.
[29] Daniel Cohen-Or,et al. StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[30] Ilya Sutskever,et al. Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.
[31] Alec Radford,et al. Zero-Shot Text-to-Image Generation , 2021, ICML.
[32] Yong Jae Lee,et al. SinGAN-GIF: Learning a Generative Video Model from a Single GIF , 2021, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).
[33] Jiaming Song,et al. Denoising Diffusion Implicit Models , 2020, ICLR.
[34] Stefano Ermon,et al. SDEdit: Image Synthesis and Editing with Stochastic Differential Equations , 2021, ArXiv.
[35] L. Wolf,et al. Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample , 2020, NeurIPS.
[36] Pieter Abbeel,et al. Denoising Diffusion Probabilistic Models , 2020, NeurIPS.
[37] Taesung Park,et al. Semantic Image Synthesis With Spatially-Adaptive Normalization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[38] Alexei A. Efros,et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[39] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[40] Oriol Vinyals,et al. Neural Discrete Representation Learning , 2017, NIPS.