MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
暂无分享,去创建一个
Jiebo Luo | Jianlong Fu | Jingkuan Song | Huan Yang | Lianli Gao | Yongsheng Yu | Wenjing Wang | Huiguo He | Junchen Zhu | Wen-Huang Cheng | Zixi Tuo
[1] Jianlong Fu,et al. MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images , 2023, ACM Multimedia.
[2] Jianlong Fu,et al. VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation , 2023, ArXiv.
[3] Z. Li,et al. AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation , 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[4] Seung Wook Kim,et al. Align Your Latents: High-Resolution Video Synthesis with Latent Diffusion Models , 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[5] J. Liu,et al. Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation , 2023, IEEE Transactions on Multimedia.
[6] Jinyu Li,et al. Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers , 2023, ArXiv.
[7] Nicholas Jing Yuan,et al. MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[8] Yaniv Taigman,et al. Make-A-Video: Text-to-Video Generation without Text-Video Data , 2022, ICLR.
[9] B. Ommer,et al. High-Resolution Image Synthesis with Latent Diffusion Models , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[10] João F. Henriques,et al. Audio Retrieval With Natural Language Queries: A Benchmark Study , 2021, IEEE Transactions on Multimedia.
[11] Zeynep Akata,et al. Audio Retrieval with Natural Language Queries , 2021, Interspeech.
[12] Andrew Zisserman,et al. Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).
[13] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[14] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.