Talking Face Generation by Conditional Recurrent Adversarial Network
暂无分享,去创建一个
Jingwen Zhu | Yang Song | Xiaolong Wang | Hairong Qi | Yang Song | H. Qi | Xiaolong Wang | Dawei Li | Jingwen Zhu
[1] Joon Son Chung,et al. You said that? , 2017, BMVC.
[2] Chenliang Xu,et al. Deep Cross-Modal Audio-Visual Generation , 2017, ACM Multimedia.
[3] Chenliang Xu,et al. Lip Movements Generation at a Glance , 2018, ECCV.
[4] Jaakko Lehtinen,et al. Audio-driven facial animation by joint end-to-end learning of pose and emotion , 2017, ACM Trans. Graph..
[5] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[6] Yang Song,et al. Decoupled Learning for Conditional Adversarial Networks , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).
[7] Jan Kautz,et al. MoCoGAN: Decomposing Motion and Content for Video Generation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[8] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[9] Joon Son Chung,et al. VoxCeleb: A Large-Scale Speaker Identification Dataset , 2017, INTERSPEECH.
[10] Hang Zhou,et al. Talking Face Generation by Adversarially Disentangled Audio-Visual Representation , 2018, AAAI.
[11] Hairong Qi,et al. Image Super-Resolution by Neural Texture Transfer , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[12] Shuang Wei,et al. Computer vision aided lip movement correction to improve English pronunciation , 2014 .
[13] Jan Kautz,et al. Video-to-Video Synthesis , 2018, NeurIPS.
[14] Yisong Yue,et al. A deep learning approach for generalized speech animation , 2017, ACM Trans. Graph..
[15] Yang Song,et al. Age Progression/Regression by Conditional Adversarial Autoencoder , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[16] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.
[17] Yang Song,et al. Recursive Cross-Domain Face/Sketch Generation from Limited Facial Parts , 2017, ArXiv.
[18] Alexei A. Efros,et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[19] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.
[20] Naomi Harte,et al. TCD-TIMIT: An Audio-Visual Corpus of Continuous Speech , 2015, IEEE Transactions on Multimedia.
[21] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[22] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[23] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.
[24] Dimitris N. Metaxas,et al. StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).
[25] Antonio Torralba,et al. Generating Videos with Scene Dynamics , 2016, NIPS.
[26] Yoshua Bengio,et al. ObamaNet: Photo-realistic lip-sync from text , 2017, ArXiv.
[27] Eric P. Xing,et al. Dual Motion GAN for Future-Flow Embedded Video Prediction , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[28] Yann LeCun,et al. Deep multi-scale video prediction beyond mean square error , 2015, ICLR.
[29] Seunghoon Hong,et al. Decomposing Motion and Content for Natural Video Sequence Prediction , 2017, ICLR.
[30] 拓海 杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .
[31] Michael I. Jordan,et al. Advances in Neural Information Processing Systems 30 , 1995 .
[32] Yang Song,et al. r-BTN: Cross-Domain Face Composite and Synthesis From Limited Facial Patches , 2018, AAAI.
[33] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.
[34] Nitish Srivastava,et al. Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.
[35] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.
[36] Ira Kemelmacher-Shlizerman,et al. Synthesizing Obama , 2017, ACM Trans. Graph..
[37] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).