Capture, Learning, and Synthesis of 3D Speaking Styles
暂无分享,去创建一个
Michael J. Black | Anurag Ranjan | Timo Bolkart | Cassidy Laidlaw | Daniel Cudeiro | Cassidy Laidlaw | Anurag Ranjan | Timo Bolkart | Daniel Cudeiro
[1] Shaun J. Canavan,et al. BP4D-Spontaneous: a high-resolution spontaneous 3D dynamic facial expression database , 2014, Image Vis. Comput..
[2] Georgios Tzimiropoulos,et al. How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[3] Martín Abadi,et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.
[4] Christoph Bregler,et al. Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.
[5] Mark Pauly,et al. Realtime performance-based facial animation , 2011, ACM Trans. Graph..
[6] M. Pauly,et al. Example-based facial rigging , 2010, ACM Trans. Graph..
[7] Fei Yang,et al. Facial expression editing in video using a temporally-smooth factorization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.
[8] Chenliang Xu,et al. Lip Movements Generation at a Glance , 2018, ECCV.
[9] Dimitrios Tzionas,et al. Expressive Body Capture: 3D Hands, Face, and Body From a Single Image , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
[10] Jaakko Lehtinen,et al. Audio-driven facial animation by joint end-to-end learning of pose and emotion , 2017, ACM Trans. Graph..
[11] Michael J. Black,et al. Generating 3D faces using Convolutional Mesh Autoencoders , 2018, ECCV.
[12] Shaun J. Canavan,et al. Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[13] Yisong Yue,et al. A deep learning approach for generalized speech animation , 2017, ACM Trans. Graph..
[14] Thomas S. Huang,et al. Real-time speech-driven face animation with expressions using neural networks , 2002, IEEE Trans. Neural Networks.
[15] Moshe Mahler,et al. Dynamic units of visual speech , 2012, SCA '12.
[16] Paul Debevec,et al. The Digital Emily project: photoreal facial modeling and animation , 2009, SIGGRAPH '09.
[17] Lei Xie,et al. Head motion synthesis from speech using deep neural networks , 2015, Multimedia Tools and Applications.
[18] Adrian Hilton,et al. A FACS valid 3D dynamic action unit database with applications to 3D dynamic morphable facial modeling , 2011, 2011 International Conference on Computer Vision.
[19] Qiang Huo,et al. Video-audio driven real-time facial animation , 2015, ACM Trans. Graph..
[20] P. Cochat,et al. Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.
[21] Subhransu Maji,et al. Visemenet , 2018, ACM Trans. Graph..
[22] Erich Elsen,et al. Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.
[23] Justus Thies,et al. Face2Face: real-time face capture and reenactment of RGB videos , 2019, Commun. ACM.
[24] Lijun Yin,et al. A high-resolution 3D dynamic facial expression database , 2008, 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition.
[25] Yiying Tong,et al. FaceWarehouse: A 3D Facial Expression Database for Visual Computing , 2014, IEEE Transactions on Visualization and Computer Graphics.
[26] Stefanos Zafeiriou,et al. 4DFAB: A Large Scale 4D Database for Facial Expression Analysis and Biometric Applications , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[27] Zhigang Deng,et al. Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[28] Björn Granström,et al. SynFace—Speech-Driven Facial Animation for Virtual Speech-Reading Support , 2009, EURASIP J. Audio Speech Music. Process..
[29] Kun Zhou,et al. Displaced dynamic expression regression for real-time facial tracking and animation , 2014, ACM Trans. Graph..
[30] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.
[31] Frank K. Soong,et al. A deep bidirectional LSTM approach for video-realistic talking head , 2016, Multimedia Tools and Applications.
[32] Michael J. Black,et al. Learning a model of facial shape and expression from 4D scans , 2017, ACM Trans. Graph..
[33] Luiz Velho,et al. Automatic 3D Facial Expression Analysis in Videos , 2005, AMFG.
[34] Yangang Wang,et al. Online modeling for realtime facial animation , 2013, ACM Trans. Graph..
[35] Wojciech Matusik,et al. Video face replacement , 2011, ACM Trans. Graph..
[36] Lei Xie,et al. Photo-real talking head with deep bidirectional LSTM , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[37] Lei Xie,et al. Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling , 2007, IEEE Transactions on Multimedia.
[38] Jian Zhang,et al. SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.
[39] Jovan Popovic,et al. Deformation transfer for triangle meshes , 2004, ACM Trans. Graph..
[40] Arman Savran,et al. Bosphorus Database for 3D Face Analysis , 2008, BIOID.
[41] Frank K. Soong,et al. Text Driven 3D Photo-Realistic Talking Head , 2011, INTERSPEECH.
[42] A. Esposito,et al. Speech driven facial animation , 2001, PUI '01.
[43] Jaakko Lehtinen,et al. Production-level facial performance capture using deep convolutional neural networks , 2016, Symposium on Computer Animation.
[44] Tony Ezzat,et al. Trainable videorealistic speech animation , 2002, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..
[45] Thabo Beeler,et al. Real-time high-fidelity facial performance capture , 2015, ACM Trans. Graph..
[46] Timo Bolkart,et al. A Groupwise Multilinear Correspondence Optimization for 3D Faces , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).
[47] Kevin Wilson,et al. Looking to listen at the cocktail party , 2018, ACM Trans. Graph..
[48] Frank K. Soong,et al. A new language independent, photo-realistic talking head driven by voice only , 2013, INTERSPEECH.
[49] Jun Wang,et al. A 3D facial expression database for facial behavior research , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).
[50] Eugene Fiume,et al. JALI , 2016, ACM Trans. Graph..
[51] Matthew Brand,et al. Voice puppetry , 1999, SIGGRAPH.
[52] Hai Xuan Pham,et al. Speech-Driven 3D Facial Animation with Implicit Emotional Awareness: A Deep Learning Approach , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).
[53] Björn Stenger,et al. Expressive Visual Text-to-Speech Using Active Appearance Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.
[54] Hanspeter Pfister,et al. Face transfer with multilinear models , 2005, ACM Trans. Graph..
[55] Hynek Hermansky,et al. RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..
[56] Keiichi Tokuda,et al. HMM-based text-to-audio-visual speech synthesis , 2000, INTERSPEECH.
[57] Derek Bradley,et al. An anatomically-constrained local deformation model for monocular face capture , 2016, ACM Trans. Graph..
[58] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[59] Jonathan G. Fiscus,et al. Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .
[60] Joo-Ho Lee,et al. Talking heads synthesis from audio with deep neural networks , 2015, 2015 IEEE/SICE International Symposium on System Integration (SII).
[61] Ira Kemelmacher-Shlizerman,et al. Synthesizing Obama , 2017, ACM Trans. Graph..
[62] Luc Van Gool,et al. A 3-D Audio-Visual Corpus of Affective Communication , 2010, IEEE Transactions on Multimedia.
[63] Stefano Berretti,et al. A 3D Dynamic Database for Unconstrained Face Recognition , 2014 .
[64] Frédéric H. Pighin,et al. Expressive speech-driven facial animation , 2005, TOGS.