Influence of Phone-Viseme Temporal Correlations on Audiovisual STT and TTS Performance
[1] Gérard Bailly, et al. Learning optimal audiovisual phasing for an HMM-based control model for facial animation, 2007, SSW.
[2] Kate Saenko, et al. Audiovisual speech recognition with articulator positions as hidden variables, 2007.
[3] Y. Tohkura, et al. Inter-language differences in the influence of visual cues in speech perception, 1993.
[4] Andrey Ronzhin, et al. Audio-visual speech asynchrony modeling in a talking head, 2009, INTERSPEECH.
[5] Andrey Ronzhin, et al. Viseme-dependent weight optimization for CHMM-based audio-visual speech recognition, 2010, INTERSPEECH.
[6] Kevin P. Murphy, et al. A coupled HMM for audio-visual speech recognition, 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[7] Wesley Mattheyses, et al. On the Importance of Audiovisual Coherence for the Perceived Quality of Synthesized Visual Speech, 2009, EURASIP J. Audio Speech Music. Process.
[8] Valerie Hazan, et al. Language effects on the degree of visual influence in audiovisual speech perception, 2007.
[9] Attila Tihanyi, et al. Temporal asymmetry in relations of acoustic and visual features of speech, 2007, 2007 15th European Signal Processing Conference.
[10] C. Browman, et al. Articulatory Phonology: An Overview, 1992, Phonetica.