Sistemas de conversão texto-fala
暂无分享,去创建一个
[1] Jan P. H. van Santen,et al. Assignment of segmental duration in text-to-speech synthesis , 1994, Comput. Speech Lang..
[2] Paul Taylor,et al. The tilt intonation model , 1998, ICSLP.
[3] Gérard Bailly,et al. Characterisation of rhythmic patterns for text-to-speech synthesis , 1994, Speech Communication.
[4] Isabel Trancoso,et al. Grapheme-to-phone using finite-state transducers , 2002, Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002..
[5] António J. S. Teixeira,et al. European portuguese nasal vowels: an EMMA study , 2001, INTERSPEECH.
[6] D. Klatt. Linguistic uses of segmental duration in English: acoustic and perceptual evidence. , 1976, The Journal of the Acoustical Society of America.
[7] Alex Acero,et al. Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .
[8] Mari Ostendorf,et al. TOBI: a standard for labeling English prosody , 1992, ICSLP.
[9] Daniel Hirst,et al. Levels of Representation and Levels of Analysis for the Description of Intonation Systems , 2000 .
[10] Stephen Isard,et al. Segment durations in a syllable frame , 1991 .
[11] Eric Moulines,et al. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..
[12] Dennis H. Klatt,et al. Software for a cascade/parallel formant synthesizer , 1980 .