Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis
暂无分享,去创建一个
[1] Gayle M. Ayers. Nuclear Accent Types and Prominence: Some Psycholinguistic Experiments / , 1996 .
[2] Ann K. Syrdal,et al. Inter-transcriber reliability of toBI prosodic labeling , 2000, INTERSPEECH.
[3] Mari Ostendorf,et al. Automatic labeling of prosodic patterns , 1994, IEEE Trans. Speech Audio Process..
[4] Angelien Sanderman,et al. On the perceptual strength of prosodic boundaries and its relation to suprasegmental cues , 1994 .
[5] Eric Moulines,et al. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones , 1989, Speech Commun..
[6] G. Fant,et al. Speech , Music and Hearing Quarterly Progress and Status Report Preliminaries to the study of Swedish prose reading and reading style , 2007 .
[7] Barbara Heuft,et al. Towards a prominence-based synthesis system , 1997, Speech Commun..
[8] Thierry Dutoit,et al. Diphone concatenation using a harmonic plus noise model of speech , 1997, EUROSPEECH.
[9] Julia Hirschberg,et al. Automatic ToBI prediction and alignment to speed manual labeling of prosody , 2001, Speech Commun..