Quasi-syllabic and quasi-articulatory-gestural units for concatenative speech synthesis

In this paper we propose methods of speech segmentation and unit characterization that are motivated by prosodic and physiological principles. In particular, we motivate and describe algorithms for unit-database creation based on quasi-syllables and quasi-articulatory gestures that are defined and parameterized purely by acoustic measurements. This approach is intended to free concatenative speech synthesis from its reliance on the phonetic code.
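The abstract does not specify the segmentation procedure, but a minimal sketch may clarify what "defined purely by acoustic measurements" could look like in practice. The Python sketch below marks local minima of a smoothed short-time energy envelope as candidate quasi-syllable boundaries, so that no phonetic transcription is consulted. The function name, window and hop sizes, smoothing width, and the minimum-duration constraint are all illustrative assumptions, not parameters from the paper.

import numpy as np

def quasi_syllable_boundaries(signal, sr, win_ms=25.0, hop_ms=10.0,
                              smooth_frames=15, min_syll_ms=80.0):
    """Locate candidate quasi-syllable boundaries from the waveform alone.

    Frames the signal, computes a short-time log-energy envelope,
    smooths it, and returns the times (in seconds) of local energy
    minima separating successive energy peaks (syllable nuclei).
    All defaults are illustrative assumptions.
    """
    win = int(sr * win_ms / 1000)
    hop = int(sr * hop_ms / 1000)
    n_frames = 1 + max(0, (len(signal) - win) // hop)

    # Short-time log energy per frame.
    energy = np.array([
        np.log(np.sum(signal[i * hop:i * hop + win] ** 2) + 1e-10)
        for i in range(n_frames)
    ])

    # Moving-average smoothing to suppress pitch-rate fluctuation.
    kernel = np.ones(smooth_frames) / smooth_frames
    envelope = np.convolve(energy, kernel, mode="same")

    # Local minima of the envelope are candidate boundaries.
    minima = [i for i in range(1, len(envelope) - 1)
              if envelope[i] < envelope[i - 1]
              and envelope[i] <= envelope[i + 1]]

    # Enforce a minimum quasi-syllable duration between boundaries.
    min_gap = int(min_syll_ms / hop_ms)
    boundaries, last = [], -min_gap
    for i in minima:
        if i - last >= min_gap:
            boundaries.append(i * hop / sr)  # frame index -> seconds
            last = i
    return boundaries

if __name__ == "__main__":
    # Example: amplitude-modulated noise with ~4 energy peaks/second
    # stands in for a one-second speech-like signal.
    sr = 16000
    t = np.linspace(0, 1, sr, endpoint=False)
    sig = np.random.randn(sr) * (0.5 + 0.5 * np.sin(2 * np.pi * 4 * t))
    print(quasi_syllable_boundaries(sig, sr))

The same acoustic-only philosophy would then apply to characterizing the units, e.g. parameterizing each segment by envelope shape and spectral trajectories rather than by phone labels.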