Fundamental Frequency Contour Synthesis for Turkish Text-to-speech

This M.S. thesis addresses fundamental frequency contour synthesis of Turkish sentences for text-to-speech systems. Because intonation and stress are the most prominent determinants of fundamental frequency, the work focuses mainly on these two prosodic phenomena. The fundamental frequency contours of the sentences were analyzed in detail, and the findings are presented. Combining these findings with the theoretical background, a fundamental frequency contour synthesis system was devised that relies on a sentence database and uses the syntactic structure of sentences based on word categories and word stress information. The work is based on recordings of a single speaker. Unlike conventional prosody generation systems, the system does not include prosodic phrasing or accent labeling, both of which require extensive text analysis. The method yielded encouraging results for future research.
