Intonation Rules for Text Reading

Intonation is the cognitive aspect of the ensemble of pitch variations in the course of an utterance. This perceptual impression of speech melody correlates, to a first approximation, with changes in the fundamental frequency (F0) of the signal. This chapter presents a study of intonation patterns for text reading in Standard Colloquial Bengali, with the aim of developing rules and appropriate methods for using them in a text-to-speech (TTS) synthesis system. In the model presented here, the pitch movements at the syllabic level are considered to be basic. Syllabic stylization uses the closest linear fit obtained by linear regression, and the pitch movements are expressed in semitones per second. The sentence-level intonation pattern is the sequence of the word-level patterns constituting the sentence. This chapter also presents the statistical method for implementing the resulting rules in a TTS system. The model is tested by synthesizing several sentences, and the perceptual results are satisfactory.
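The syllabic stylization described above can be illustrated with a minimal sketch. It assumes that each syllable is given as a short list of time stamps (in seconds) and F0 samples (in Hz), that F0 is converted to semitones relative to an arbitrary 100 Hz reference, and that the line is fitted by ordinary least squares. The function names, the reference frequency, and the input layout are hypothetical and are not taken from the chapter.

```python
import math


def hz_to_semitones(f0_hz, ref_hz=100.0):
    """Convert a frequency in Hz to semitones relative to ref_hz (assumed reference)."""
    return 12.0 * math.log2(f0_hz / ref_hz)


def stylize_syllable(times_s, f0_hz, ref_hz=100.0):
    """Fit a straight line to one syllable's pitch track by least squares.

    Returns (slope, intercept): slope in semitones per second,
    intercept in semitones at t = 0.
    """
    if len(times_s) != len(f0_hz) or len(times_s) < 2:
        raise ValueError("need at least two (time, F0) samples of equal length")

    semitones = [hz_to_semitones(f, ref_hz) for f in f0_hz]
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_st = sum(semitones) / n

    # Ordinary least-squares estimates for a line st = slope * t + intercept.
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    if sxx == 0.0:
        raise ValueError("time stamps must not all be identical")
    sxy = sum((t - mean_t) * (s - mean_st) for t, s in zip(times_s, semitones))

    slope = sxy / sxx
    intercept = mean_st - slope * mean_t
    return slope, intercept


# Illustrative input: F0 rising one octave (100 Hz to 200 Hz) over 0.2 s.
times = [0.0, 0.05, 0.10, 0.15, 0.20]
f0 = [100.0 * 2.0 ** (t / 0.2) for t in times]
slope, intercept = stylize_syllable(times, f0)
print(f"slope = {slope:.1f} st/s, intercept = {intercept:.1f} st")
```

Working in semitones rather than Hz makes the slope independent of the speaker's pitch range, since an octave rise is 12 semitones whether it starts at 100 Hz or 200 Hz. In the illustrative input, one octave over 0.2 s corresponds to a movement of 60 semitones per second.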
