A digital simulation method of the vocal-tract system

Abstract A time-domain simulation method of the vocal-tract system is described. The system, composed of a constant air pressure source, a time-varying narrow section representing the glottis and a tube corresponding to the vocal tract coupled with the nasal cavity was assumed as a vocal-tract model. The acoustic equations that govern the generation and the propagation of acoustic waves inside the model were transformed into the discrete variable representation by applying certain rules; the rectangular rule in space and the trapezoid rule in time. This particular manner of discretization causes spectral distortion due to the frequency warping. A theoretical analysis indicated that this warping can be interpreted as the manifestation of the frequency dependent phase velocity of waves in the discrete system. The magnitude of the warping depends on both the sampling frequency (fs) and the sampling interval in space (X). Eleven French vowels synthesized with fs = 20 kHz and X = 1 cm sounded very natural and highly intelligible, even though a trace of the frequency warping was noticeable at the third formant frequency region on their spectra. When fs = 40 kHz was used, the effect of the spectral distortion became practically negligible for frequencies below 4 kHz.

[1]  H. K. Dunn The Calculation of Vowel Resonances, and an Electrical Vocal Tract , 1950 .

[2]  Louis-Jean Boë Introduction à la phonétique acoustique , 1972 .

[3]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[4]  O. Fujimura,et al.  Sweep-tone measurements of vocal-tract characteristics. , 1971, The Journal of the Acoustical Society of America.

[5]  B. Guerin,et al.  A voice source taking account of coupling with the supraglottal cavities , 1976, ICASSP.

[6]  J. Flanagan,et al.  Signal models for low bit‐rate coding of speech , 1980 .

[7]  T. Chiba The vowel, its nature and structure , 1958 .

[8]  K. Stevens Airflow and Turbulence Noise for Fricative and Stop Consonants: Static Considerations , 1971 .

[9]  Michael Rodney Portnoff A quasi-one-dimensional digital simulation for the time-varying vocal tract. , 1973 .

[10]  G Fant,et al.  The Relations between Area Functions and the Acoustic Signal , 1980, Phonetica.

[11]  K. Stevens,et al.  An Electrical Analog of the Vocal Tract , 1953 .

[12]  A. Rosenberg Effect of glottal pulse shape on the quality of natural vowels. , 1969 .

[13]  J. L. Flanagan,et al.  Synthesis of speech from a dynamic model of the vocal cords and vocal tract , 1975, The Bell System Technical Journal.

[14]  K. Stevens,et al.  Estimation of formant band widths from measurements of transient response of the vocal tract. , 1958, Journal of Speech and Hearing Disorders.

[15]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[16]  James L. Flanagan,et al.  Direct determination of vocal tract wall impedance , 1975 .

[17]  T Kaneko,et al.  Input acoustic-impedance measurement of the subglottal system. , 1976, The Journal of the Acoustical Society of America.

[18]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.

[19]  M. Mrayati,et al.  Relations entre la forme du conduit vocal et les caractéristiques acoustiques des voyelles françaises , 1976 .