Real-time vocal tract modelling

To date, most speech synthesis techniques have represented the vocal tract as some form of filter, linear predictive coding (LPC) being a typical example. This paper describes the development of a physiologically realistic model of the vocal tract using the well-established technique of transmission line modelling (TLM). TLM is based on the principle of wave scattering at the boundaries between transmission line segments and may be used in one, two, or three dimensions. Here the vocal tract is modelled as a one-dimensional transmission line, with a six-port scattering node applied in the region separating the pharyngeal, oral, and nasal parts of the tract.
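The scattering principle can be illustrated with a minimal sketch of a lossless junction between tube branches. This is the generic textbook parallel-junction rule for pressure waves, not the paper's own six-port node, whose formulation is not given in this abstract. The function name `scatter_junction`, the use of cross-sectional area as a stand-in for acoustic admittance, and the example areas are all assumptions made for illustration.

```python
def scatter_junction(areas, incident):
    """Scatter pressure waves at a lossless junction of tube branches.

    areas    -- cross-sectional area of each branch meeting at the junction
                (hypothetical values; any consistent unit)
    incident -- pressure wave arriving at the junction from each branch

    Assumption: each branch has acoustic admittance proportional to its
    area (Y = A / (rho * c)), so the constant rho * c cancels out.
    Returns the pressure wave leaving the junction along each branch.
    """
    if len(areas) != len(incident):
        raise ValueError("areas and incident must have the same length")
    total_area = sum(areas)
    if total_area <= 0:
        raise ValueError("total area must be positive")

    # Pressure at the junction: twice the admittance-weighted mean
    # of the incident waves.
    p_junction = 2.0 * sum(a * p for a, p in zip(areas, incident)) / total_area

    # Pressure is continuous across the junction, so each outgoing wave
    # is the junction pressure minus that branch's own incident wave.
    return [p_junction - p for p in incident]


# Two segments of a one-dimensional line: an area step from 2 to 1.
two_port = scatter_junction([2.0, 1.0], [1.0, 0.0])

# Three equal branches, loosely analogous to a pharyngeal, oral,
# and nasal branch meeting at one point.
three_port = scatter_junction([1.0, 1.0, 1.0], [1.0, 0.0, 0.0])
```

With two branches, the rule reduces to the familiar reflection coefficient (A1 - A2) / (A1 + A2) at an area step. In a full one-dimensional model, a scattering step of this kind would be applied at every segment boundary on each time step, with the outgoing waves becoming the incident waves at neighbouring boundaries on the next step.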
