Articulatory control of a vocal tract model based on fractional delay waveguide filters

A novel technique to implement and control an acoustic tube model of the human vocal tract is introduced. This model is an extension to the traditional Kelly-Lochbaum model, since not only the diameter of individual uniform tube sections but also their length, i.e., the positions of scattering junctions, can be continuously varied. The vocal tract model is implemented by means of FIR-type interpolation and deinterpolation that are used to locate the junctions. The authors show that in this kind of model the articulators can be presented in a natural manner enabling easy control of the model from articulatory data.<<ETX>>

[1]  P. Badin,et al.  Vocal tract simulation: Implementation of continuous variations of the length in a Kelly-Lochbaum model, effects of area function spatial sampling , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Matti Karjalainen,et al.  Fractional delay digital filters , 1993, 1993 IEEE International Symposium on Circuits and Systems.

[3]  Thomas Baer,et al.  An articulatory synthesizer for perceptual research , 1978 .

[4]  Julius O. Smith,et al.  Physical Modeling Using Digital Waveguides , 1992 .

[5]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[6]  Matti Karjalainen,et al.  Articulatory speech synthesis based on fractional delay waveguide filters , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  H. Strube,et al.  A quasiarticulatory speech synthesizer for German language running in real time , 1989 .

[8]  F. J. Owens,et al.  An optimized multirate sampling technique for the dynamic variation of vocal tract length in the Kelly-Lochbaum speech synthesis model , 1993, IEEE Trans. Speech Audio Process..

[9]  Matti Karjalainen,et al.  Modeling of Woodwind Bores with Finger Holes , 1993, ICMC.

[10]  H. Strube Sampled−data representation of a nonuniform lossless tube of continuously variable length , 1975 .

[11]  Bernd J. Kröger A gestural approach for controlling an articulatory speech synthesizer , 1993, EUROSPEECH.

[12]  John Nicholas Holmes,et al.  Speech synthesis , 1972 .

[13]  C.H. Coker,et al.  A model of articulatory dynamics and control , 1976, Proceedings of the IEEE.