On the ability of a physiologically constrained area function model of the vocal tract to produce normal formant patterns under perturbed conditions.

An area function model of the vocal tract is tested for its ability to produce typical vowel formant frequencies with a perturbation at the lips. The model, which consists of a neutral shape and two weighted orthogonal shaping patterns (modes), has previously been shown to produce a nearly one-to-one mapping between formant frequencies and the weighting coefficients of the modes [Story and Titze, J. Phonetics, 26, 223-260 (1998)]. In this study, a perturbation experiment was simulated by imposing a constant area "lip tube" on the model. The mapping between the mode coefficients and formant frequencies was then recomputed with the lip tube in place and showed that formant frequencies (F1 and F2) representative of the vowels [u,o,u] could no longer be produced with the model. However, when the mode coefficients were allowed to exceed their typical bounding values, the mapping between them and the formant frequencies was expanded such that the vowels [u,o,u] were compensated. The area functions generated by these exaggerated coefficients were shown to be similar to vocal-tract shapes reported for real speakers under similar perturbed conditions [Savariaux, Perrier, and Orliaguet, J. Acoust. Soc. Am., 98, 2428-2442 (1995)]. This suggests that the structure of this particular model captures some of the human ability to configure the vocal-tract shape under both ordinary and extraordinary conditions.

[1]  K. Honda Organization of tongue articulation for vowels , 1996 .

[2]  Victor Sorokin,et al.  Internal model in articulatory control: Evidence from speaking without larynx , 1998, Speech Commun..

[3]  F. Guenther,et al.  A theoretical investigation of reference frames for the planning of speech movements. , 1998, Psychological review.

[4]  H. Strube,et al.  A quasiarticulatory speech synthesizer for German language running in real time , 1989 .

[5]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[6]  G. E. Peterson,et al.  Control Methods Used in a Study of the Vowels , 1951 .

[7]  Shrikanth S. Narayanan,et al.  Acoustics of children's speech: developmental changes of temporal and spectral parameters. , 1999, The Journal of the Acoustical Society of America.

[8]  Brad H. Story,et al.  Parameterization of vocal tract area functions by empirical orthogonal modes , 1998 .

[9]  James Lubker,et al.  Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predict , 1977 .

[10]  Man Mohan Sondhi,et al.  A hybrid time-frequency domain articulatory speech synthesizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[11]  S. Maeda,et al.  From EMG to Formant Patterns of Vowels: The Implication of Vowel Spaces , 1994, Phonetica.

[12]  I R Titze,et al.  Vocal tract area functions for an adult female speaker based on volumetric imaging. , 1998, The Journal of the Acoustical Society of America.

[13]  Maureen Stone,et al.  Compensatory vowel characteristics resulting from the presence of different types of experimental dental prostheses , 1976 .

[14]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[15]  Kazuya Takeda,et al.  An Acoustically Oriented Vocal-Tract Model , 1996 .

[16]  M. Stone,et al.  Compensatory alveolar consonant production induced by wearing a dental prosthesis , 1978 .

[17]  P. Ladefoged,et al.  Factor analysis of tongue shapes. , 1971, Journal of the Acoustical Society of America.

[18]  Brad H. Story,et al.  A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function , 2002, J. Phonetics.

[19]  I R Titze,et al.  The relationship of vocal tract shape to three voice qualities. , 2001, The Journal of the Acoustical Society of America.

[20]  Pascal Perrier,et al.  Compensation strategies for the perturbation of the rounded vowel [u] using a lip-tube : A study of the control space in speech production , 1995 .