The relationship of vocal tract shape to three voice qualities.

Three-dimensional vocal tract shapes and consequent area functions representing the vowels [i, ae, a, u] have been obtained from one male and one female speaker using magnetic resonance imaging (MRI). The two speakers were trained vocal performers and both were adept at manipulation of vocal tract shape to alter voice quality. Each vowel was performed three times, each with one of the three voice qualities: normal, yawny, and twangy. The purpose of the study was to determine some ways in which the vocal tract shape can be manipulated to alter voice quality while retaining a desired phonetic quality. To summarize any overall tract shaping tendencies mean area functions were subsequently computed across the four vowels produced within each specific voice quality. Relative to normal speech, both the vowel area functions and mean area functions showed, in general, that the oral cavity is widened and tract length increased for the yawny productions. The twangy vowels were characterized by shortened tract length, widened lip opening, and a slightly constricted oral cavity. The resulting acoustic characteristics of these articulatory alterations consisted of the first two formants (F1 and F2) being close together for all yawny vowels and far apart for all the twangy vowels.

[1]  I R Titze,et al.  Vocal tract area functions for an adult female speaker based on volumetric imaging. , 1998, The Journal of the Acoustical Society of America.

[2]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[3]  P. W. Nye,et al.  Analysis of vocal tract shape and dimensions using magnetic resonance imaging: vowels. , 1991, The Journal of the Acoustical Society of America.

[4]  J. Laver The phonetic description of voice quality , 1980 .

[5]  Pierre Badin,et al.  Deriving vocal-tract area functions from midsagittal profiles and formant frequencies: A new model for vowels and fricative consonants based on experimental data , 1995, Speech Commun..

[6]  K Honda,et al.  Acoustic characteristics of the piriform fossa in models and humans. , 1997, The Journal of the Acoustical Society of America.

[7]  Hideki Kasuya,et al.  Accurate measurement of vocal tract shapes from magnetic resonance images of child, female and male subjects , 1994, ICSLP.

[8]  P. Ladefoged,et al.  The sounds of the world's languages , 1996 .

[9]  W. Fitch,et al.  Morphology and development of the human vocal tract: a study using magnetic resonance imaging. , 1999, The Journal of the Acoustical Society of America.

[10]  Ursula Gisela Goldstein,et al.  An articulatory model for the vocal tracts of growing children , 1980 .

[11]  K. Stevens,et al.  Development of a Quantitative Description of Vowel Articulation , 1955 .

[12]  Shrikanth S. Narayanan,et al.  An articulatory study of fricative consonants using magnetic resonance imaging , 1995 .

[13]  P. Mermelstein Determination of the vocal-tract shape from measured formant frequencies. , 1967, The Journal of the Acoustical Society of America.

[14]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[15]  H. Traunmüller Conventional, Biological and Environmental Factors in Speech Communication: A Modulation Theory , 1994, Phonetica.

[16]  Man Mohan Sondhi,et al.  A hybrid time-frequency domain articulatory speech synthesizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[17]  I R Titze,et al.  Some technical considerations in voice perturbation measurements. , 1987, Journal of speech and hearing research.

[18]  D. Broadbent,et al.  Information Conveyed by Vowels , 1957 .