Individual variation of the hypopharyngeal cavities and its acoustic effects

Morphological measurements of the hypopharynx are conducted to investigate the correlation between fine structures of the vocal tract and speaker characteristics. The hypopharynx includes the laryngeal tube and bilateral cavities of the piriform fossa. MRI data during sustained phonation of the five Japanese vowels by four subjects are obtained to analyze intra- and inter-speaker variation of the hypopharynx. Morphological analysis on the mid-sagittal and transverse planes revealed that the shape of the hypopharynx was relatively stable, regardless of vowel type, in contrast to relatively large inter-speaker variation, and these results are confirmed quantitatively by a simple similarity method. The small intra-speaker variation of the hypopharynx is confirmed by further morphological analysis using high-quality MRI data for one of the subjects, obtained by using the “phonation-synchronized method” and “custom laryngeal coil.” Furthermore, acoustical effects of the individual variation of the hypopharynx are estimated by using a transmission line model. Vocal tract area function of one of the subjects above the hypopharynx is combined with the hypopharyngeal cavities of other subjects, and their transfer functions are calculated. The results show that the inter-speaker variation of the hypopharynx affects spectra in the frequency range beyond approximately 2.5 kHz.

[1]  J. Sundberg Articulatory interpretation of the "singing formant". , 1974, The Journal of the Acoustical Society of America.

[2]  René Causse,et al.  Input impedance of brass musical instruments—Comparison between experiment and numerical models , 1984 .

[3]  Sadaoki Furui,et al.  Perception of Voice Individuality and Physical Correlates , 1985 .

[4]  P. W. Nye,et al.  Analysis of vocal tract shape and dimensions using magnetic resonance imaging: vowels. , 1991, The Journal of the Acoustical Society of America.

[5]  C A Moore,et al.  The correspondence of vocal tract resonance with volumes obtained from magnetic resonance images. , 1992, Journal of speech and hearing research.

[6]  J. Dang,et al.  Morphological and acoustical analysis of the nasal and the paranasal cavities. , 1994, The Journal of the Acoustical Society of America.

[7]  Hideki Kasuya,et al.  Uniform and Non-uniform Normalization of Vocal Tracts Measured by MRI Across Male, Female and Child Subjects , 1995, IEICE Trans. Inf. Syst..

[8]  Shrikanth S. Narayanan,et al.  An articulatory study of fricative consonants using magnetic resonance imaging , 1995 .

[9]  Hideki Kasuya,et al.  Dimensional differences in the vocal tract shapes measured from MR images across boy, female and male subjects. , 1995 .

[10]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[11]  Hideki Kasuya,et al.  Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance images , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[12]  Shinji Maeda,et al.  Human palate and related structures: their articulatory consequences , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[13]  M. Akagi,et al.  Relationship between physical characteristics and speaker individualities in speech spectral envelopes , 1996 .

[14]  K Honda,et al.  Acoustic characteristics of the piriform fossa in models and humans. , 1997, The Journal of the Acoustical Society of America.

[15]  Shrikanth S. Narayanan,et al.  Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals , 1997 .

[16]  W. Fitch,et al.  Morphology and development of the human vocal tract: a study using magnetic resonance imaging. , 1999, The Journal of the Acoustical Society of America.

[17]  P. Robinson,et al.  Myoelectric control and assessment of teleoperated robot devices , 2003 .

[18]  Kiyoshi Honda,et al.  A method of tooth superimposition on MRI data for accurate measurement of vocal tract shape and dimensions , 2004 .

[19]  Kiyoshi Honda,et al.  Exploring Human Speech Production Mechanisms by MRI , 2004, IEICE Trans. Inf. Syst..