Comparison of vocal tract transfer functions calculated using one-dimensional and three-dimensional acoustic simulation methods

Acoustic characteristics of the vocal tract have been investigated extensively in the literature using a onedimensional (1D) acoustic simulation method. Because the 1D method assumes plane wave propagation only, it is recognized to be valid only in the low frequency region (below about 4 or 5 kHz). Recently, a three-dimensional (3D) acoustic simulation method was developed, to obtain more precise acoustic characteristics of the vocal tract. In the present study, from a male’s vocal tract shapes, transfer functions were calculated using the 1D and 3D methods and compared with each other to evaluate the valid frequency range of the 1D method. As a result, when acoustic effects of the piriform fossae were considered in the 1D method, the transfer functions agreed with each other up to 7 kHz (ignoring small dips). The 3D method showed that a deep dip was generated at around 8 kHz by the transverse resonance mode in the pharynx. Above this dip frequency, the transfer functions disagreed with each other. Thus, the 1D method is valid up to 7 kHz for this subject. Because this subject has a relatively large vocal tract, in general the upper limit of the valid frequency range could exceed 8 kHz.

[1]  S Adachi,et al.  An acoustical study of sound production in biphonic singing, Xöömij. , 1999, The Journal of the Acoustical Society of America.

[2]  F. Asano,et al.  An optimum computer‐generated pulse signal suitable for the measurement of very long impulse responses , 1995 .

[3]  P. W. Nye,et al.  Analysis of vocal tract shape and dimensions using magnetic resonance imaging: vowels. , 1991, The Journal of the Acoustical Society of America.

[4]  Shinji Maeda,et al.  A digital simulation method of the vocal-tract system , 1982, Speech Commun..

[5]  Kiyoshi Honda,et al.  Transfer functions of solid vocal-tract models constructed from ATR MRI database of Japanese vowel production , 2009 .

[6]  Keisuke Kinoshita,et al.  Measurement of cricothyroid articulation using high-resolution MRI and 3d pattern matching , 2005, MAVEBA.

[7]  Tatsuya Kitamura,et al.  Acoustic interaction between the right and left piriform fossae in generating spectral dips. , 2013, The Journal of the Acoustical Society of America.

[8]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[9]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[10]  K Honda,et al.  Acoustic characteristics of the piriform fossa in models and humans. , 1997, The Journal of the Acoustical Society of America.

[11]  B. Story A parametric model of the vocal tract area function for vowel and consonant simulation. , 2005, The Journal of the Acoustical Society of America.

[12]  Shinobu Masaki,et al.  Measurement of temporal changes in vocal tract area function from 3D cine-MRI data. , 2006, The Journal of the Acoustical Society of America.

[13]  Man Mohan Sondhi,et al.  A hybrid time-frequency domain articulatory speech synthesizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[14]  N Nagai,et al.  Measurement of sound-pressure distribution in replicas of the oral cavity. , 1992, The Journal of the Acoustical Society of America.

[15]  W. Fitch,et al.  Morphology and development of the human vocal tract: a study using magnetic resonance imaging. , 1999, The Journal of the Acoustical Society of America.

[16]  Coarticulation • Suprasegmentals,et al.  Acoustic Phonetics , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[17]  Tatsuya Kitamura,et al.  Acoustic analysis of the vocal tract during vowel production by finite-difference time-domain method. , 2008, The Journal of the Acoustical Society of America.

[18]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .