Measurement of temporal changes in vocal tract area function from 3D cine-MRI data.

A 3D cine-MRI technique was developed based on a synchronized sampling method [Masaki et al., J. Acoust. Soc. Jpn. E 20, 375-379 (1999)] to measure the temporal changes in the vocal tract area function during a short utterance /aiueo/ in Japanese. A time series of head-neck volumes was obtained after 640 repetitions of the utterance produced by a male speaker, from which area functions were extracted frame-by-frame. A region-based analysis showed that the volumes of the front and back cavities tend to change reciprocally and that the areas near the larynx and posterior edge of the hard palate were almost constant throughout the utterance. The lower four formants were calculated from all the area functions and compared with those of natural speech sounds. The mean absolute percent error between calculated and measured formants among all the frames was 4.5%. The comparison of vocal tract shapes for the five vowels with those from the static MRI method suggested a problem of MRI observation of the vocal tract: data from static MRI tend to result in a deviation from natural vocal tract geometry because of the gravity effect.

[1]  W S Levine,et al.  Modeling tongue surface contours from Cine-MRI images. , 2001, Journal of speech, language, and hearing research : JSLHR.

[2]  A. Alwan,et al.  Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals. , 1997, The Journal of the Acoustical Society of America.

[3]  Shinobu Masaki,et al.  Difference in vocal tract shape between upright and supine postures: Observations by an open-type MRI scanner , 2005 .

[4]  Kiyoshi Honda,et al.  A method of tooth superimposition on MRI data for accurate measurement of vocal tract shape and dimensions , 2004 .

[5]  Hideki Kasuya,et al.  Accurate measurement of vocal tract shapes from magnetic resonance images of child, female and male subjects , 1994, ICSLP.

[6]  W S Levine,et al.  Modeling the motion of the internal tongue from tagged cine-MRI images. , 2001, The Journal of the Acoustical Society of America.

[7]  Arne Kjell Foldvik,et al.  MRI (magnetic resonance imaging) film of articulatory movements , 1990, ICSLP.

[8]  Shrikanth S. Narayanan,et al.  An articulatory study of fricative consonants using magnetic resonance imaging , 1995 .

[9]  Shrikanth S. Narayanan,et al.  Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part II. The rhotics. , 1997, The Journal of the Acoustical Society of America.

[10]  M. Stone A three-dimensional model of tongue movement based on ultrasound and x-ray microbeam data. , 1990, The Journal of the Acoustical Society of America.

[11]  René Causse,et al.  Input impedance of brass musical instruments—Comparison between experiment and numerical models , 1984 .

[12]  I Narabayashi,et al.  Blueberry juice: preliminary evaluation as an oral contrast agent in gastrointestinal MR imaging. , 1995, Radiology.

[13]  P. Boesiger,et al.  SENSE: Sensitivity encoding for fast MRI , 1999, Magnetic resonance in medicine.

[14]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[15]  I R Titze,et al.  Vocal tract area functions for an adult female speaker based on volumetric imaging. , 1998, The Journal of the Acoustical Society of America.

[16]  Shrikanth S. Narayanan,et al.  Geometry, kinematics, and acoustics of Tamil liquid consonants. , 1999, The Journal of the Acoustical Society of America.

[17]  Shinobu Masaki,et al.  MRI-based speech production study using a synchronized sampling method , 1999 .

[18]  Kiyoshi Honda,et al.  Exploring Human Speech Production Mechanisms by MRI , 2004, IEICE Trans. Inf. Syst..

[19]  I R Titze,et al.  The relationship of vocal tract shape to three voice qualities. , 2001, The Journal of the Acoustical Society of America.

[20]  S Adachi,et al.  An acoustical study of sound production in biphonic singing, Xöömij. , 1999, The Journal of the Acoustical Society of America.

[21]  P. W. Nye,et al.  Analysis of vocal tract shape and dimensions using magnetic resonance imaging: vowels. , 1991, The Journal of the Acoustical Society of America.

[22]  C. Higgins,et al.  Quantification of cardiac function by conventional and cine magnetic resonance imaging , 2007, CardioVascular and Interventional Radiology.

[23]  Shrikanth S. Narayanan,et al.  Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals , 1997 .

[24]  Steve R. Gunn,et al.  Using MRI to image the moving vocal tract during speech , 1997, EUROSPEECH.

[25]  Arne Kjell Foldvik,et al.  A time-evolving three-dimensional vocal tract model by means of magnetic resonance imaging (MRI) , 1993, EUROSPEECH.

[26]  J. Rokkaku,et al.  Measurements of the three-dimensional shape of the vocal tract based on the magnetic resonance imaging technique , 1986 .

[27]  Shinji Maeda,et al.  Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model , 1990 .

[28]  K Honda,et al.  Acoustic characteristics of the piriform fossa in models and humans. , 1997, The Journal of the Acoustical Society of America.

[29]  P. Ladefoged,et al.  Factor analysis of tongue shapes. , 1971, Journal of the Acoustical Society of America.

[30]  Katsuhiko Shirai,et al.  ARTICULATORY MODEL AND THE ESTIMATION OF ARTICULATORY PARAMETERS BY NONLINEAR REGRESSION METHOD. , 1976 .

[31]  Kiyoshi Honda,et al.  Individual variation of the hypopharyngeal cavities and its acoustic effects , 2005 .

[32]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.