An approach to real-time magnetic resonance imaging for speech production.

Magnetic resonance imaging (MRI) has served as a valuable tool for studying static postures in speech production. Now, recent improvements in temporal resolution are making it possible to examine the dynamics of vocal-tract shaping during fluent speech using MRI. The present study uses spiral k-space acquisitions with a low flip-angle gradient echo pulse sequence on a conventional GE Signa 1.5-T CV/i scanner. This strategy allows for acquisition rates of 8-9 images per second and reconstruction rates of 20-24 images per second, making veridical movies of speech production now possible. Segmental durations, positions, and interarticulator timing can all be quantitatively evaluated. Data show clear real-time movements of the lips, tongue, and velum. Sample movies and data analysis strategies are presented.

[1]  Bob S. Hu,et al.  Fast Spiral Coronary Artery Imaging , 1992, Magnetic resonance in medicine.

[2]  R F Wolf,et al.  On the relation between the dimensions and resonance characteristics of the vocal tract: a study with MRI. , 1992, Magnetic resonance imaging.

[3]  Abeer Alwan,et al.  Noise source models for fricative consonants , 2000, IEEE Trans. Speech Audio Process..

[4]  Alexander Zimmermann,et al.  Assessment of consonant articulation in glossectomee speech by dynamic MRI , 2002, INTERSPEECH.

[5]  Timothy F. Cootes,et al.  Use of active shape models for locating structures in medical images , 1994, Image Vis. Comput..

[6]  K. Moll,et al.  Cinefluorographic Study of Selected Allophones of English /I/ , 1975, Phonetica.

[7]  J M Rubin,et al.  Pseudo-three-dimensional reconstruction of ultrasonic images of the tongue. , 1989, The Journal of the Acoustical Society of America.

[8]  Abeer Alwan,et al.  Acoustic modelling of American English /r/ , 1997, EUROSPEECH.

[9]  Marc M. Van Hulle,et al.  A phase-based approach to the estimation of the optical flow field using spatial filtering , 2002, IEEE Trans. Neural Networks.

[10]  Shrikanth S. Narayanan,et al.  An articulatory study of fricative consonants using magnetic resonance imaging , 1995 .

[11]  J D Subtelny,et al.  Cineradiographic study of sibilants. , 1972, Folia phoniatrica.

[12]  Didier Demolin,et al.  REAL TIME MRI AND ARTICULATORY COORDINATIONS IN VOWELS , 2000 .

[13]  Douglas C. Noll,et al.  Deblurring for non‐2D fourier transform magnetic resonance imaging , 1992, Magnetic resonance in medicine.

[14]  F G Shellock,et al.  Dynamic study of the upper airway with ultrafast spoiled GRASS MR imaging , 1992, Journal of magnetic resonance imaging : JMRI.

[15]  Bob S. Hu,et al.  Real‐time interactive coronary MRA , 2001, Magnetic resonance in medicine.

[16]  M. Stone A three-dimensional model of tongue movement based on ultrasound and x-ray microbeam data. , 1990, The Journal of the Acoustical Society of America.

[17]  Dani Byrd,et al.  MAGNETOMETER AND X-RAY MICROBEAM COMPARISON , 1999 .

[18]  Timothy F. Cootes,et al.  The Use of Active Shape Models for Locating Structures in Medical Images , 1993, IPMI.

[19]  M H Cohen,et al.  Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. , 1992, The Journal of the Acoustical Society of America.

[20]  John N. Carter,et al.  Dynamic Magnetic Resonance Imaging: new tools for speech research , 1999 .

[21]  Shrikanth S. Narayanan,et al.  Acoustic modeling of American English , 2000 .

[22]  C A Moore,et al.  The correspondence of vocal tract resonance with volumes obtained from magnetic resonance images. , 1992, Journal of speech and hearing research.

[23]  L. Raphael,et al.  Cross-sectional tongue shape and linguopalatal contact patterns in [s], [∫], [f], and [1] , 1992 .

[24]  P. W. Nye,et al.  Analysis of vocal tract shape and dimensions using magnetic resonance imaging: vowels. , 1991, The Journal of the Acoustical Society of America.

[25]  J. Perkell Physiology of speech production: results and implications of a quantitative cineradiographic study , 1969 .

[26]  D G Nishimura,et al.  A Velocity k‐Space Analysis of Flow Effects in Echo‐Planar and Spiral Imaging , 1995, Magnetic resonance in medicine.

[27]  R C Grimm,et al.  Real‐time interactive magnetic resonance imaging , 1990, Magnetic resonance in medicine.

[28]  E. Hoffman,et al.  Vocal tract area functions from magnetic resonance imaging. , 1996, The Journal of the Acoustical Society of America.

[29]  Jianwu Dang,et al.  MRI measurements and acoustic investigation of the nasal and paranasal cavities , 1993 .

[30]  Abeer Alwan,et al.  From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in American English , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[31]  Shrikanth S. Narayanan,et al.  Geometry, kinematics, and acoustics of Tamil liquid consonants. , 1999, The Journal of the Acoustical Society of America.

[32]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[33]  C J Hardy,et al.  Real‐time interactive MRI on a conventional scanner , 1997, Magnetic resonance in medicine.

[34]  J. Pauly,et al.  A homogeneity correction method for magnetic resonance imaging with time-varying gradients. , 1991, IEEE transactions on medical imaging.

[35]  Shrikanth S. Narayanan,et al.  Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals , 1997 .

[36]  J. Schenck The role of magnetic susceptibility in magnetic resonance imaging: MRI magnetic compatibility of the first and second kinds. , 1996, Medical physics.

[37]  P. Delattre,et al.  A DIALECT STUDY OF AMERICAN R’S BY X-RAY MOTION PICTURE , 1968 .