Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images

We describe a method for unsupervised region segmentation of an image using its spatial frequency domain representation. The algorithm was designed to process large sequences of real-time magnetic resonance (MR) images containing the 2-D midsagittal view of a human vocal tract airway. The segmentation algorithm uses an anatomically informed object model, whose fit to the observed image data is hierarchically optimized using a gradient descent procedure. The goal of the algorithm is to automatically extract the time-varying vocal tract outline and the position of the articulators to facilitate the study of the shaping of the vocal tract during speech production.

[1]  Julie Fontecave Jallon,et al.  Semi-automatic extraction of vocal tract movements from cineradiographic data , 2006, INTERSPEECH.

[2]  Roland Bammer,et al.  Parallel imaging reconstruction for arbitrary trajectories using k‐space sparse matrices (kSPA) , 2007, Magnetic resonance in medicine.

[3]  M H Cohen,et al.  Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. , 1992, The Journal of the Acoustical Society of America.

[4]  José M. N. Leitão,et al.  Unsupervised contour representation and estimation using B-splines and a minimum description length criterion , 2000, IEEE Trans. Image Process..

[5]  C.-C. Jay Kuo,et al.  Wavelet descriptor of planar curves: theory and applications , 1996, IEEE Trans. Image Process..

[6]  Olivier D. Faugeras,et al.  Designing spatially coherent minimizing flows for variational problems based on active contours , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[7]  Demetri Terzopoulos,et al.  Deformable models in medical image analysis: a survey , 1996, Medical Image Anal..

[8]  Maureen C. Stone Toward a model of three-dimensional tongue movement , 1991 .

[9]  K. McInturff,et al.  The Fourier transform of linearly varying functions with polygonal support , 1991 .

[10]  G. Golub,et al.  Separable nonlinear least squares: the variable projection method and its applications , 2003 .

[11]  Natalia A. Schmid,et al.  Complexity regularized shape estimation from noisy Fourier data , 2002, Proceedings. International Conference on Image Processing.

[12]  Shrikanth S. Narayanan,et al.  Semi-Automatic Processing of Real-time MR Image Sequences for Speech Production Studies , 2006 .

[13]  Raj Mittra,et al.  Fourier transform of a polygonal shape function and its application in electromagnetics , 1983 .

[14]  Shrikanth Narayanan,et al.  Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans. , 2006, The Journal of the Acoustical Society of America.

[15]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[16]  Jean-Philippe Pons,et al.  Generalized Surface Flows for Mesh Processing , 2007 .

[17]  Shinobu Masaki,et al.  Measurement of temporal changes in vocal tract area function from 3D cine-MRI data. , 2006, The Journal of the Acoustical Society of America.

[18]  Eric Vatikiotis-Bateson,et al.  The Haskins optically corrected ultrasound system (HOCUS). , 2005, Journal of speech, language, and hearing research : JSLHR.

[19]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[20]  Chi-Fang Huang,et al.  On the calculation of the Fourier transform of a polygonal shape function , 1989 .

[21]  W S Levine,et al.  Modeling tongue surface contours from Cine-MRI images. , 2001, Journal of speech, language, and hearing research : JSLHR.

[22]  L Saltzman Elliot,et al.  A Dynamical Approach to Gestural Patterning in Speech Production , 1989 .

[23]  Paul W. Fieguth,et al.  A functional articulatory dynamic model for speech production , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).