Paper Information - Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images

We describe a method for unsupervised region segmentation of an image using its spatial frequency domain representation. The algorithm was designed to process large sequences of real-time magnetic resonance (MR) images containing the 2-D midsagittal view of a human vocal tract airway. The segmentation algorithm uses an anatomically informed object model, whose fit to the observed image data is hierarchically optimized using a gradient descent procedure. The goal of the algorithm is to automatically extract the time-varying vocal tract outline and the position of the articulators to facilitate the study of the shaping of the vocal tract during speech production.
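The core idea in the abstract, fitting a parametric object model to an image by minimizing a mismatch measured in the spatial frequency domain with gradient descent, can be illustrated with a toy problem. The sketch below is not the authors' algorithm: it assumes a single Gaussian blob in place of the anatomically informed model, estimates only the blob center, and uses one optimization level instead of a hierarchy. All function names, the blob width, the step size, and the noise level are illustrative assumptions.

```python
import numpy as np

def model_spectrum(center, sigma, n):
    """Approximate 2-D DFT of an n x n Gaussian blob centered at (cx, cy)."""
    f = np.fft.fftfreq(n)                      # frequencies in cycles per pixel
    ky, kx = np.meshgrid(f, f, indexing="ij")  # axis 0 is y, axis 1 is x
    # Spectrum of a blob at the origin (continuous-transform approximation).
    base = 2 * np.pi * sigma**2 * np.exp(-2 * np.pi**2 * sigma**2 * (kx**2 + ky**2))
    # Shift theorem: moving the blob multiplies its spectrum by a phase ramp.
    phase = np.exp(-2j * np.pi * (kx * center[0] + ky * center[1]))
    return base * phase, kx, ky

def fit_center(image, sigma, init, lr=0.2, steps=200):
    """Gradient descent on the squared spectral residual (scaled by 1/pixels)."""
    n = image.shape[0]                         # assumes a square image
    observed = np.fft.fft2(image)
    c = np.asarray(init, dtype=float)
    for _ in range(steps):
        model, kx, ky = model_spectrum(c, sigma, n)
        resid = observed - model
        # d(model)/d(cx) = -2*pi*i*kx*model, so d(resid)/d(cx) = +2*pi*i*kx*model
        common = np.conj(resid) * 2j * np.pi * model
        grad = 2.0 / image.size * np.array([np.sum(common * kx).real,
                                            np.sum(common * ky).real])
        c = c - lr * grad
    return c

def demo():
    """Synthesize a noisy blob image and recover its center (x, y)."""
    rng = np.random.default_rng(0)
    n, sigma, true_c = 64, 3.0, (30.3, 34.7)
    yy, xx = np.mgrid[0:n, 0:n]
    image = np.exp(-((xx - true_c[0])**2 + (yy - true_c[1])**2) / (2 * sigma**2))
    image = image + 0.05 * rng.standard_normal(image.shape)
    return fit_center(image, sigma, init=(32.0, 32.0))

if __name__ == "__main__":
    print(demo())
```

Because the model spectrum is available in closed form, the gradient with respect to the center is also analytic, so the model never has to be rasterized into pixel space during the fit. The starting guess here lies within about one blob width of the true center; a starting point much farther away would yield a very small gradient and slow progress.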

Shrikanth S. Narayanan | Erik Bresch
