Segmentation of tongue shapes during vowel production in magnetic resonance images based on statistical modelling

Quantification of the anatomic and functional aspects of the tongue is pertinent to analyse the mechanisms involved in speech production. Speech requires dynamic and complex articulation of the vocal tract organs, and the tongue is one of the main articulators during speech production. Magnetic resonance imaging has been widely used in speech-related studies. Moreover, the segmentation of such images of speech organs is required to extract reliable statistical data. However, standard solutions to analyse a large set of articulatory images have not yet been established. Therefore, this article presents an approach to segment the tongue in two-dimensional magnetic resonance images and statistically model the segmented tongue shapes. The proposed approach assesses the articulator morphology based on an active shape model, which captures the shape variability of the tongue during speech production. To validate this new approach, a dataset of mid-sagittal magnetic resonance images acquired from four subjects was used, and key aspects of the shape of the tongue during the vocal production of relevant European Portuguese vowels were evaluated.

[1]  Carl-Fredrik Westin,et al.  Efficient and robust nonlocal means denoising of MR data based on salient features matching , 2012, Comput. Methods Programs Biomed..

[2]  Dorin Comaniciu,et al.  Marginal Space Deep Learning: Efficient Architecture for Volumetric Image Parsing , 2016, IEEE Transactions on Medical Imaging.

[3]  Marie-Odile Berger,et al.  A guided approach for automatic segmentation and modeling of the vocal tract in MRI images , 2011, 2011 19th European Signal Processing Conference.

[4]  Nicola A Miller,et al.  Using active shape modeling based on MRI to study morphologic and pitch-related functional changes affecting vocal structures and the airway. , 2014, Journal of voice : official journal of the Voice Foundation.

[5]  Georg Thimm,et al.  Tracking Articulators in X-ray Movies of the Vocal Tract , 1999, CAIP.

[6]  T. Nazzi,et al.  Phonetic processing when learning words , 2016 .

[7]  Jianwu Dang,et al.  Tongue shape synthesis based on Active Shape Model , 2012, 2012 8th International Symposium on Chinese Spoken Language Processing.

[8]  Ting Peng,et al.  A shape-based framework to segmentation of tongue contours from MRI data , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  João Manuel R. S. Tavares,et al.  ANALYSIS OF TONGUE SHAPE AND MOTION IN SPEECH PRODUCTION USING STATISTICAL MODELING , 2009 .

[10]  Pierre Badin,et al.  Three-dimensional linear modeling of tongue: Articulatory data and models , 2006 .

[11]  Ahmed S. Fahmy,et al.  Active Shape Model with Inter-profile Modeling Paradigm for Cardiac Right Ventricle Segmentation , 2012, MICCAI.

[12]  Yang Wang,et al.  Extraction of tongue contour in real-time magnetic resonance imaging sequences , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  William S. Levine,et al.  Controlling the shape of a muscular hydrostat : A tongue or tentacle , 2005 .

[14]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[15]  K G Munhall,et al.  Functional imaging during speech production. , 2001, Acta psychologica.

[16]  Kamel Hamrouni,et al.  Statistical models of shape and spatial relation-application to hippocampus segmentation , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[17]  B C Sonies,et al.  Ultrasonic visualization of tongue motion during speech. , 1981, The Journal of the Acoustical Society of America.

[18]  C. Taylor,et al.  Active shape models - 'Smart Snakes'. , 1992 .

[19]  João Manuel R S Tavares,et al.  Inter-speaker speech variability assessment using statistical deformable models from 3.0 Tesla magnetic resonance images , 2012, Proceedings of the Institution of Mechanical Engineers. Part H, Journal of engineering in medicine.

[20]  Shinobu Masaki,et al.  Measurement of temporal changes in vocal tract area function from 3D cine-MRI data. , 2006, The Journal of the Acoustical Society of America.

[21]  João Manuel R S Tavares,et al.  Morphologic differences in the vocal tract resonance cavities of voice professionals: an MRI-based study. , 2013, Journal of voice : official journal of the Voice Foundation.

[22]  João Manuel R. S. Tavares,et al.  Speaker-specific articulatory assessment and measurements during Portuguese speech production based on Magnetic Resonance Images , 2012 .