Sensitivity of Quantitative RT-MRI Metrics of Vocal Tract Dynamics to Image Reconstruction Settings

Real-time Magnetic Resonance Imaging (RT-MRI) is a powerful method for quantitative analysis of speech. Current state-of-the-art methods use constrained reconstruction to achieve high frame rates and spatial resolution. The reconstruction involves two free parameters that can be retrospectively selected: 1) the temporal resolution and 2) the regularization parameter λ, which balances temporal regularization and fidelity to the collected MRI data. In this work, we study the sensitivity of derived quantitative measures of vocal tract function to these two parameters. Specifically, the cross-distance between the tongue tip and the alveolar ridge was investigated for different temporal resolutions (21, 42, 56 and 83 frames per second) and values of the regularization parameter. Data from one subject is included. The phrase ‘one two three four five’ was repeated 8 times at a normal pace. The results show that 1) a high regularization factor leads to lower cross-distance values 2) using a low value for the regularization parameter gives poor reproducibility and 3) a temporal resolution of at least 42 frames per second is desirable to achieve good reproducibility for all utterances in this speech task. The process employed here can be generalized to quantitative imaging of the vocal tract and other body parts.

[1]  Shrikanth Narayanan,et al.  Are Articulatory Settings Mechanically Advantageous for Speech Motor Control? , 2014, PloS one.

[2]  Per Christian Hansen,et al.  Analysis of Discrete Ill-Posed Problems by Means of the L-Curve , 1992, SIAM Rev..

[3]  Shrikanth S. Narayanan,et al.  Enhanced airway-tissue boundary segmentation for real-time magnetic resonance imaging data , 2014 .

[4]  Shrikanth S. Narayanan,et al.  Timing effects of syllable structure and stress on nasals: A real-time MRI examination , 2009, J. Phonetics.

[5]  Shrikanth S. Narayanan,et al.  Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images , 2009, IEEE Transactions on Medical Imaging.

[6]  Jonghye Woo,et al.  Tongue motion patterns in post-glossectomy and typical speakers: a principal components analysis. , 2014, Journal of speech, language, and hearing research : JSLHR.

[7]  Shrikanth S. Narayanan,et al.  Characterizing Post-Glossectomy Speech Using Real-time MRI , 2013 .

[8]  Marc E Miquel,et al.  Recommendations for real‐time speech MRI , 2016, Journal of magnetic resonance imaging : JMRI.

[9]  Jens Frahm,et al.  Real-time MRI comparisons of brass players: A methodological pilot study. , 2015, Human movement science.

[10]  Shrikanth S. Narayanan,et al.  Real-time magnetic resonance imaging investigation of resonance tuning in soprano singing. , 2010, The Journal of the Acoustical Society of America.

[11]  Shrikanth Narayanan,et al.  A fast and flexible MRI system for the study of dynamic vocal tract shaping , 2017, Magnetic resonance in medicine.

[12]  Shrikanth S. Narayanan,et al.  Evaluation of swallow function after tongue cancer treatment using real-time magnetic resonance imaging: a pilot study. , 2013, JAMA otolaryngology-- head & neck surgery.

[13]  Shrikanth S. Narayanan,et al.  Data-driven analysis of realtime vocal tract MRI using correlated image regions , 2010, INTERSPEECH.

[14]  Shrikanth Narayanan,et al.  Paralinguistic mechanisms of production in human "beatboxing": a real-time magnetic resonance imaging study. , 2013, The Journal of the Acoustical Society of America.

[15]  Athanasios Katsamanis,et al.  Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences , 2011, INTERSPEECH.

[16]  R. Boubertakh,et al.  Towards clinical assessment of velopharyngeal closure using MRI: evaluation of real-time MRI sequences at 1.5 and 3 T. , 2012, The British journal of radiology.

[17]  Caitlin Smith Complex Tongue Shaping in Lateral Liquid Production Without Constriction-Based Goals , 2014 .

[18]  Shrikanth S. Narayanan,et al.  Velic coordination in French nasals: a real-time magnetic resonance imaging study , 2013, INTERSPEECH.

[19]  Shrikanth S. Narayanan,et al.  An analysis of vocal tract shaping in English sibilant fricatives using real-time magnetic resonance imaging , 2008, INTERSPEECH.

[20]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[21]  Athanasios Katsamanis,et al.  Statistical multi-stream modeling of real-time MRI articulatory speech data , 2010, INTERSPEECH.