Subject-specific biomechanical modelling of the oropharynx with application to speech production

In this work, we develop a 3D subject-specific biomechanical model of the oropharynx in order to investigate and simulate speech production. Our muscle-activated model is generated based on the subject-specific anatomy captured from dynamic volumetric cine-MRI data. Our model includes an air-tight deformable airway that enables speech synthesis. We simulate our model based on actual tissue motion tracked from the tongue during speech production, which we extract from the tagged-MRI data. We quantitatively validate our model on MRI data achieving an average target point tracking error of 1.15mm ± 0.632, and an acoustic formant frequency estimation error of 6.01% ± 4.92%.

[1]  Tom Vercauteren,et al.  Diffeomorphic demons: Efficient non-parametric image registration , 2009, NeuroImage.

[2]  Yohan Payan,et al.  A fast and robust patient specific Finite Element mesh registration technique: Application to 60 clinical cases , 2010, Medical Image Anal..

[3]  Ghassan Hamarneh,et al.  Active Learning for Interactive 3D Image Segmentation , 2011, MICCAI.

[4]  P. Perrier,et al.  A biomechanical model of cardinal vowel production: muscle activations and the impact of gravity on tongue positioning. , 2009, The Journal of the Acoustical Society of America.

[5]  Ian Stavness,et al.  Unified skinning of rigid and deformable models for anatomical simulations , 2014, SIGGRAPH ASIA Technical Briefs.

[6]  Jerry L. Prince,et al.  3D Tongue Motion from Tagged and Cine MR Images , 2013, MICCAI.

[7]  Sidney Fels,et al.  Subject-Specific Biomechanical Modelling of the Tongue : Analysis of Muscle Activations During Speech , 2013 .

[8]  Ian Stavness,et al.  Automatic Prediction of Tongue Muscle Activations Using a Finite Element Model , 2022 .

[9]  Andriy Myronenko,et al.  Point Set Registration: Coherent Point Drift , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Yohan Payan,et al.  A 3D dynamical biomechanical tongue model to study speech motor control , 2006, physics/0606148.

[11]  Uri M. Ascher,et al.  Real-Time Numerical Solution of Webster's Equation on A Nonuniform Grid , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  R. E. English,et al.  Towards Articulatory Speech Synthesis with a Dynamic 3 D Finite Element Tongue Model , 2006 .

[13]  Yohan Payan,et al.  Coupled hard–soft tissue simulation with contact and constraints applied to jaw–tongue–hyoid dynamics , 2011 .

[14]  S Adachi,et al.  Clinical Science , 1982, Journal of dental research.

[15]  Jerry L. Prince,et al.  Imaging heart motion using harmonic phase MRI , 2000, IEEE Transactions on Medical Imaging.

[16]  S. Delp,et al.  A 3D model of muscle reveals the causes of nonuniform strains in the biceps brachii. , 2005, Journal of biomechanics.