Robust tongue tracking in ultrasound images: a multi-hypothesis approach

Ultrasound (US) imaging is an excellent means of observing tongue motion during speech. Tracking the tongue contour in US video is required for analysis of this motion, but most currently available techniques suffer from either a lack of temporal consistency or a lack of robustness to difficult conditions such as a rapidly deforming tongue or momentarily poor image quality. This paper proposes a new algorithm combining active contours, active shape models and particle filtering that addresses these shortcomings. The strength of this approach lies in the fact that it maintains multiple tongue shape hypotheses simultaneously. Experimental results show that this approach outperforms a classic active contour algorithm as well as a shape-constrained variant thereof, particularly in difficult tracking conditions.

[1]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[3]  M. Stone A guide to analysing tongue motion from ultrasound images , 2005, Clinical linguistics & phonetics.

[4]  Ian R. Fasel,et al.  Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech , 2010, 2010 20th International Conference on Pattern Recognition.

[5]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[6]  Lisa Tang,et al.  Tongue Contour Tracking in Dynamic Ultrasound via Higher-order Mrfs and Ecient Fusion Moves , 2012 .

[7]  G. Hamarneh,et al.  Combining snakes and active shape models for segmenting the human left ventricle in echocardiographic images , 2000, Computers in Cardiology 2000. Vol.27 (Cat. 00CH37163).

[8]  Petros Maragos,et al.  Tongue tracking in Ultrasound images with Active Appearance Models , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[9]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[10]  C. Kambhamettu,et al.  Automatic contour tracking in ultrasound images , 2005, Clinical linguistics & phonetics.

[11]  Chandra Kambhamettu,et al.  Automatic extraction and tracking of the tongue contours , 1999, IEEE Transactions on Medical Imaging.

[12]  Catherine Laporte,et al.  Statistical shape analysis in ultrasound video sequences: tongue tracking and population analysis , 2013 .

[13]  Lucie Ménard,et al.  Interactive segmentation of tongue contours in ultrasound video sequences using quality maps , 2014, Medical Imaging.

[14]  Wolfram Burgard,et al.  Monte Carlo Localization: Efficient Position Estimation for Mobile Robots , 1999, AAAI/IAAI.