Tongue Contour Tracking in Dynamic Ultrasound via Higher-order Mrfs and Ecient Fusion Moves

Analyses of the human tongue motion as captured from 2D dynamic ultrasound data often requires segmentation of the mid-sagittal tongue contours. However, semi-automatic extraction of the tongue shape presents practical challenges. We approach this segmentation problem by proposing a novel higher-order Markov random field energy minimization framework. For efficient energy minimization, we propose two novel schemes to sample the solution space efficiently. To cope with the unpredictable tongue motion dynamics, we also propose to temporally adapt regularization based on contextual information. Unlike previous methods, we employ the latest optimization techniques to solve the tracking problem under one unified framework. Our method was validated on a set of 63 clinical data sequences, which allowed for comparative analyses with three other competing methods. Experimental results demonstrate that our method can segment sequences containing over 500 frames with mean accuracy of 3mm, approaching the accuracy of manual segmentations created by trained clinical observers.

[1]  Kashif Rajpoot,et al.  Local-phase based 3D boundary detection using monogenic signal and its application to real-time 3-D echocardiography images , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[2]  Purang Abolmaesumi,et al.  A real-time intrasubject elastic registration algorithm for dynamic 2-D ultrasound images. , 2009, Ultrasound in medicine & biology.

[3]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[4]  D. Boukerroui,et al.  Phase based level set segmentation of ultrasound images , 2009, 2009 9th International Conference on Information Technology and Applications in Biomedicine.

[5]  Stefan Klein,et al.  Nonrigid registration of dynamic medical imaging data using nD + t B-splines and a groupwise optimization approach , 2011, Medical Image Anal..

[6]  Chandra Kambhamettu,et al.  A Coarse-to-Fine Deformable Contour Optimization Framework , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  J. Alison Noble,et al.  Ultrasound image segmentation: a survey , 2006, IEEE Transactions on Medical Imaging.

[8]  Peter Kovesi,et al.  Image Features from Phase Congruency , 1995 .

[9]  Purang Abolmaesumi,et al.  An interacting multiple model probabilistic data association filter for cavity boundary extraction from ultrasound images , 2004, IEEE Transactions on Medical Imaging.

[10]  Olov Engwall,et al.  Combining MRI, EMA and EPG measurements in a three-dimensional tongue model , 2003, Speech Commun..

[11]  Vladimir Kolmogorov,et al.  Optimizing Binary MRFs via Extended Roof Duality , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Luigi P. Badano,et al.  Assessment of diastolic function , 2011 .

[13]  Purang Abolmaesumi,et al.  Real-time extraction of carotid artery contours from ultrasound images , 2000, Proceedings 13th IEEE Symposium on Computer-Based Medical Systems. CBMS 2000.

[14]  Jonathan C Irish,et al.  Increased midsagittal tongue velocity as indication of articulatory compensation in patients with lateral partial glossectomies , 2008, Head & neck.

[15]  Xiaobo Li,et al.  Boundary Point Detection for Ultrasound Image Segmentation using Gumbel Distributions , 2018, SIGMAP.

[16]  Its'hak Dinstein,et al.  New maximum likelihood motion estimation schemes for noisy ultrasound images , 2002, Pattern Recognit..

[17]  Y N Sun,et al.  A self-learning segmentation framework--the Taguchi approach. , 2000, Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society.

[18]  Peter Kovesi,et al.  MATLAB Functions for Computer Vision and Image Analysis , 2004 .

[19]  Dorin Comaniciu,et al.  3D ultrasound tracking of the left ventricle using one-step forward prediction and data fusion of collaborative trackers , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  R. Gilbert,et al.  Quantitative three‐dimensional ultrasound analysis of tongue protrusion, grooving and symmetry: Data from 12 normal speakers and a partial glossectomee , 2005, Clinical linguistics & phonetics.

[21]  Timothy F. Cootes,et al.  Active Appearance Models , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Petros Maragos,et al.  Multimodality acquisition of articulatory data and processing , 2008, 2008 16th European Signal Processing Conference.

[23]  Robert Rohling,et al.  Bone Segmentation and Fracture Detection in Ultrasound Using 3D Local Phase Features , 2008, MICCAI.

[24]  Hiroshi Ishikawa,et al.  Transformation of General Binary MRF Minimization to the First-Order Case , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Ghassan Hamarneh,et al.  Medial-Based Deformable Models in Nonconvex Shape-Spaces for Medical Image Segmentation , 2012, IEEE Transactions on Medical Imaging.

[26]  Michalis A. Savelonas,et al.  A genetically optimized level set approach to segmentation of thyroid ultrasound images , 2007, Applied Intelligence.

[27]  Ian R. Fasel,et al.  Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech , 2010, 2010 20th International Conference on Pattern Recognition.

[28]  D. Adam,et al.  Automatic ventricular cavity boundary detection from sequential ultrasound images using simulated annealing. , 1989, IEEE transactions on medical imaging.

[29]  Chandra Kambhamettu,et al.  Automatic extraction and tracking of the tongue contours , 1999, IEEE Transactions on Medical Imaging.

[30]  Jérôme Aubin Compensation for a labial perturbation : An acoustic and articulatory study of child and adult French speakers , 2006 .

[31]  Miguel Á. Carreira-Perpiñán,et al.  Predicting tongue shapes from a few landmark locations , 2008, INTERSPEECH.

[32]  Leonardo Bocchi,et al.  Segmentation of Ultrasound Breast Images: Optimization of Algorithm Parameters , 2011, EvoApplications.

[33]  Andrew Blake,et al.  Fusion Moves for Markov Random Field Optimization , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Vladimir Kolmogorov,et al.  Minimizing Nonsubmodular Functions with Graph Cuts-A Review , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Radim Sára,et al.  A Weak Structure Model for Regular Pattern Recognition Applied to Facade Images , 2010, ACCV.

[36]  C. Kambhamettu,et al.  Automatic contour tracking in ultrasound images , 2005, Clinical linguistics & phonetics.

[37]  Michael Brady,et al.  Velocity Estimation in Ultrasound Images: A Block Matching Approach , 2003, IPMI.

[38]  James S. Duncan,et al.  A coupled deformable model for tracking myocardial borders from real-time echocardiography using an incompressibility constraint , 2010, Medical Image Anal..

[39]  Nassir Navab,et al.  Dense image registration through MRFs and efficient linear programming , 2008, Medical Image Anal..

[40]  Lisa Tang,et al.  Graph-based tracking of the tongue contour in ultrasound sequences with adaptive temporal regularization , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[41]  Andrew Blake,et al.  Evaluating a robust contour tracker on echocardiographic sequences , 1999, Medical Image Anal..

[42]  J. Revell,et al.  Combined Ultrasound Speckle Pattern Similarity Measures , 2004 .

[43]  Ahror Belaid,et al.  Phase based level set segmentation of ultrasound images , 2009, 2009 9th International Conference on Information Technology and Applications in Biomedicine.

[44]  G. Hamarneh,et al.  Medial-based Deformable Models in Non-convex Shape-spaces for Medical Image Segmentation using Genetic Algorithms , 2011 .

[45]  Max A. Viergever,et al.  elastix: A Toolbox for Intensity-Based Medical Image Registration , 2010, IEEE Transactions on Medical Imaging.

[46]  T. Mackie,et al.  Fast free-form deformable registration via calculus of variations , 2004, Physics in medicine and biology.

[47]  Stefan A. Frisch,et al.  Semiautomatic measures of velar stop closure location using the EdgeTrak software. , 2008 .

[48]  Min Li,et al.  A Level Set Approach for Shape Recovery of Open Contours , 2006, ACCV.

[49]  Mert R. Sabuncu,et al.  Learning Task-Optimal Registration Cost Functions for Localizing Cytoarchitecture and Function in the Cerebral Cortex , 2010, IEEE Transactions on Medical Imaging.

[50]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  K. Kirk Shung,et al.  Non-Rigid Ultrasound Image Registration Based on Intensity and Local Phase Information , 2008, J. Signal Process. Syst..

[52]  Ghassan Hamarneh,et al.  Optimal Weights for Convex Functionals in Medical Image Segmentation , 2009, ISVC.

[53]  Daniel Recasens,et al.  Lingual kinematics and coarticulation for alveolopalatal and velar consonants in Catalan. , 2010, The Journal of the Acoustical Society of America.

[54]  Prisca Stenneken,et al.  Analysis of vowel-consonant-vowel sequences in patients with partial glossectomies using 2D ultrasound imaging , 2008 .

[55]  Michael Felsberg,et al.  The monogenic signal , 2001, IEEE Trans. Signal Process..

[56]  Qi Duan,et al.  Comparing optical-flow based methods for quantification of myocardial deformations on RT3D ultrasound , 2006, 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro, 2006..

[57]  Lisa Tang,et al.  Reliability-Driven, Spatially-Adaptive Regularization for Deformable Registration , 2010, WBIR.

[58]  Mikkel B. Stegmann,et al.  Active appearance models: Theory and cases , 2000 .

[59]  Jun Xie,et al.  Segmentation of kidney from ultrasound images based on texture and shape priors , 2005, IEEE Transactions on Medical Imaging.

[60]  Karl Espen Nielsen,et al.  Practical use of Block-Matching in 3D Speckle Tracking , 2009 .

[61]  Khalil Iskarous Detecting the edge of the tongue: A tutorial , 2005, Clinical linguistics & phonetics.

[62]  Petros Maragos,et al.  Tongue tracking in Ultrasound images with Active Appearance Models , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).