Automatic segmentation of speech articulators from real-time midsagittal MRI based on supervised learning

[1]  Gérard Bailly,et al.  Acoustic-to-articulatory inversion using speech recognition and trajectory formation based on phoneme hidden Markov models , 2009, INTERSPEECH.

[2]  B. Lindblom,et al.  Acoustical consequences of lip, tongue, jaw, and larynx movement. , 1970, The Journal of the Acoustical Society of America.

[3]  Pierre Badin,et al.  A Comparative Study of the Precision of Carstens and Northern Digital Instruments Electromagnetic Articulographs. , 2017, Journal of speech, language, and hearing research : JSLHR.

[4]  Jens Frahm,et al.  Real‐time MRI at a resolution of 20 ms , 2010, NMR in biomedicine.

[5]  Pierre Badin Fricative consonants: acoustic and X-ray measurements , 1991 .

[6]  Ian R. Fasel,et al.  Deep Belief Networks for Real-Time Extraction of Tongue Contours from Ultrasound During Speech , 2010, 2010 20th International Conference on Pattern Recognition.

[7]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[8]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[9]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[10]  Pascal Perrier,et al.  A theory of speech motor control and supporting data from speakers with normal hearing and with profound hearing loss , 2000, J. Phonetics.

[11]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[12]  Jens Frahm,et al.  Real‐time MRI of swallowing: intraoral pressure reduction supports larynx elevation , 2016, NMR in biomedicine.

[13]  Shrikanth S. Narayanan,et al.  Enhanced airway-tissue boundary segmentation for real-time magnetic resonance imaging data , 2014 .

[14]  Thomas Hueber,et al.  Tongue tracking in ultrasound images using eigentongue decomposition and artificial neural networks , 2015, INTERSPEECH.

[15]  Jens Frahm,et al.  Real‐time MRI of speaking at a resolution of 33 ms: Undersampled radial FLASH with nonlinear inverse reconstruction , 2013, Magnetic resonance in medicine.

[16]  C. Kambhamettu,et al.  Automatic contour tracking in ultrasound images , 2005, Clinical linguistics & phonetics.

[17]  R. Schweizer,et al.  On the Physiology of Normal Swallowing as Revealed by Magnetic Resonance Imaging in Real Time , 2014, Gastroenterology research and practice.

[18]  Jens Frahm,et al.  On the temporal fidelity of nonlinear inverse reconstructions for real- time MRI – The motion challenge. , 2014 .

[19]  Jun Wang,et al.  Individual articulator's contribution to phoneme production , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[20]  G. Bailly,et al.  Linear degrees of freedom in speech production: analysis of cineradio- and labio-film data and articulatory-acoustic modeling. , 2001, The Journal of the Acoustical Society of America.

[21]  Jens Frahm,et al.  High-speed real-time magnetic resonance imaging of fast tongue movements in elite horn players. , 2015, Quantitative imaging in medicine and surgery.

[22]  P. Ladefoged,et al.  The sounds of the world's languages , 1996 .

[23]  K. Hiiemae,et al.  Tongue movements in feeding and speech. , 2003, Critical reviews in oral biology and medicine : an official publication of the American Association of Oral Biologists.

[24]  Gérard Bailly,et al.  Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images , 2002, J. Phonetics.

[25]  Yohan Payan,et al.  Speech biomechanics: What have we learned and modeled since Joseph Perkell’s tongue model In 1974? , 2016 .

[26]  Louis-Jean Boë,et al.  Suivi de contours d’articulateurs orofaciaux à partir d’IRM dynamique (Orofacial articulators tracking from dynamic MRI)[In French] , 2016, JEPTALNRECITAL.

[27]  T. Hueber,et al.  Analyse du conduit vocal par imagerie ultrasonore , 2009 .

[28]  Gunnar Fant,et al.  Acoustic Theory Of Speech Production , 1960 .

[29]  S E Bishara,et al.  A computer assisted photogrammetric analysis of soft tissue changes after orthodontic treatment. Part I: Methodology and reliability. , 1995, American journal of orthodontics and dentofacial orthopedics : official publication of the American Association of Orthodontists, its constituent societies, and the American Board of Orthodontics.

[30]  Johan Sundberg,et al.  Professional Opera Tenors’ Vocal Tract Configurations in Registers , 2010, Folia Phoniatrica et Logopaedica.

[31]  Péla Simon,et al.  Les consonnes françaises : mouvements et positions articulatoires à la lumière de la radiocinématographie , 1967 .

[32]  Kiyoshi Honda,et al.  ACOUSTICS2008/1772 Acoustic characteristics of solid vocal tracts modeled from ATR MRI database of Japanese vowel production , 2008 .

[33]  António J. S. Teixeira,et al.  Unsupervised segmentation of the vocal tract from real-time MRI sequences , 2015, Comput. Speech Lang..

[34]  Martin Styner,et al.  Comparison and Evaluation of Methods for Liver Segmentation From CT Datasets , 2009, IEEE Transactions on Medical Imaging.

[35]  Shrikanth Narayanan,et al.  A fast and flexible MRI system for the study of dynamic vocal tract shaping , 2017, Magnetic resonance in medicine.

[36]  K. Moll,et al.  Cinefluorgraphic techniques in speech research. , 1960, Journal of speech and hearing research.

[37]  M H Cohen,et al.  Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. , 1992, The Journal of the Acoustical Society of America.

[38]  Brad H. Story,et al.  Parameterization of vocal tract area functions by empirical orthogonal modes , 1998 .

[39]  Jens Frahm,et al.  Real-time MRI: recent advances using radial FLASH. , 2012 .

[40]  Shrikanth S. Narayanan,et al.  Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images , 2009, IEEE Transactions on Medical Imaging.

[41]  Athanasios Katsamanis,et al.  Rapid semi-automatic segmentation of real-time magnetic resonance images for parametric vocal tract analysis , 2010, INTERSPEECH.

[42]  Didier Demolin,et al.  Segmentation of the airway from the surrounding tissues on magnetic resonance images: a comparative study , 1998, ICSLP.

[43]  R. Sokal,et al.  THE COMPARISON OF DENDROGRAMS BY OBJECTIVE METHODS , 1962 .

[44]  Maureen Stone,et al.  Robust contour tracking in ultrasound tongue image sequences , 2016, Clinical linguistics & phonetics.

[45]  Kiyoshi Honda,et al.  A method of tooth superimposition on MRI data for accurate measurement of vocal tract shape and dimensions , 2004 .

[46]  Hamid Seifoddini,et al.  Single linkage versus average linkage clustering in machine cells formation applications , 1989 .

[47]  Pierre Badin,et al.  Development and implementation of fiducial markers for vocal tract MRI imaging and speech articulatory modelling , 2013, INTERSPEECH.

[48]  Philip Hoole,et al.  Electromagnetic articulography in coarticulation research , 1997 .

[49]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[50]  Shrikanth S. Narayanan,et al.  Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis , 2013, INTERSPEECH.

[51]  Shinobu Masaki,et al.  MRI-based speech production study using a synchronized sampling method , 1999 .

[52]  Julie Fontecave Jallon,et al.  A semi-automatic method for extracting vocal tract movements from X-ray films , 2009, Speech Commun..

[53]  Cornelis H. Slump,et al.  MRI modalitiy transformation in demon registration , 2009, 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[54]  Pierre Badin,et al.  Three-dimensional modeling of speech organs: Articulatory data and models , 2006 .

[55]  A. Serrurier,et al.  A three-dimensional articulatory model of the velum and nasopharyngeal wall based on MRI and CT data. , 2008, The Journal of the Acoustical Society of America.

[56]  Shinobu Masaki,et al.  Integrated magnetic resonance imaging methods for speech science and technology , 2008 .

[57]  Manuel Guizar-Sicairos,et al.  Efficient subpixel image registration algorithms. , 2008, Optics letters.

[58]  J. Rokkaku,et al.  Measurements of the three-dimensional shape of the vocal tract based on the magnetic resonance imaging technique , 1986 .

[59]  Gérard Chollet,et al.  Eigentongue Feature Extraction for an Ultrasound-Based Silent Speech Interface , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[60]  Athanasios Katsamanis,et al.  Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences , 2011, INTERSPEECH.

[61]  Phil Hoole,et al.  Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup , 2013, J. Phonetics.

[62]  P. Mermelstein Articulatory model for the study of speech production. , 1973, The Journal of the Acoustical Society of America.

[63]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[64]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[65]  António J. S. Teixeira,et al.  Critical Articulators Identification from RT-MRI of the Vocal Tract , 2017, INTERSPEECH.

[66]  Laurent Girin,et al.  Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[67]  Philip J. B. Jackson,et al.  Statistical identification of articulation constraints in the production of speech , 2009, Speech Commun..

[68]  Shigeru Kiritani,et al.  X-ray microbeam method for measurement of articulatory dynamics-techniques and results , 1986, Speech Commun..

[69]  Tony F. Chan,et al.  Active contours without edges , 2001, IEEE Trans. Image Process..

[70]  Pascal Perrier,et al.  Gesture planning integrating knowledge of the motor plant's dynamics: A literature review from motor control and speech motor control , 2012 .

[71]  Shinji Maeda,et al.  Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model , 1990 .

[72]  C A Fowler,et al.  Coordination and Coarticulation in Speech Production , 1993, Language and speech.

[73]  Peter Birkholz,et al.  A Gesture-Based Concept for Speech Movement Control in Articulatory Speech Synthesis , 2007, COST 2102 Workshop.

[74]  P. Perrier,et al.  A biomechanical modeling study of the effects of the orbicularis oris muscle and jaw posture on lip shape. , 2013, Journal of speech, language, and hearing research : JSLHR.

[75]  Louis-Jean Boë,et al.  Tracking Contours of Orofacial Articulators from Real-Time MRI of Speech , 2016, INTERSPEECH.

[76]  Marc E Miquel,et al.  Recommendations for real‐time speech MRI , 2016, Journal of magnetic resonance imaging : JMRI.

[77]  Thomas H. Shawker,et al.  Distinguisability of tongue shape during vowel production , 1985 .

[78]  George H. Weiss,et al.  Analysis of real-time ultrasound images of tongue configuration using a grid-digitizing system , 1983 .

[79]  Arne Kjell Foldvik,et al.  MRI (magnetic resonance imaging) film of articulatory movements , 1990, ICSLP.

[80]  Timothy F. Cootes,et al.  Active Shape Models-Their Training and Application , 1995, Comput. Vis. Image Underst..

[81]  Shrikanth Narayanan,et al.  Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC). , 2014, The Journal of the Acoustical Society of America.

[82]  Marleen de Bruijne,et al.  Shape Particle Filtering for Image Segmentation , 2004, MICCAI.

[83]  Louis-Jean Boë,et al.  The tongue in speech and feeding: Comparative articulatory modelling , 2012, J. Phonetics.

[84]  Keiichi Tokuda,et al.  Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model , 2008, Speech Commun..

[85]  Marie-Odile Berger,et al.  A guided approach for automatic segmentation and modeling of the vocal tract in MRI images , 2011, 2011 19th European Signal Processing Conference.