An investigation of articulatory setting using real-time magnetic resonance imaging.

This paper presents an automatic procedure to analyze articulatory setting in speech production using real-time magnetic resonance imaging of the moving human vocal tract. The procedure extracts frames corresponding to inter-speech pauses, speech-ready intervals and absolute rest intervals from magnetic resonance imaging sequences of read and spontaneous speech elicited from five healthy speakers of American English and uses automatically extracted image features to quantify vocal tract posture during these intervals. Statistical analyses show significant differences between vocal tract postures adopted during inter-speech pauses and those at absolute rest before speech; the latter also exhibits a greater variability in the adopted postures. In addition, the articulatory settings adopted during inter-speech pauses in read and spontaneous speech are distinct. The results suggest that adopted vocal tract postures differ on average during rest positions, ready positions and inter-speech pauses, and might, in that order, involve an increasing degree of active control by the cognitive speech planning mechanism.

[1]  Philip J. B. Jackson,et al.  Statistical identification of articulation constraints in the production of speech , 2009, Speech Commun..

[2]  Bryan Gick,et al.  Articulatory settings of French and English monolinguals and bilinguals , 2006 .

[3]  B. Lindblom,et al.  Acoustical consequences of lip, tongue, jaw, and larynx movement. , 1970, The Journal of the Acoustical Society of America.

[4]  Panayiotis G. Georgiou,et al.  SailAlign: Robust long speech-text alignment , 2011 .

[5]  J. Perkell Physiology of speech production: results and implications of a quantitative cineradiographic study , 1969 .

[6]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[7]  J. Laver The phonetic description of voice quality , 1980 .

[8]  Shinji Maeda,et al.  Compensatory Articulation During Speech: Evidence from the Analysis and Synthesis of Vocal-Tract Shapes Using an Articulatory Model , 1990 .

[9]  I R Titze,et al.  The relationship of vocal tract shape to three voice qualities. , 2001, The Journal of the Acoustical Society of America.

[10]  Alan Wrench,et al.  An Ultrasound Protocol for Comparing Tongue Contours: Upright vs Supine , 2011, ICPhS.

[11]  Brad H. Story,et al.  A preliminary study of voice quality transformation based on modifications to the neutral vocal tract area function , 2002, J. Phonetics.

[12]  Shrikanth S. Narayanan,et al.  Investigating articulatory setting - pauses, ready position, and rest - using real-time MRI , 2010, INTERSPEECH.

[13]  H. Traunmüller Conventional, Biological and Environmental Factors in Speech Communication: A Modulation Theory , 1994, Phonetica.

[14]  Alan A Wrench,et al.  A MULTI-CHANNEL/MULTI-SPEAKER ARTICULATORY DATABASE FOR CONTINUOUS SPEECH RECOGNITION RESEARCH , 2000 .

[15]  R. H. Bernacki,et al.  Effects of noise on speech production: acoustic and perceptual analyses. , 1988, The Journal of the Acoustical Society of America.

[16]  P. Mermelstein Articulatory model for the study of speech production. , 1973, The Journal of the Acoustical Society of America.

[17]  Marion Dohen,et al.  An acoustic and articulatory study of Lombard speech: global effects on the utterance , 2006, INTERSPEECH.

[18]  Henry Sweet A Primer of Phonetics , 2009 .

[19]  L Saltzman Elliot,et al.  A Dynamical Approach to Gestural Patterning in Speech Production , 1989 .

[20]  Tamar Flash,et al.  Computational approaches to motor control , 2001, Current Opinion in Neurobiology.

[21]  S. F. Bockman,et al.  Generalizing the formula for areas of polygons to moments , 1989 .

[22]  Karsten Koch,et al.  Language-Specific Articulatory Settings: Evidence from Inter-Utterance Rest Position , 2004, Phonetica.

[23]  D H Whalen,et al.  Posterior pharyngeal wall position in the production of speech. , 2003, Journal of speech, language, and hearing research : JSLHR.

[24]  Shrikanth Narayanan,et al.  Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans. , 2006, The Journal of the Acoustical Society of America.

[25]  Dani Byrd,et al.  Analysis of pausing behavior in spontaneous speech using real-time magnetic resonance imaging of articulation. , 2009, The Journal of the Acoustical Society of America.

[26]  Shrikanth S. Narayanan,et al.  Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images , 2009, IEEE Transactions on Medical Imaging.

[27]  Dani Byrd,et al.  The elastic phrase: modeling the dynamics of boundary-adjacent lengthening , 2003, J. Phonetics.

[28]  R. Swiecinski An EMA Study of Articulatory Settings in Polish Speakers of English , 2013 .

[29]  J. Esling,et al.  Voice Quality Settings and the Teaching of Pronunciation. , 1983 .

[30]  Felix Schaeffler,et al.  Measuring language-specific phonetic settings , 2010 .

[31]  D. Rosenbaum,et al.  Posture-based motion planning: applications to grasping. , 2001, Psychological review.

[32]  John Laver The Concept of Articulatory Settings: An Historical Survey , 1978 .