Articulatory settings facilitate mechanically advantageous motor control of vocal tract articulators

It was recently shown that vocal tract postures assumed during pauses in read speech are significantly different from those assumed at absolute rest. This paper examines whether the former category of “articulatory settings” are more mechanically advantageous than absolute rest postures with respect to speech articulation. Appropriate task and articulator variables are extracted from real-time Magnetic Resonance Imaging (rtMRI) data of five speakers reading aloud. Locally-weighted regression is then used to calculate Jacobian matrices representing the transformation between articulatory task velocities and postural velocities. A measure of mechanical advantage is proposed based on the obtained Jacobian. Speech-ready postures and postures during inter-speech pauses are observed to be significantly more mechanically advantageous as compared to rest postures. Furthermore, other postures, such as those that occur during the production of different vowels and consonants, are shown to have mechanical advantages that lie in between this continuum. These results could provide insights into understanding postural motor control and other linguistic phenomena, such as sonority hierarchies, in speech production. Index Terms: speech production, real-time MRI, articulatory setting, postural motor control, task dynamics, forward kinematics, vocal tract shaping.

[1]  Henry Sweet A Primer of Phonetics , 2009 .

[2]  Shrikanth S. Narayanan,et al.  Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.

[3]  Jonathan Cagan,et al.  Formal Engineering Design Synthesis , 2005 .

[4]  Shrikanth S. Narayanan,et al.  Statistical methods for estimation of direct and differential kinematics of the vocal tract , 2013, Speech Commun..

[5]  Bryan Gick,et al.  Articulatory settings of French and English monolinguals and bilinguals , 2006 .

[6]  Felix Schaeffler,et al.  Measuring language-specific phonetic settings , 2010 .

[7]  D. Rosenbaum,et al.  Posture-based motion planning: applications to grasping. , 2001, Psychological review.

[8]  J. Esling,et al.  Voice Quality Settings and the Teaching of Pronunciation. , 1983 .

[9]  Shrikanth Narayanan,et al.  An approach to real-time magnetic resonance imaging for speech production. , 2003, The Journal of the Acoustical Society of America.

[10]  J. Laver The phonetic description of voice quality , 1980 .

[11]  Karsten Koch,et al.  Language-Specific Articulatory Settings: Evidence from Inter-Utterance Rest Position , 2004, Phonetica.

[12]  Alan A Wrench,et al.  A MULTI-CHANNEL/MULTI-SPEAKER ARTICULATORY DATABASE FOR CONTINUOUS SPEECH RECOGNITION RESEARCH , 2000 .

[13]  Shrikanth S. Narayanan,et al.  An investigation of articulatory setting using real-time magnetic resonance imaging. , 2013, The Journal of the Acoustical Society of America.

[14]  Dani Byrd,et al.  Analysis of pausing behavior in spontaneous speech using real-time magnetic resonance imaging of articulation. , 2009, The Journal of the Acoustical Society of America.

[15]  Shrikanth S. Narayanan,et al.  Investigating articulatory setting - pauses, ready position, and rest - using real-time MRI , 2010, INTERSPEECH.

[16]  John Laver The Concept of Articulatory Settings: An Historical Survey , 1978 .

[17]  Shrikanth Narayanan,et al.  Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans. , 2006, The Journal of the Acoustical Society of America.