Motor control primitives arising from a dynamical systems model of vocal tract articulation

We have previously presented a computational approach to derive interpretable movement primitives from speech articulation data using a convolutive Nonnegative Matrix Factorization with sparseness constraints (cNMFsc) technique (Ramanarayanan et al., Interspeech 2011; Ramanarayanan et al., J. Acoust. Soc. Am. 134(2), in press). However, it is not clear whether finding such a dictionary of primitives can be useful for speech motor control, particularly in finding a low-dimensional subspace for such control. In this paper, we examine this possibility in two steps. First, we use the iterative Linear Quadratic Gaussian (iLQG) algorithm to derive a set of control inputs to a dynamical systems model of the vocal tract that produces a desired movement sequence. Second, we use the cNMFsc algorithm to find a small dictionary of control input “primitives'' that can be used to drive said dynamical systems model of the vocal tract to produce the desired range of articulatory movement. We show, using both qualitative ...

[1]  Emanuel Todorov,et al.  Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems , 2004, ICINCO.

[2]  T. Gelder,et al.  Mind as Motion: Explorations in the Dynamics of Cognition , 1995 .

[3]  Shrikanth S. Narayanan,et al.  Statistical methods for estimation of direct and differential kinematics of the vocal tract , 2013, Speech Commun..

[4]  Louis Goldstein,et al.  A task-dynamic toolkit for modeling the effects of prosodic structure on articulation , 2008, Speech Prosody 2008.

[5]  P. Rubin,et al.  CASY: The Haskins Configurable Articulatory Synthesizer , 2003 .

[6]  E. Bizzi,et al.  Article history: , 2005 .

[7]  Athanasios Katsamanis,et al.  Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints , 2011, INTERSPEECH.

[8]  Tamar Flash,et al.  Motor primitives in vertebrates and invertebrates , 2005, Current Opinion in Neurobiology.

[9]  Louis Goldstein,et al.  Dynamics and articulatory phonology , 1996 .

[10]  Paris Smaragdis,et al.  Convolutive Speech Bases and Their Application to Supervised Speech Separation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Francesco Lacquaniti,et al.  Control of Fast-Reaching Movements by Muscle Synergy Combinations , 2006, The Journal of Neuroscience.

[12]  H. Akaike Likelihood of a model and information criteria , 1981 .

[13]  Barak A. Pearlmutter,et al.  Convolutive Non-Negative Matrix Factorisation with a Sparseness Constraint , 2006, 2006 16th IEEE Signal Processing Society Workshop on Machine Learning for Signal Processing.

[14]  D J Ostry,et al.  Coarticulation of jaw movements in speech production: is context sensitivity in speech kinematics centrally planned? , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[15]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[16]  Gregory Shakhnarovich,et al.  Sparse Coding for Learning Interpretable Spatio-Temporal Primitives , 2010, NIPS.

[17]  L Saltzman Elliot,et al.  A Dynamical Approach to Gestural Patterning in Speech Production , 1989 .

[18]  F.A. Mussa-Ivaldi,et al.  Neural primitives for motion control , 2004, IEEE Journal of Oceanic Engineering.

[19]  E. Todorov,et al.  A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..

[20]  Robert A. Jacobs,et al.  Properties of Synergies Arising from a Theory of Optimal Motor Behavior , 2006, Neural Computation.

[21]  Shrikanth S. Narayanan,et al.  Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation. , 2013, The Journal of the Acoustical Society of America.

[22]  Emilio Bizzi,et al.  Shared and specific muscle synergies in natural motor behaviors. , 2005, Proceedings of the National Academy of Sciences of the United States of America.