Studying the Link Between Inter-Speaker Coordination and Speech Imitation Through Human-Machine Interactions

According to accounts of inter-speaker coordination based on internal predictive models, speakers tend to imitate each other each time they need to coordinate their behavior. According to accounts based on the notion of dynamical coupling, imitation should be observed only if it helps stabilizing the specific coordinative pattern produced by the interlocutors or if it is a direct consequence of inter-speaker coordination. To compare these accounts, we implemented an artificial agent designed to repeat a speech utterance while coordinating its behavior with that of a human speaker performing the same task. We asked 10 Italian speakers to repeat the utterance /topkop/ simultaneously with the agent during short time intervals. In some interactions, the agent was parameterized to cooperate with the speakers (by producing its syllables simultaneously with those of the human) while in others it was parameterized to compete with them (by producing its syllables in-between those of the human). A positive correlation between the stability of inter-speaker coordination and the degree of f0 imitation was observed only in cooperative interactions. However, in line with accounts based on prediction, speakers imitate the f0 of the agent regardless of whether this is parameterized to cooperate or to compete with them.

[1]  Leonardo Lancia,et al.  Characterization of laryngealization as irregular vocal fold vibration and interaction with prosodic prominence , 2016, J. Phonetics.

[2]  Véronique Delvaux,et al.  Role of imitation in the emergence of phonological systems , 2015, J. Phonetics.

[3]  Yoshiki Kuramoto,et al.  Chemical Oscillations, Waves, and Turbulence , 1984, Springer Series in Synergetics.

[4]  Dani Byrd,et al.  Dynamic action units slip in speech production errors , 2007, Cognition.

[5]  M. Pickering,et al.  Forward models and their implications for production, comprehension, and dialogue , 2013, Behavioral and Brain Sciences.

[6]  Daniel C. Richardson,et al.  Synchrony and swing in conversation: coordination, temporal dynamics and communication , 2008 .

[7]  Shie Mannor,et al.  Activity and Gait Recognition with Time-Delay Embeddings , 2010, AAAI.

[8]  Karl J. Friston,et al.  A Duet for one , 2015, Consciousness and Cognition.

[9]  J. Kelso,et al.  The human dynamic clamp as a paradigm for social interaction , 2014, Proceedings of the National Academy of Sciences.

[10]  A. Yu,et al.  Phonetic Imitation from an Individual-Difference Perspective: Subjective Attitude, Personality and “Autistic” Traits , 2013, PloS one.

[11]  Werner Verhelst,et al.  An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modification of speech , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  V L Gracco,et al.  On the registration of time and the patterning of speech movements. , 1997, Journal of speech, language, and hearing research : JSLHR.

[13]  Denis G. Pelli,et al.  ECVP '07 Abstracts , 2007, Perception.

[14]  J. Schwartz,et al.  Converging toward a common speech code: imitative and perceptuo-motor recalibration processes in speech production , 2013, Front. Psychol..

[15]  H. Abarbanel,et al.  Determining embedding dimension for phase-space reconstruction using a geometrical construction. , 1992, Physical review. A, Atomic, molecular, and optical physics.

[16]  Ross Bencina,et al.  PortAudio - an Open Source Cross Platform Audio API , 2001, ICMC.

[17]  S. Hirche,et al.  Rhythm Patterns Interaction - Synchronization Behavior for Human-Robot Joint Action , 2014, PloS one.

[18]  Jürgen Kurths,et al.  Analysis of Bivariate Coupling by Means of Recurrence , 2008 .

[19]  Michael J. Richardson,et al.  Language Use, Coordination, and the Emergence of Cooperative Action , 2008 .

[20]  H. S. Kim,et al.  Nonlinear dynamics , delay times , and embedding windows , 1999 .