Gesture Networks: Introducing Dynamic Time Warping and Network Analysis for the Kinematic Study of Gesture Ensembles

ABSTRACT We jointly apply established methods from time-series analysis and network analysis to the kinematic study of gesture ensembles. We define a gesture ensemble as the set of gestures produced during discourse by a single person or a group of persons, and we ask how the gestures in an ensemble relate to one another kinematically. We use dynamic time warping, a bivariate time-series analysis, to assess how similar the velocity profile of each gesture is to that of every other gesture in the ensemble. We also consider multivariate cases that combine gesture velocity with speech amplitude envelope profiles. Relating each gesture event to all other gesture events in the ensemble yields a weighted matrix that represents a network of similarity relationships. Network analysis can then be applied to this matrix to gauge, for example, how diverse or coherent certain gestures are with respect to the ensemble. We believe these analyses promise to be of great value for gesture studies, because they show how low-level gesture features (the kinematics of gesture) relate to the higher-order organizational structures present at the level of discourse.
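
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the textbook dynamic time warping recursion with an absolute-difference local cost, the synthetic velocity profiles, the function names, and the use of mean distance as a per-gesture summary are all assumptions made for illustration. The sketch compares every pair of univariate velocity profiles, collects the results in a symmetric weighted matrix, and derives one simple network-style measure from it.

```python
import numpy as np


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two 1-D series.

    Uses the absolute difference as local cost and no window constraint
    (both are illustrative assumptions).
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    n, m = len(a), len(b)
    # cost[i, j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i, j] = local + min(
                cost[i - 1, j],      # advance in a only
                cost[i, j - 1],      # advance in b only
                cost[i - 1, j - 1],  # advance in both
            )
    return cost[n, m]


def distance_matrix(profiles):
    """Pairwise DTW distances: a symmetric weighted matrix (the 'network')."""
    k = len(profiles)
    D = np.zeros((k, k))
    for i in range(k):
        for j in range(i + 1, k):
            D[i, j] = D[j, i] = dtw_distance(profiles[i], profiles[j])
    return D


def mean_distance_per_gesture(D):
    """Average distance from each gesture to all others.

    A hypothetical summary measure: higher values mark gestures that are
    less similar to the rest of the ensemble.
    """
    k = D.shape[0]
    return D.sum(axis=1) / (k - 1)


# Synthetic velocity profiles (made-up data, not from any recording):
# two single-peak gestures of different duration and one two-peak gesture.
profiles = [
    np.sin(np.linspace(0, np.pi, 20)),
    np.sin(np.linspace(0, np.pi, 30)),
    np.abs(np.sin(np.linspace(0, 2 * np.pi, 25))),
]

D = distance_matrix(profiles)
mean_dist = mean_distance_per_gesture(D)
print(np.round(D, 2))
print(np.round(mean_dist, 2))
```

Because dynamic time warping absorbs differences in duration, the two single-peak profiles end up close to each other, while the two-peak profile sits farther from both and receives the largest mean distance. Richer network measures could be computed from the same matrix.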
