Automated Pattern Analysis in Gesture Research: Similarity Measuring in 3D Motion Capture Models of Communicative Action

The question of how to model similarity between gestures plays an important role in current studies in the domain of human communication. Most research into recurrent patterns in co-verbal gestures – manual communicative movements emerging spontaneously during conversation – is driven by qualitative analyses relying on observational comparisons between gestures. Due to the fact that these kinds of gestures are not bound to well-formedness conditions, however, we propose a quantitative approach consisting of a distance-based similarity model for gestures recorded and represented in motion capture data streams. To this end, we model gestures by flexible feature representations, namely gesture signatures, which are then compared via signature-based distance functions such as the Earth Mover's Distance and the Signature Quadratic Form Distance. Experiments on real conversational motion capture data evidence the appropriateness of the proposed approaches in terms of their accuracy and efficiency. Our contribution to gesture similarity research and gesture data analysis allows for new quantitative methods of identifying patterns of gestural movements in human face-to-face interaction, i.e., in complex multimodal data sets.

[1]  D. McNeill So you think gestures are nonverbal , 1985 .

[2]  Hsiao-Lung Chan,et al.  Human identification by quantifying similarity and dissimilarity in electrocardiogram phase space , 2009, Pattern Recognit..

[3]  Irene Mittelberg,et al.  The exbodied mind: Cognitive-semiotic principles as motivating forces in gesture , 2013 .

[4]  Matt Huenerfauth,et al.  Collecting a Motion-Capture Corpus of American Sign Language for Data-Driven Generation Research , 2010, SLPAT@NAACL.

[5]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[6]  Stefan Kopp,et al.  How Do Iconic Gestures Convey Visuo-Spatial Information? Bringing Together Empirical, Theoretical, and Simulation Studies , 2011, Gesture Workshop.

[7]  Eve Sweetser Looking at space to study mental spaces: Co-speech gesture as a crucial data source in cognitive linguistics , 2007 .

[8]  Thomas Seidl,et al.  Signature matching distance for content-based image retrieval , 2013, ICMR.

[9]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[10]  Thomas Seidl,et al.  Efficient Filter Approximation Using the Earth Mover's Distance in Very Large Multimedia Databases with Feature Signatures , 2014, CIKM.

[11]  Thomas Seidl,et al.  A comparative study of similarity measures for content-based multimedia retrieval , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[12]  Thomas Seidl,et al.  On stability of signature-based similarity measures for content-based image retrieval , 2012, Multimedia Tools and Applications.

[13]  Peter Krapp,et al.  90. Kulturwissenschaftliche Orientierung in der Gestenforschung , 2016 .

[14]  Jana Bressem,et al.  70. A linguistic perspective on the notation of form features in gestures , 2013 .

[15]  Mark Turner,et al.  Multimodal Construction Grammar , 2012 .

[16]  Bernhard Schölkopf,et al.  The Kernel Trick for Distances , 2000, NIPS.

[17]  Jens Edlund,et al.  Kinetic data for large-scale analysis and modeling of face-to-face conversation , 2011, AVSP.

[18]  Irene Mittelberg Interne und externe Metonymie: Jakobsonsche Kontiguitätsbeziehungen in redebegleitenden gesten , 2010 .

[19]  Alan Cienki 11. Cognitive Linguistics: Spoken language and gesture as expressions of conceptualization , 2013 .

[20]  G. W. Hughes,et al.  Minimum Prediction Residual Principle Applied to Speech Recognition , 1975 .

[21]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[22]  Thomas Seidl,et al.  Spatiotemporal Similarity Search in 3D Motion Capture Gesture Streams , 2015, SSTD.

[23]  Thies Pfeiffer,et al.  Gesture Semantics Reconstruction Based on Motion Capturing and Complex Event Processing: a Circular Shape Example , 2013, SIGDIAL Conference.

[24]  Thomas Seidl,et al.  Signature Quadratic Form Distance , 2010, CIVR '10.

[25]  Jakub Lokoc,et al.  Ptolemaic access methods: Challenging the reign of the metric space model , 2013, Inf. Syst..

[26]  Lei Chen,et al.  Robust and fast similarity search for moving object trajectories , 2005, SIGMOD '05.

[27]  Ellen Fricke Phonaestheme, Kinaestheme und multimodale Grammatik: Wie Artikulationen zu Typen werden, die bedeuten können , 2010 .

[28]  Thomas Seidl,et al.  Indexing the signature quadratic form distance for efficient content-based multimedia retrieval , 2011, ICMR.

[29]  Francis K. H. Quek,et al.  Catchments, prosody and discourse , 2001 .

[30]  Alan Cienki,et al.  Image schemas and gesture , 2005 .