Beat It! Gesture-based Prominence Annotation as a Window to Individual Prosody Processing Strategies

In recent work [1], we suggested a novel approach to fast, fine-grained prominence annotation by naïve listeners. Annotators “drum” a replication of a perceived utterance, modulating their drumming velocity in accordance with the perceptual prominence of consecutive linguistic units (syllables, words). The drumming velocity then serves as a fine-grained operationalization of prosodic prominence. This intuitive method exploits the established link between prominence and speech-accompanying gesture [2, 3]. Because it is fast and easy, it allows large amounts of data to be annotated rapidly, and it yields results comparable to fine-grained expert annotations of prominence. In the present study, we evaluated the method further in two ways. First, we compared intra-sentential prosodic variation as measured with traditional annotations and with the drumming method. Our results show that “drummed” prominences capture speaking-style-related variability similarly to conventional annotation methods. Second, we examined whether individual listener strategies can be identified with the help of Random Forests, which allow the individual impact of established prominence correlates on prominence impressions to be estimated. Our analyses reveal individual listener strategies for blending and integrating top-down, bottom-up and context cues into impressions of prosodic prominence.
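As a rough illustration of how drumming velocity could be turned into a per-unit prominence score, the sketch below z-scores each annotator's velocities within one utterance and then averages across annotators. Everything in it is an assumption made for illustration: the abstract does not specify a normalization scheme, the velocity values are invented MIDI-style numbers (0 to 127), and the function names `normalize_velocities` and `mean_prominence` are hypothetical.

```python
from statistics import mean, pstdev


def normalize_velocities(velocities):
    """Z-score one annotator's drum velocities within a single utterance.

    Assumption: per-annotator normalization removes differences in overall
    drumming force, so only relative prominence remains.
    """
    if not velocities:
        return []
    mu = mean(velocities)
    sigma = pstdev(velocities)
    if sigma == 0:
        # All beats equally strong: no prominence contrast to report.
        return [0.0 for _ in velocities]
    return [(v - mu) / sigma for v in velocities]


def mean_prominence(per_annotator):
    """Average the normalized scores across annotators, unit by unit."""
    normalized = [normalize_velocities(v) for v in per_annotator]
    if len({len(n) for n in normalized}) != 1:
        raise ValueError("every annotator must drum exactly once per unit")
    return [mean(unit) for unit in zip(*normalized)]


# Hypothetical velocities for a four-syllable utterance, two annotators.
drummed = [
    [40, 90, 50, 110],
    [60, 80, 65, 100],
]
scores = mean_prominence(drummed)
most_prominent = max(range(len(scores)), key=scores.__getitem__)
```

Here `most_prominent` is the index of the unit with the highest averaged score. Normalizing before averaging is one possible design choice; raw velocities or a different scaling could be equally defensible depending on the recording setup.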

[1]  Petra Wagner.  Great expectations - introspective vs. perceptual prominence ratings and their acoustic correlates, 2005, INTERSPEECH.

[2]  Anders Eriksson, et al.  Syllable prominence: a matter of vocal effort, phonetic distinctness and top-down processing, 2001, INTERSPEECH.

[3]  Petra Wagner, et al.  PromDrum - Exploiting the prosody-gesture link for intuitive, fast and fine-grained prominence annotation, 2016.

[4]  Petra Wagner, et al.  Different parts of the same elephant: A roadmap to disentangle and connect different perspectives on prosodic prominence, 2015, ICPhS.

[5]  Maria Wolters, et al.  Prediction of word prominence, 1997, EUROSPEECH.

[6]  Mark Hasegawa-Johnson, et al.  Signal-based and expectation-based factors in the perception of prosodic prominence, 2010.

[7]  Stefan Baumann, et al.  The perceptual prominence of pitch accent types in German, 2015, ICPhS.

[8]  Carlos Gussenhoven, et al.  Fundamental frequency declination in Dutch: testing three hypotheses, 1988.

[9]  Petra Wagner, et al.  Using generalized additive models and random forests to model prosodic prominence in German, 2013, INTERSPEECH.

[10]  K. D. de Jong.  The supraglottal articulation of prominence in English: linguistic stress as localized hyperarticulation, 1995, The Journal of the Acoustical Society of America.

[11]  Martti Vainio, et al.  Tonal features, intensity, and word order in the perception of prominence, 2006, J. Phonetics.

[12]  R Core Team, et al.  R: A language and environment for statistical computing, 2014.

[13]  Dani Byrd, et al.  Spatiotemporal coupling between speech and manual motor actions, 2014, J. Phonetics.