Beware of the individual: Evaluating prominence perception in spontaneous speech

Much of the existing research on prominence perception has focused on read speech in American English and German. The present paper presents two experiments that build on and extend insights from these studies in two ways. Firstly, we elicit prominence judgments on spontaneous speech. Secondly, we investigate gradient rather than binary prominence judgments by introducing a finger tapping task. We then provide a within-participant comparison of gradient prominence results with binary prominence judgments to evaluate their correspondence. Our results show that participants exhibit different success rates in tapping the prominence pattern of spontaneous data, but generally tapping results correlate well with binary prominence judgments within individuals. Random forest analyses of the acoustic parameters involved show that pitch accentuation and duration play important roles in both binary judgments and prominence tapping patterns. We can also confirm earlier findings from read speech that differences exist between participants in the relative importance rankings of various signal and systematic properties.

[1]  Barbara Samlowski,et al.  Exploiting the speech-gesture link to capture fine-grained prosodic prominence impressions and listening strategies , 2019, J. Phonetics.

[2]  José Ignacio Hualde,et al.  Sound, structure and meaning: The bases of prominence ratings in English, French and Spanish , 2019, J. Phonetics.

[3]  Bodo Winter,et al.  What makes a word prominent? Predicting untrained German listeners' perceptual judgments , 2018, J. Phonetics.

[4]  Henrik Niemann,et al.  Integrating the discreteness and continuity of intonational categories , 2017, J. Phonetics.

[5]  Youri Maryn,et al.  A Comparison of Cepstral Peak Prominence Measures From Two Acoustic Analysis Programs. , 2017, Journal of voice : official journal of the Voice Foundation.

[6]  Andreas Ziegler,et al.  ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R , 2015, 1508.04409.

[7]  Stefanie Shattuck-Hufnagel,et al.  New Methods for Prosodic Transcription: Capturing Variability as a Source of Information , 2016 .

[8]  Antje Schweitzer,et al.  Attention, please! Expanding the GECO database , 2015, ICPhS.

[9]  Lenka Weingartová,et al.  Short-term Spectral Slope Measures and their Sensitivity to Speaker, Vowel Identity and Prominence Ukazatele krátkodobého spektrálního sklonu a jejich citlivost na mluvčího, identitu vokálu a prominenci , 2014 .

[10]  Ailbhe Ní Chasaide,et al.  The voice prominence hypothesis: the interplay of F0 and voice source features in accentuation , 2013, INTERSPEECH.

[11]  Klaus J. Kohler,et al.  The Perception of Lexical Stress in German: Effects of Segmental Duration and Vowel Quality in Different Prosodic Patterns , 2012, Phonetica.

[12]  Mark Hasegawa-Johnson,et al.  Signal-based and expectation-based factors in the perception of prosodic prominence , 2010 .

[13]  J. Cole,et al.  Please Scroll down for Article Language and Cognitive Processes the Role of Syntactic Structure in Guiding Prosody Perception with Ordinary Listeners and Everyday Speech the Role of Syntactic Structure in Guiding Prosody Perception with Ordinary Listeners and Everyday Speech , 2022 .

[14]  Achim Zeileis,et al.  Conditional variable importance for random forests , 2008, BMC Bioinformatics.

[15]  Achim Zeileis,et al.  Bias in random forest variable importance measures: Illustrations, sources and a solution , 2007, BMC Bioinformatics.

[16]  P. Bühlmann,et al.  Survival ensembles. , 2006, Biostatistics.

[17]  G. M. Cambier Langeveld,et al.  Temporal Marking of Accents and Boundaries , 2000 .

[18]  J. Catlin,et al.  On the word-frequency effect. , 1969 .

[19]  D. Fry Experiments in the Perception of Stress , 1958 .

[20]  D. Fry Duration and Intensity as Physical Correlates of Linguistic Stress , 1954 .