This study examines the acoustic and articulatory behaviors underlying the perception of emotion strength by analyzing acted emotional speech. Listeners rated emotion identity, strength, and confidence. Parameters related to pitch, loudness, and articulatory kinematics are then related to a two-level (strong/weak) representation of emotion strength. Two-class discriminant analyses yield averaged leave-one-out accuracies of 65.8% and 63.8% in the acoustic and articulatory domains, respectively. A two-factor ANOVA (emotion type and strength) indicates that listeners assess emotion strength based on the nature of the perceived emotion along the arousal dimension. Only hot anger and happiness show significant differences in pitch use between the strong and weak levels. Such contrasts are also observed in tongue lowering and/or advancing. Listeners' strength judgments may rely mainly on pitch and loudness; however, the interactions between acoustic and articulatory parameters in strength perception are complex.
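As an illustration of what a two-class discriminant analysis with leave-one-out evaluation involves, the following is a minimal sketch and not the procedure used in the study. It assumes a Fisher linear discriminant with equal class priors on two features, and the feature values (a pitch-like and a loudness-like measure) are invented toy numbers, not data from the study. The function names `fit_lda`, `predict`, and `leave_one_out_accuracy` are hypothetical.

```python
from statistics import mean


def fit_lda(samples, labels):
    """Fit a two-class Fisher discriminant on 2-D feature vectors.

    Assumption: equal class priors, so the decision threshold sits at the
    projection of the midpoint between the two class means.
    """
    groups = {0: [], 1: []}
    for x, y in zip(samples, labels):
        groups[y].append(x)
    means = {
        c: (mean(p[0] for p in pts), mean(p[1] for p in pts))
        for c, pts in groups.items()
    }
    # Pooled within-class scatter matrix [[sxx, sxy], [sxy, syy]].
    sxx = sxy = syy = 0.0
    for c, pts in groups.items():
        mx, my = means[c]
        for x0, x1 in pts:
            dx, dy = x0 - mx, x1 - my
            sxx += dx * dx
            sxy += dx * dy
            syy += dy * dy
    # Tiny ridge term keeps the 2x2 matrix invertible on degenerate data.
    sxx += 1e-9
    syy += 1e-9
    det = sxx * syy - sxy * sxy
    d0 = means[1][0] - means[0][0]
    d1 = means[1][1] - means[0][1]
    # w = S^-1 (m1 - m0), using the closed-form inverse of a 2x2 matrix.
    w = ((syy * d0 - sxy * d1) / det, (-sxy * d0 + sxx * d1) / det)
    mid = ((means[0][0] + means[1][0]) / 2, (means[0][1] + means[1][1]) / 2)
    threshold = w[0] * mid[0] + w[1] * mid[1]
    return w, threshold


def predict(model, x):
    """Return 1 (strong) if the projection exceeds the threshold, else 0 (weak)."""
    w, threshold = model
    return 1 if w[0] * x[0] + w[1] * x[1] > threshold else 0


def leave_one_out_accuracy(samples, labels):
    """Hold out each sample once, train on the rest, and score the held-out one."""
    hits = 0
    for i in range(len(samples)):
        train_x = samples[:i] + samples[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        model = fit_lda(train_x, train_y)
        hits += predict(model, samples[i]) == labels[i]
    return hits / len(samples)


# Invented toy data: (pitch-like value, loudness-like value); 0 = weak, 1 = strong.
toy_samples = [
    (180, 60), (190, 58), (185, 63), (175, 61),
    (240, 70), (250, 73), (245, 69), (235, 72),
]
toy_labels = [0, 0, 0, 0, 1, 1, 1, 1]

accuracy = leave_one_out_accuracy(toy_samples, toy_labels)
print(f"leave-one-out accuracy: {accuracy:.3f}")
```

The toy classes are deliberately well separated, so the accuracy here says nothing about the 65.8% and 63.8% figures reported above; those depend on the actual acoustic and articulatory parameters and on how the per-condition results were averaged.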