The effects of text-to-speech system quality on emotional states and frontal alpha band power

The tolerance limit for acceptable multimedia quality is changing as more and more high quality services approach the market. Thus, negative emotional reactions towards low quality services may cause user disappointment and are likely to increase churn rate. The current study analyzes how different levels of synthetic speech quality, obtained from different text-to-speech (TTS) systems, affect the emotional response of a user. This is achieved using two methods: subjective, by means of user reports; and neurophysiological by means of electroencephalography (EEG) analysis. More specifically, we analyzed the frontal alpha band power and correlated this with the subjective ratings based on the Self-Assessment Manikin scale. We found an increase in neuronal activity in the left frontal area with decreasing quality and argue that this is due to user disappointment with low quality TTS systems as they become harder to understand.

[1]  Philip A. Gable,et al.  The role of asymmetric frontal cortical activity in emotion-related phenomena: A review and update , 2010, Biological Psychology.

[2]  A. Gabrielsson Emotions in strong experiences with music. , 2001 .

[3]  Simon King,et al.  The Blizzard Challenge 2009 , 2009 .

[4]  Sebastian Möller,et al.  Too tired for calling? A physiological measure of fatigue caused by bandwidth limitations , 2012, 2012 Fourth International Workshop on Quality of Multimedia Experience.

[5]  P. Wilson,et al.  The Nature of Emotions , 2012 .

[6]  Daphne N. Yu,et al.  High-resolution EEG mapping of cortical activation related to working memory: effects of task difficulty, type of processing, and practice. , 1997, Cerebral cortex.

[7]  H. Jasper,et al.  The ten-twenty electrode system of the International Federation. The International Federation of Clinical Neurophysiology. , 1999, Electroencephalography and clinical neurophysiology. Supplement.

[8]  E. Harmon-Jones,et al.  State anger and prefrontal brain activity: evidence that insult-related relative left-prefrontal activation is associated with experienced anger and aggression. , 2001, Journal of personality and social psychology.

[9]  Sebastian Möller,et al.  Perception of low-quality videos analyzed by means of electroencephalography , 2012, 2012 Fourth International Workshop on Quality of Multimedia Experience.

[10]  M. Bradley,et al.  Measuring emotion: the Self-Assessment Manikin and the Semantic Differential. , 1994, Journal of behavior therapy and experimental psychiatry.

[11]  John J. B. Allen,et al.  Frontal EEG asymmetry as a moderator and mediator of emotion , 2004, Biological Psychology.

[12]  L. Trainor,et al.  Frontal brain electrical activity (EEG) distinguishes valence and intensity of musical emotions , 2001 .

[13]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[14]  Thierry Pun,et al.  DEAP: A Database for Emotion Analysis ;Using Physiological Signals , 2012, IEEE Transactions on Affective Computing.

[15]  E. Harmon-Jones On the relationship of frontal brain activity and anger: Examining the role of attitude toward anger , 2004 .

[16]  Simon Haykin,et al.  Selected topics in signal processing , 1989 .