Analysing user physiological responses for affective video summarisation

Abstract Video summarisation techniques aim to abstract the most significant content from a video stream. This is typically achieved by processing low-level image, audio and text features which are still quite disparate from the high-level semantics that end users identify with (the ‘semantic gap’). Physiological responses are potentially rich indicators of memorable or emotionally engaging video content for a given user. Consequently, we investigate whether they may serve as a suitable basis for a video summarisation technique by analysing a range of user physiological response measures, specifically electro-dermal response (EDR), respiration amplitude (RA), respiration rate (RR), blood volume pulse (BVP) and heart rate (HR), in response to a range of video content in a variety of genres including horror, comedy, drama, sci-fi and action. We present an analysis framework for processing the user responses to specific sub-segments within a video stream based on percent rank value normalisation. The application of the analysis framework reveals that users respond significantly to the most entertaining video sub-segments in a range of content domains. Specifically, horror content seems to elicit significant EDR, RA, RR and BVP responses, and comedy content elicits comparatively lower levels of EDR, but does seem to elicit significant RA, RR, BVP and HR responses. Drama content seems to elicit less significant physiological responses in general, and both sci-fi and action content seem to elicit significant EDR responses. We discuss the implications this may have for future affective video summarisation approaches.

[1]  Alan Hanjalic,et al.  Adaptive extraction of highlights from a sport video based on excitement modeling , 2005, IEEE Transactions on Multimedia.

[2]  Stephen H. Fairclough,et al.  A research agenda for physiological computing , 2004, Interact. Comput..

[3]  J. Stainer,et al.  The Emotions , 1922, Nature.

[4]  F. C. BARTLETT Psychology of Behaviour , 1951, Nature.

[5]  Ioannis Pitas,et al.  Information theory-based shot cut/fade detection and video summarization , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Jonathan Klein,et al.  Frustrating the user on purpose: a step toward building an affective computer , 2002, Interact. Comput..

[8]  Nevenka Dimitrova Context and Memory in Multimedia Content Analysis , 2004, IEEE Multim..

[9]  R. Krauss,et al.  Facial and autonomic manifestations of the dimensional structure of emotion , 1984 .

[10]  Ling Guan,et al.  Semantic Retrieval of Multimedia by Concept Languages , 2006 .

[11]  Thomas S. Huang,et al.  Efficient access to video content in a unified framework , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[12]  Russell Beale,et al.  Affect and Emotion in Human-Computer Interaction, From Theory to Applications , 2008, Affect and Emotion in Human-Computer Interaction.

[13]  Hua Wang,et al.  Communicating emotions in online chat using physiological sensors and animated text , 2004, CHI EA '04.

[14]  Borko Furht,et al.  Encyclopedia of Multimedia , 2006 .

[15]  D. Damos Multiple-task performance , 2020 .

[16]  Stefan Koelsch,et al.  The Role of Harmonic Expectancy Violations in Musical Emotions: Evidence from Subjective, Physiological, and Neural Responses , 2006, Journal of Cognitive Neuroscience.

[17]  J. Gross,et al.  Emotion elicitation using films , 1995 .

[18]  N. Frijda,et al.  Emotions and respiratory patterns: review and critical analysis. , 1994, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[19]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[20]  Robert D. Ward,et al.  Physiological responses to different WEB page designs , 2003, Int. J. Hum. Comput. Stud..

[21]  Pierre Philippot,et al.  Respiratory feedback in the generation of emotion , 2002 .

[22]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[23]  P. Lang,et al.  Affective judgment and psychophysiological response: Dimensional covariation in the evaluation of pictorial stimuli. , 1989 .

[24]  Raj Acharya,et al.  Color Space Quantization for Color-Content-Based Query Systems , 2004, Multimedia Tools and Applications.

[25]  Patrick Gomez,et al.  Respiratory responses during affective picture viewing , 2004, Biological Psychology.

[26]  Mauro Barbieri,et al.  Video summarization: methods and landscape , 2003, SPIE ITCom.

[27]  Jennifer A. Healey,et al.  Wearable and automotive systems for affect recognition from physiology , 2000 .

[28]  Li-Qun Xu,et al.  User-oriented affective video content analysis , 2001, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL 2001).

[29]  Jonas K. Olofsson,et al.  Affective picture processing: An integrative review of ERP findings , 2008, Biological Psychology.

[30]  Christine L. Lisetti,et al.  Emotion recognition from physiological signals using wireless sensors for presence technologies , 2004, Cognition, Technology & Work.

[31]  P M Monti,et al.  Anger arousal by a motion picture: a methodological note. , 1977, The American journal of psychiatry.

[32]  Kiyoharu Aizawa,et al.  Evaluation of video summarization for a large number of cameras in ubiquitous home , 2005, MULTIMEDIA '05.

[33]  Geographic Video Content , 2008, Encyclopedia of Multimedia.

[34]  G. Bower Affect and Cognition , 1983, A Configuration Approach to Mindset Agency Theory.

[35]  Chia-Hung Yeh,et al.  Techniques for movie content analysis and skimming: tutorial and overview on video abstraction techniques , 2006, IEEE Signal Processing Magazine.

[36]  A. Kramer,et al.  Physiological metrics of mental workload: A review of recent progress , 1990, Multiple-task performance.

[37]  K. Scherer,et al.  Psychophysiological responses to appraisal dimensions in a computer game , 2004 .

[38]  Annie Lang,et al.  The effects of production pacing and arousing content on the information processing of television messages , 1999 .

[39]  Jennifer Healey,et al.  Toward Machine Emotional Intelligence: Analysis of Affective Physiological State , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Malcolm Jones,et al.  The Confidence Man , 2002 .

[41]  Lisa Feldman Barrett,et al.  The Structure of Emotion , 2006 .

[42]  R. Simons,et al.  Image motion and context: a between- and within-subjects comparison. , 2000, Psychophysiology.

[43]  W. Snodgrass Physiology , 1897, Nature.

[44]  John Zimmerman,et al.  Interface Design for MyInfo: a Personal News Demonstrator Combining Web and TV Content , 2003, INTERACT.

[45]  Raimondo Schettini,et al.  Erratum to: An innovative algorithm for key frame extraction in video summarization , 2006, Journal of Real-Time Image Processing.

[46]  Roland Göcke,et al.  The Composite Sensing of Affect , 2008, Affect and Emotion in Human-Computer Interaction.

[47]  H. Lüders,et al.  Comments , 2002, Clinical Neurophysiology.

[48]  S. Steinhauer,et al.  Respiratory sinus arrhythmia as an index of emotional response in young adults. , 2004, Psychophysiology.

[49]  V. Ghini,et al.  An audio-video summarization scheme based on audio and video analysis , 2006, CCNC 2006. 2006 3rd IEEE Consumer Communications and Networking Conference, 2006..

[50]  John R. Smith,et al.  Hierarchical video summarization based on context clustering , 2003, SPIE ITCom.

[51]  Alan Hanjalic,et al.  Affective video content representation and modeling , 2005, IEEE Transactions on Multimedia.

[52]  Fatma Nasoz,et al.  Emotion Recognition from Physiological Signals for Presence Technologies , 2004 .

[53]  Changsheng Xu,et al.  Live sports event detection based on broadcast video and web-casting text , 2006, MM '06.

[54]  N. Birbaumer,et al.  The Structure of emotion : psychophysiological, cognitive, and clinical aspects , 1993 .

[55]  P. Gomez,et al.  Affective and physiological responses to environmental noises and music. , 2004, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[56]  P. Ekman,et al.  Autonomic nervous system activity distinguishes among emotions. , 1983, Science.

[57]  Fumiko Satoh,et al.  Learning personalized video highlights from detailed MPEG-7 metadata , 2002, Proceedings. International Conference on Image Processing.

[58]  Hiroshi Nittono,et al.  Level of interest in video clips modulates event-related potentials to auditory probes. , 2005, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[59]  R. Simons,et al.  Roll ‘em!: The effects of picture motion on emotional responses , 1998 .

[60]  R. Piferi,et al.  An alternative approach for achieving cardiovascular baseline: viewing an aquatic video. , 2000, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[61]  Noboru Babaguchi,et al.  Generation of personalized abstract of sports video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[62]  O. Van den Bergh,et al.  Hyperventilation beyond fight/flight: respiratory responses during emotional imagery. , 2001, Psychophysiology.

[63]  Dirk Hagemann,et al.  The assessment of affective reactivity using films: Validity, reliability and sex differences , 1999 .

[64]  Kenneth E. Sawin,et al.  Tea for three: control of fission yeast polarity , 2005, Nature Cell Biology.

[65]  Harry W. Agius,et al.  Video summarisation: A conceptual framework and survey of the state of the art , 2008, J. Vis. Commun. Image Represent..

[66]  Alberto Del Bimbo,et al.  Retrieval of Commercials by Semantic Content: The Semiotic Perspective , 2004, Multimedia Tools and Applications.

[67]  Kiyoharu Aizawa,et al.  Efficient retrieval of life log based on context and content , 2004, CARPE'04.

[68]  Mohan S. Kankanhalli,et al.  A new approach to automatic music video summarization , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[69]  Noboru Babaguchi,et al.  Video Summarization for Large Sports Video Archives , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[70]  Regan L. Mandryk,et al.  Physiological indicators for the evaluation of co-located collaborative play , 2004, CSCW.