Affective Visualization and Retrieval for Music Video

Music video (MV) has become a favorite pastime because it is concise, convenient, and offers audiences both an audio and a visual experience. As the number of MVs grows explosively, developing new techniques for effective MV analysis, retrieval, and management has become an important task. By simulating the human affective response mechanism, affective video content analysis extracts the affective information contained in videos; with this information, natural, user-friendly, and effective MV access strategies can be developed. In this paper, a novel integrated system (i.MV) is proposed for personalized MV affective analysis, visualization, and retrieval. In i.MV, we not only perform personalized MV affective analysis, a challenging and insufficiently covered problem in the current affective content analysis field, but also propose a novel affective visualization that presents abstract affective states in a form that is intuitive and friendly to users. Building on the affective analysis and visualization, MV retrieval based on affective information is achieved. Both comprehensive experiments and subjective user studies on a large MV dataset demonstrate that our personalized affective analysis is more effective than previous algorithms. In addition, affective visualization is shown to be more suitable for affective-information-based MV retrieval than the commonly used affective state representation strategies.
