Personalized video adaptation based on video content analysis

Personalized video adaptation is expected to satisfy individual users' needs on video content. Multimedia data mining plays a significant role of video annotation to meet users' preference on video content. In this paper, a comprehensive solution for personalized video adaptation is proposed based on video content mining. Video content mining targets both cognitive content and affective content. Cognitive content refers to those semantic events, which are very specific for the video domains. Sometimes, users might prefer "emotional decision" to select their interested video content. Therefore, we introduce affective content which causes audiences' strong reactions. For cognitive content mining, features are extracted from multiple modalities. Machine learning module is further performed to get some middle-level features, such as specific audio sounds, semantic video shots and so on. Those middle-level features are used to detect cognitive content by using Hidden Markov Models. For affective content mining, affective content is detected with three affective levels: "low", "medium" and "high". Considering affective levels might have no sharp boundaries, fuzzy c mean clustering is used on low-level features to simulate user's perceptions. The adaptation is later implemented based on MPEG-21 Digital Item Adaptation framework. One of the challenges is how to quantify users' preference on video content. Information Entropy (IE) and Membership Functions are calculated to decide priorities for resource allocation for cognitive content and affective content respectively.

[1]  Mahmoud Naghshineh,et al.  End-to-end QoS provisioning in multimedia wireless/mobile networks using an adaptive framework , 1997, IEEE Commun. Mag..

[2]  Greg M. Smith,et al.  Passionate views : film, cognition, and emotion , 1999 .

[3]  Christian Timmerer,et al.  Bitstream syntax description: a tool for multimedia resource adaptation within MPEG-21 , 2003, Signal Process. Image Commun..

[4]  Jaakko J. Sauvola,et al.  A Content Model for the Mobile Adaptation of Multimedia Information , 2001, J. VLSI Signal Process..

[5]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[6]  Shih-Fu Chang,et al.  Real-time content-based adaptive streaming of sports videos , 2001, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL 2001).

[7]  Ahmed Karmouch,et al.  Policy-Driven Personalized Multimedia Services for Mobile Users , 2003, IEEE Trans. Mob. Comput..

[8]  Mohan S. Kankanhalli,et al.  Creating audio keywords for event detection in soccer video , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[9]  Douglas L. Jones,et al.  Design and evaluation of a cross-layer adaptation framework for mobile multimedia systems , 2003, IS&T/SPIE Electronic Imaging.

[10]  Jon Crowcroft,et al.  Quality-of-Service Routing for Supporting Multimedia Applications , 1996, IEEE J. Sel. Areas Commun..

[11]  Yui-Lam Chan,et al.  New architecture for dynamic frame-skipping transcoder , 2002, IEEE Trans. Image Process..

[12]  Steve Young,et al.  The HTK book , 1995 .

[13]  Domenico Cotroneo,et al.  A user-driven adaptation strategy for mobile video streaming applications , 2005, 25th IEEE International Conference on Distributed Computing Systems Workshops.

[14]  F. Ren,et al.  Semi-automatic emotion recognition from textual input based on the constructed emotion thesaurus , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[15]  Peter Parnes,et al.  Characterizing user access to videos on the World Wide Web , 1999, Electronic Imaging.

[16]  Wei-Po Lee,et al.  A user-centered remote control system for personalized multimedia channel selection , 2004, IEEE Transactions on Consumer Electronics.

[17]  Richard Han,et al.  Dynamic adaptation in an image transcoding proxy for mobile Web browsing , 1998, IEEE Wirel. Commun..

[18]  P. Venkat Rangan,et al.  Architectures for personalized multimedia , 1994, IEEE MultiMedia.

[19]  Michael S. Lew Next-Generation Web Searches for Visual Content , 2000, Computer.

[20]  Jiebo Luo,et al.  Large-scale multimodal semantic concept detection for consumer video , 2007, MIR '07.

[21]  Hang-Bong Kang,et al.  Affective content detection using HMMs , 2003, ACM Multimedia.

[22]  Yue-Kai Huang,et al.  Visual/Acoustic Emotion Recognition , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[23]  M. Hicks,et al.  User-specified adaptive scheduling in a streaming media network , 2003, 2003 IEEE Conference onOpen Architectures and Network Programming..

[24]  Jane Yung-jen Hsu,et al.  Toward semantic indexing and retrieval using hierarchical audio models , 2005, Multimedia Systems.

[25]  Zhihong Zeng,et al.  Audio-Visual Affect Recognition , 2007, IEEE Transactions on Multimedia.

[26]  Shrikanth S. Narayanan,et al.  Toward detecting emotions in spoken dialogs , 2005, IEEE Transactions on Speech and Audio Processing.

[27]  Noboru Babaguchi,et al.  Event based indexing of broadcasted sports video by intermodal collaboration , 2002, IEEE Trans. Multim..

[28]  Qi Tian,et al.  A unified framework for semantic shot classification in sports video , 2005, IEEE Trans. Multim..

[29]  Tao Li,et al.  Content-based music similarity search and emotion detection , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[30]  Ofer Hadar,et al.  Optimal transrating via DCT coefficients modification and dropping , 2005, ITRE 2005. 3rd International Conference on Information Technology: Research and Education, 2005..

[31]  Alan Hanjalic,et al.  Affective video content representation and modeling , 2005, IEEE Transactions on Multimedia.

[32]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[33]  Steven McCanne,et al.  An active service framework and its application to real-time multimedia transcoding , 1998, SIGCOMM '98.

[34]  Chabane Djeraba Content-based multimedia indexing and retrieval , 2002, IEEE MultiMedia.

[35]  Avideh Zakhor,et al.  Applications of Video-Content Analysis and Retrieval , 2002, IEEE Multim..