Video personalization and summarization system for usage environment

A video personalization and summarization system is designed and implemented incorporating usage environment to dynamically generate a personalized video summary. The personalization system adopts the three-tier server-middleware-client architecture in order to select, adapt, and deliver rich media content to the user. The server stores the content sources along with their corresponding MPEG-7 metadata descriptions. Our semantic metadata is provided through the use of the VideoAnnEx MPEG-7 Video Annotation Tool. When the user initiates a request for content, the client communicates the MPEG-21 usage environment description along with the user query to the middleware. The middleware is powered by the personalization engine and the content adaptation engine. Our personalization engine includes the VideoSue Summarization on Usage Environment engine that selects the optimal set of desired contents according to user preferences. Afterwards, the adaptation engine performs the required transformations and compositions of the selected contents for the specific usage environment using our VideoEd Editing and Composition Tool. Finally, two personalization and summarization systems are demonstrated for the IBM Websphere Portal Server and for pervasive PDA devices.

[1]  John R. Smith,et al.  VideoAnnEx: IBM MPEG-7 Annotation Tool for Multimedia Indexing and Concept Learning , 2003 .

[2]  John R. Smith,et al.  Universal Tuner: A Video Streaming System for CPU/Power-Constrained Mobile Devices , 2001 .

[3]  John R. Smith,et al.  VideoAL: a novel end-to-end MPEG-7 video automatic labeling system , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[4]  Kiyoharu Aizawa,et al.  Summarizing wearable video , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[5]  John R. Smith,et al.  Video summarization and personalization for pervasive mobile devices , 2001, IS&T/SPIE Electronic Imaging.

[6]  Ching-Yung Lin,et al.  Personalized video summary using visual semantic annotations and automatic speech transcriptions , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[7]  Dragutin Petkovic,et al.  Using audio time scale modification for video browsing , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[8]  John R. Smith,et al.  Hierarchical video summarization based on context clustering , 2003, SPIE ITCom.

[9]  HongJiang Zhang,et al.  A model of motion attention for video skimming , 2002, Proceedings. International Conference on Image Processing.

[10]  Xin Liu,et al.  Summarizing video by minimizing visual content redundancies , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  John R. Smith,et al.  Universal MPEG content access using compressed-domain system stream editing techniques , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[12]  Bernard Mérialdo,et al.  Automatic construction of personalized TV news programs , 1999, MULTIMEDIA '99.

[13]  Boon-Lock Yeo,et al.  Video browsing using clustering and scene transitions on compressed sequences , 1995, Electronic Imaging.

[14]  John R. Smith,et al.  CPU/power-constrained mobile devices , 2001, MULTIMEDIA '01.

[15]  Shih-Fu Chang,et al.  Determining computable scenes in films and their structures using audio-visual memory models , 2000, ACM Multimedia.