Video summarization and personalization for pervasive mobile devices

We have designed and implemented a video semantic summarization system, which includes an MPEG-7 compliant annotation interface, a semantic summarization middleware, a real-time MPEG-1/2 video transcoder on PCs, and an application interface on color/black-and-white Palm-OS PDAs. We designed a video annotation tool, VideoAnn, to annotate semantic labels associated with video shots. Videos are first segmentated into shots based on their visual-audio characteristics. They are played back using an interactive interface, which facilitate and fasten the annotation process. Users can annotate the video content with the units of temporal shots or spatial regions. The annotated results are stored in the MPEG-7 XML format. We also designed and implemented a video transmission system, Universal Tuner, for wireless video streaming. This system transcodes MPEG-1/2 videos or live TV broadcasting videos to the BW or indexed color Palm OS devices. In our system, the complexity of multimedia compression and decompression algorithms is adaptively partitioned between the encoder and decoder. In the client end, users can access the summarized video based on their preferences, time, keywords, as well as the transmission bandwidth and the remaining battery power on the pervasive devices.

[1]  John R. Smith,et al.  Learning to annotate video databases , 2001, IS&T/SPIE Electronic Imaging.

[2]  John R. Smith,et al.  Universal Tuner: A Video Streaming System for CPU/Power-Constrained Mobile Devices , 2001 .

[3]  Kiyoharu Aizawa,et al.  Summarizing wearable video , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[4]  Ying Li,et al.  Semantic video content abstraction based on multiple cues , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[5]  Xin Liu,et al.  Generating optimal video summaries , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[6]  Tomio Echigo,et al.  Meta-data framework for constructing individualized video digest , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[7]  Dragutin Petkovic,et al.  Using audio time scale modification for video browsing , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[8]  Bernard Mérialdo,et al.  Automatic construction of personalized TV news programs , 1999, MULTIMEDIA '99.

[9]  Robin Kravets,et al.  Application‐driven power management for mobile communication , 2000, Wirel. Networks.

[10]  Xin Liu,et al.  Summarizing video by minimizing visual content redundancies , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  Gary J. Sullivan,et al.  USING THE DRAFT H.26L VIDEO CODING STANDARD FOR MOBILE APPLICATIONS , 2001 .

[12]  Kuniaki Uehara,et al.  Video Summarization Based on Semantic Representation , 1998, AMCP.

[13]  Noboru Babaguchi,et al.  Generation of personalized abstract of sports video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[14]  Boon-Lock Yeo,et al.  Video browsing using clustering and scene transitions on compressed sequences , 1995, Electronic Imaging.

[15]  Jianying Hu,et al.  Combined-media video tracking for summarization , 2001, MULTIMEDIA '01.