Interactive visualization of video content and associated description for semantic annotation

In this paper, we present an intuitive graphic framework for the effective visualization of video content and its associated audio-visual description, with the aim of facilitating quick understanding and annotation of the semantic content of a video sequence. The basic idea is to visualize a 2D feature space in which the shots of the considered video sequence are located. Moreover, the temporal position and the specific content of each shot can be displayed and analysed in more detail. The features are selected by the user and can be updated during the navigation session. In the main window, the shots are displayed in a Cartesian plane, and the proposed environment offers various functionalities for automatically and semi-automatically finding and annotating the shot clusters in this feature space. With this tool, the user can therefore explore graphically how the basic segments of a video sequence are distributed in the feature space, and can recognize and annotate the significant clusters and their structure. The experimental results show that browsing and annotating documents with the aid of the proposed visualization paradigms is easy and quick, since the user has fast and intuitive access to the audio-video content, even if he or she has not yet seen the document.
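As a rough illustration of the idea of locating shots in a 2D feature space, grouping them, and annotating a whole group at once, the following minimal sketch may help. It is not the framework described here: the abstract does not name a clustering method, so plain k-means is used as a stand-in, and the `Shot` class, the `kmeans` and `annotate_cluster` helpers, the feature values, and the "studio" label are all hypothetical.

```python
from dataclasses import dataclass
from math import dist


@dataclass
class Shot:
    index: int        # temporal position of the shot in the sequence
    x: float          # value of the first user-selected feature
    y: float          # value of the second user-selected feature
    label: str = ""   # semantic annotation, empty until assigned


def kmeans(shots, k, iterations=20):
    """Group shots by their (x, y) position; returns one cluster id per shot."""
    # Deterministic initialisation (an assumption): first k shots as centroids.
    centroids = [(s.x, s.y) for s in shots[:k]]
    assignment = [0] * len(shots)
    for _ in range(iterations):
        # Assign each shot to its nearest centroid.
        assignment = [
            min(range(k), key=lambda c: dist((s.x, s.y), centroids[c]))
            for s in shots
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [s for s, a in zip(shots, assignment) if a == c]
            if members:
                centroids[c] = (
                    sum(s.x for s in members) / len(members),
                    sum(s.y for s in members) / len(members),
                )
    return assignment


def annotate_cluster(shots, assignment, cluster_id, label):
    """Apply one user-supplied label to every shot in the chosen cluster."""
    for s, a in zip(shots, assignment):
        if a == cluster_id:
            s.label = label


# Hypothetical feature values for six shots (not taken from the paper).
shots = [
    Shot(0, 0.10, 0.20), Shot(1, 0.90, 0.80), Shot(2, 0.15, 0.25),
    Shot(3, 0.85, 0.90), Shot(4, 0.12, 0.18), Shot(5, 0.95, 0.85),
]
assignment = kmeans(shots, k=2)
# Label every shot that falls in the same cluster as shot 0.
annotate_cluster(shots, assignment, assignment[0], "studio")
```

In an interactive setting, the user would presumably pick the cluster by clicking in the Cartesian plane rather than by shot index, and changing the two selected features would simply mean recomputing `x` and `y` before clustering again.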
