THU-ICRC at rush summarization of TRECVID 2007

In this paper, we describe the THU-ICRC system for the rush summarization task of TRECVID07. Our main objective is to abstract a minimal length rush video by removing useless (or low-quality) and redundant frames and reserving important objects and events by video parsing, cast indexing and important factor analysis. In detail, by video parsing and cast indexing, our approach first constructs story boards to let user know about the main scenes and main actors in the video. Then it detects and removes useless frames, e.g. color bar, near-monochrome/ abrupt/shaking frames, and clap boards etc. Finally, we construct the video skimming by key frame clustering, important factor analysis and repetitive segment detection. Particularly, by the two-stage redundancy removing in both key frame level and video sequence level, we achieve a better performance to shorten the video length. Extensive experiments were carried out on 42 testing videos. Good results demonstrate the effectiveness of the proposed method.

[1]  Si Wu,et al.  Video quality classification based home video segmentation , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[2]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[3]  Shingo Uchihashi,et al.  Video Manga: generating semantically meaningful video summaries , 1999, MULTIMEDIA '99.

[4]  Mubarak Shah,et al.  Detection and representation of scenes in videos , 2005, IEEE Transactions on Multimedia.

[5]  Jun Wu,et al.  Tsinghua University at TRECVID 2004: Shot Boundary Detection and High-Level Feature Extraction , 2004, TRECVID.

[6]  Tao Wang,et al.  Cast indexing for videos by NCuts and page ranking , 2007, CIVR '07.

[7]  Mubarak Shah,et al.  Scene detection in Hollywood movies and TV shows , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[8]  Peng Wang,et al.  Scene Segmentation and Categorization Using NCuts , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Xin Liu,et al.  Video summarization with minimal visual content redundancies , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[10]  Paul Over,et al.  The trecvid 2007 BBC rushes summarization evaluation pilot , 2007, TVS '07.

[11]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[12]  Yuan Li,et al.  Robust Head Tracking with Particles Based on Multiple Cues Fusion , 2006, ECCV Workshop on HCI.

[13]  Aggelos K. Katsaggelos,et al.  Rate-distortion optimal video summary generation , 2005, IEEE Transactions on Image Processing.

[14]  Masaharu Ogawa,et al.  A highlight scene detection and video summarization system using audio feature for a personal video recorder , 2005, IEEE Transactions on Consumer Electronics.