Hierarchical video summarization in reference subspace

In this paper, a hierarchical video structure summarization approach using Laplacian Eigenmap is proposed, where a small set of reference frames is selected from the video sequence to form a reference subspace to measure the dissimilarity between two arbitrary frames. In the proposed summarization scheme, the shot-level key frames are first detected from the continuity of inter-frame dissimilarity, and the sub-shot level and scene level representative frames are then summarized by using k-mean clustering. The experiment is carried on both test videos and movies, and the results show that in comparison with a similar approach using latent semantic analysis, the proposed approach using Laplacian Eigenmap can achieve a better recall rate in keyframe detection, and gives an efficient hierarchical summarization at sub shot, shot and scene levels subsequently.

[1]  Danny Crookes,et al.  Low-power systolic array processor architecture for FSBM video motion estimation , 2006 .

[2]  Chia-Hung Yeh,et al.  Techniques for movie content analysis and skimming: tutorial and overview on video abstraction techniques , 2006, IEEE Signal Processing Magazine.

[3]  Malcolm Slaney,et al.  Hierarchical segmentation using latent semantic indexing in scale space , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[4]  Danny Crookes,et al.  FPGA-based minutia matching for biometric fingerprint image database retrieval , 2008, Journal of Real-Time Image Processing.

[5]  Shingo Uchihashi,et al.  Video Manga: generating semantically meaningful video summaries , 1999, MULTIMEDIA '99.

[6]  Danny Crookes,et al.  Advances in Video Summarization and Skimming , 2009 .

[7]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[8]  Michael J. Black,et al.  Summarization of videotaped presentations: automatic analysis of motion and gesture , 1998, IEEE Trans. Circuits Syst. Video Technol..

[9]  Stephen W. Smoliar,et al.  An integrated system for content-based video retrieval and browsing , 1997, Pattern Recognit..

[10]  Yuxiao Hu,et al.  Face recognition using Laplacianfaces , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[12]  A. Hanjalic,et al.  Extracting moods from pictures and sounds: towards truly personalized TV , 2006, IEEE Signal Processing Magazine.

[13]  Fabrice Souvannavong,et al.  Latent semantic indexing for semantic content detection of video shots , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[14]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[15]  Shih-Ping Liou,et al.  Automatic key-frame selection for content-based video indexing and access , 1999, Electronic Imaging.

[16]  Danny Crookes,et al.  Area-efficient high-speed 3D DWT processor architecture , 2007 .

[17]  Yukinobu Taniguchi,et al.  An intuitive and efficient access interface to real-time incoming video based on automatic indexing , 1995, MULTIMEDIA '95.

[18]  Xin Liu,et al.  Video summarization and retrieval using singular value decomposition , 2003, Multimedia Systems.

[19]  Danny Crookes,et al.  Approach to automatic video motion segmentation , 2007 .

[20]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying production effects , 1999, Multimedia Systems.

[21]  Dan Schonfeld,et al.  Statistical sequential analysis for real-time video scene change detection on compressed multimedia bitstream , 2003, IEEE Trans. Multim..

[22]  Andreas Girgensohn,et al.  Time-Constrained Keyframe Selection Technique , 2004, Multimedia Tools and Applications.

[23]  Michael Mills,et al.  A magnifier tool for video data , 1992, CHI.

[24]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[25]  Jeho Nam,et al.  Dynamic video summarization and visualization , 1999, MULTIMEDIA '99.

[26]  Alan Hanjalic,et al.  Shot-boundary detection: unraveled and resolved? , 2002, IEEE Trans. Circuits Syst. Video Technol..

[27]  Mikhail Belkin,et al.  Laplacian Eigenmaps and Spectral Techniques for Embedding and Clustering , 2001, NIPS.

[28]  Oliver Schreer,et al.  User Requirements for Multimedia Indexing and Retrieval of Unedited Audio-Visual Footage - RUSHES , 2008, 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services.

[29]  H. J. Kim,et al.  Kernel principal component analysis for texture classification , 2001, IEEE Signal Processing Letters.

[30]  Xin Liu,et al.  Video summarization using singular value decomposition , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[31]  Wen Gao,et al.  Unsupervised sports video scene clustering and its applications to story units detection , 2005, Visual Communications and Image Processing.