Clip based video summarization and ranking

In this paper, we present a new algorithm for video clip summarization and ranking, which is mainly based on a clip based video similarity measure and the affinity propagation clustering (AP) algorithm. We propose a proportional max-weighted bipartite matching algorithm for clip similarity measure. This method first generates a basic frame set and a corresponding proportion value set from each clip. Then it models two clips as a weighted bipartite graph, where the weight values are determined by both the direct frame similarities and the proportion values. Then the max-weighted bipartite matching is employed to measure the similarity between two clips. This method achieves good retrieval performance when the length of two clips varies greatly. With these clip similarities, clips are clustered using affinity propagation. The clips in one cluster generally describe the same video event. Video ranking is based on the cluster size and the average information entropy of each event. Experimental results are given to illustrate the proposed algorithm.

[1]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[2]  Wei Xiong,et al.  Query by video clip , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[3]  Sang Hyun Kim,et al.  An efficient algorithm for video sequence matching using the modified Hausdorff distance and the directed divergence , 2002, IEEE Trans. Circuits Syst. Video Technol..

[4]  Ishwar K. Sethi,et al.  Video clip recognition using joint audio-visual processing model , 2002, Object recognition supported by user interaction for service robots.

[5]  J. Ponce,et al.  Segmenting, modeling, and matching video clips containing multiple moving objects , 2004, CVPR 2004.

[6]  Seth Teller,et al.  Video matching , 2004, SIGGRAPH 2004.

[7]  Qi Tian,et al.  Fast and robust search method for short video clips from large video collection , 2004, ICPR 2004.

[8]  John M. Gauch,et al.  Identification of new commercials using repeated video sequence detection , 2005, IEEE International Conference on Image Processing 2005.

[9]  Alan Hanjalic,et al.  Adaptive extraction of highlights from a sport video based on excitement modeling , 2005, IEEE Transactions on Multimedia.

[10]  Chong-Wah Ngo,et al.  Video summarization and scene detection by graph modeling , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Mohan S. Kankanhalli,et al.  Automatic music video summarization based on audio-visual-text analysis and alignment , 2005, SIGIR '05.

[12]  Chong-Wah Ngo,et al.  Hot Event Detection and Summarization by Graph Modeling and Matching , 2005, CIVR.

[13]  Olivier Buisson,et al.  Content-based video copy detection in large databases: a local fingerprints statistical similarity search approach , 2005, IEEE International Conference on Image Processing 2005.

[14]  Tat-Seng Chua,et al.  Retrieval of News Video Using Video Sequence Matching , 2005, 11th International Multimedia Modelling Conference.

[15]  Aggelos K. Katsaggelos,et al.  Rate-distortion optimal video summary generation , 2005, IEEE Transactions on Image Processing.

[16]  Jung-Hwan Oh,et al.  Scenario based dynamic video abstractions using graph matching , 2005, MULTIMEDIA '05.

[17]  Silvio Jamil Ferzoli Guimarães,et al.  Counting of Video Clip Repetitions using a Modified BMH Algorithm: Preliminary Results , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[18]  Tiecheng Liu,et al.  Content-Adaptive Video Summarization Combining Queueing and Clustering , 2006, 2006 International Conference on Image Processing.

[19]  Chia-Wen Lin,et al.  Fast coarse-to-fine video retrieval using shot-level spatio-temporal statistics , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Yuxin Peng,et al.  Clip-based similarity measure for query-dependent clip retrieval and video summarization , 2006, IEEE Trans. Circuits Syst. Video Technol..

[21]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[22]  Cordelia Schmid,et al.  Segmenting, Modeling, and Matching Video Clips Containing Multiple Moving Objects , 2007, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Janko Calic,et al.  Efficient Layout of Comic-Like Video Summaries , 2007, IEEE Transactions on Circuits and Systems for Video Technology.