An efficient graph theoretic approach to video scene clustering

In this paper, a graph theoretic approach is presented for video scene clustering in sports and news videos. In the approach, video shots are grouped into clusters of similar scenes based on shot color attributes. First, the similarity between video shots is measured by shot color histogram intersection. To obtain scene likeness matrix in a maximum-a-posterior (MAP) probability manner, a thresholding method is proposed on shot similarity matrix. Then, a graph is constructed based on the scene likeness that similar shots have an edge between them. Based on the constructed graph, a graph partitioning method is proposed to cluster video shots into different scenes such that the connectivity of video shots within one cluster is higher than that between different clusters. The advantage of the graph partitioning method is that the cluster number need not to be known as a prior. The graph partitioning method is compared with conventional k-means clustering method; in which the cluster number is determined by the cluster validity measure. The main contributions of the paper lie in the formulation of video scene clustering to a graph partitioning problem, and the comparison with the conventional k-means clustering method. Experimental results are presented to show the effectiveness and efficiency of the proposed graph theoretic approach.

[1]  Dragutin Petkovic,et al.  Content-based representation and retrieval of visual media: A state-of-the-art review , 1996, Multimedia Tools and Applications.

[2]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[3]  Daniel B. Szyld,et al.  A Block Ordering Method for Sparse Matrices , 1990, SIAM J. Sci. Comput..

[4]  Boon-Lock Yeo,et al.  Extracting story units from long programs for video browsing and navigation , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[5]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[6]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[7]  Kohji Fukunaga,et al.  Introduction to Statistical Pattern Recognition-Second Edition , 1990 .

[8]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[9]  A. Murat Tekalp,et al.  Robust color histogram descriptors for video segment retrieval and identification , 2002, IEEE Trans. Image Process..

[10]  Shih-Fu Chang,et al.  Algorithms and system for segmentation and structure analysis in soccer video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  Pedro Larrañaga,et al.  An empirical comparison of four initialization methods for the K-Means algorithm , 1999, Pattern Recognit. Lett..

[12]  Sanjeev R. Kulkarni,et al.  Rapid estimation of camera motion from compressed video with application to video annotation , 2000, IEEE Trans. Circuits Syst. Video Technol..

[13]  C.-C. Jay Kuo,et al.  Color distribution analysis and quantization for image retrieval , 1996, Electronic Imaging.

[14]  Yihong Gong,et al.  Automatic parsing of news video , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[15]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.