Content-based video retrieval using the shot cluster tree

Content-based video retrieval is a proper solution to handle the video data. But because of their huge volumes and high dimensionality, finding a proper way to organize them for efficient search and retrieval becomes a challenging and important problem. In this paper, we present a framework for content-based video retrieval using the Shot Cluster Tree, while the latter organizes the content of shots in the tree structure. The framework can supply users with two types of queries: query-by-content and query-by-example. We also put up a novel and efficient clustering method for generating the Shot Cluster Tree structure, where shots which are visual similar and time adjacent are grouped into shot groups using the method of Sliding Shot Window firstly and then shot groups are clustered into shot clusters with the simple agglomerative hierarchical clustering method. In addition to constructing the structure of content, the Shot Cluster Tree also provides better ways to video summary and video annotation, which facilitate the video retrieval. An experimental system has been built up. Experiments verify the effectiveness of the proposed approach.

[1]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[2]  Jordi Vitrià,et al.  Linking Visual Cues and Semantic Terms Under Specific Digital Video Domains , 2000, J. Vis. Lang. Comput..

[3]  Paul England,et al.  Comparison of automatic video segmentation algorithms , 1996, Other Conferences.

[4]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[5]  Shih-Fu Chang,et al.  Clustering methods for video browsing and annotation , 1996, Electronic Imaging.

[6]  Milan Petkovic,et al.  Content-Based Video Retrieval , 2004, The Springer International Series in Engineering and Computer Science.

[7]  Thomas S. Huang,et al.  Exploring video structure beyond the shots , 1998, Proceedings. IEEE International Conference on Multimedia Computing and Systems (Cat. No.98TB100241).

[8]  Ramesh C. Jain,et al.  Similarity indexing: algorithms and performance , 1996, Electronic Imaging.

[9]  Boon-Lock Yeo,et al.  Extracting story units from long programs for video browsing and navigation , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[10]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.