A Graph Theoretic Approach for Scene Detection in Produced Videos

Efficient video browsing requires indexing of videos so that the users can quickly locate the segments of their interests. While browsing a video, the users often prefer to skim through the video by scenes rather than frames or shots. In general, a scene can be defined by the continuity in the visual contents of shots due to fixed physical setting or by the continuity of the ongoing actions. We exploit this fact and propose a novel approach for clustering shots into scenes by transforming this task into a graph partitioning problem. This is achieved by constructing a weighted undirected graph called a shot similarity graph, SSG, where each node represents a shot and the edges between the shots are weighted by their similarities. Both color and motion information are utilized to compute shot similarities. The SSG is then split into smaller story units by applying the normalized-cut technique for graph partitioning. The proposed approach is robust and produces meaningful temporal segmentation of video, which is useful for applications such as “video on demand”. Experiments are presented with promising results on two Hollywood movies.

[1]  Mubarak Shah,et al.  Scene detection in Hollywood movies and TV shows , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[2]  Alan Hanjalic,et al.  Automated high-level movie segmentation for advanced video-retrieval systems , 1999, IEEE Trans. Circuits Syst. Video Technol..

[3]  Wallapak Tavanapong,et al.  ShotWeave: A Shot Clustering Technique for Story Browsing for Large Video Databases , 2002, EDBT Workshops.

[4]  Boon-Lock Yeo,et al.  Segmentation of Video by Clustering and Graph Analysis , 1998, Comput. Vis. Image Underst..

[5]  D. Arijon,et al.  Grammar of Film Language , 1976 .

[6]  Thomas S. Huang,et al.  Constructing table-of-content for videos , 1999, Multimedia Systems.

[7]  Richard M. Leahy,et al.  An Optimal Graph Theoretic Approach to Data Clustering: Theory and Its Application to Image Segmentation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Sudeep Sarkar,et al.  Supervised Learning of Large Perceptual Organization: Graph Spectral Partitioning and Learning Automata , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Niels Haering,et al.  A framework for the design of event detectors , 1999 .

[10]  Svetha Venkatesh,et al.  Towards automatic extraction of expressive elements from motion pictures: tempo , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[11]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.