A practical method for video scene segmentation

Video segmentation is a crucial pass to content-based video summarization and retrieval. In this paper, we present a practical method to efficiently group video content into semantic segments. First we detect shots with double-threshold method to find raw shots quickly, followed by redundant frames removal though spatial color distribution to get the key frames. Finally, we cluster the key frames using the inter-shot correlation via domain color histogram and motion intensity to get the final scenes.

[1]  Peng Wang,et al.  Scene Segmentation and Categorization Using NCuts , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Bin Li,et al.  Scene segmentation based on video structure and spectral methods , 2008, 2008 10th International Conference on Control, Automation, Robotics and Vision.

[3]  Wallapak Tavanapong,et al.  Shot clustering techniques for story browsing , 2004, IEEE Transactions on Multimedia.

[4]  Zhu Liu,et al.  Integration of audio and visual information for content-based video segmentation , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[5]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.