Fast shot segmentation combining global and local visual descriptors

This paper introduces an algorithm for fast temporal segmentation of videos into shots. The proposed method detects abrupt and gradual transitions, based on the visual similarity of neighboring frames of the video. The descriptive efficiency of both local (SURF) and global (HSV histograms) descriptors is exploited for assessing frame similarity, while GPU-based processing is used for accelerating the analysis. Specifically, abrupt transitions are initially detected between successive video frames where there is a sharp change in the visual content, which is expressed by a very low similarity score. Then, the calculated scores are further analysed for the identification of frame-sequences where a progressive change of the visual content takes place and, in this way gradual transitions are detected. Finally, a post-processing step is performed aiming to identify outliers due to object/camera movement and flash-lights. The experiments show that the proposed algorithm achieves high accuracy while being capable of faster-than-real-time analysis.

[1]  Joni-Kristian Kämäräinen,et al.  Video Shot Boundary Detection using Visual Bag-of-Words , 2013, VISAPP.

[2]  Li Huan,et al.  A Method for Fast Shot Boundary Detection Based on SVM , 2008, 2008 Congress on Image and Signal Processing.

[3]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[4]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[5]  Ramin Zabih,et al.  Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[6]  Wei Li,et al.  A Divide-And-Rule Scheme For Shot Boundary Detection Based on SIFT , 2010, J. Digit. Content Technol. its Appl..

[7]  Junaid Baber,et al.  Shot boundary detection from videos using entropy and local descriptor , 2011, 2011 17th International Conference on Digital Signal Processing (DSP).

[8]  Ying Liu,et al.  A method of shot detection based on color and edge features , 2009, 2009 1st IEEE Symposium on Web Society.

[9]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying production effects , 1999, Multimedia Systems.

[10]  Yiannis Kompatsiaris,et al.  Gradual transition detection using color coherence and other criteria in a video shot meta-segmentation framework , 2008, 2008 15th IEEE International Conference on Image Processing.

[11]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[12]  Liu Liu,et al.  A Novel Shot Segmentation Algorithm Based on Motion Edge Feature , 2010, 2010 Symposium on Photonics and Optoelectronics.

[13]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[14]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[15]  Nikolas P. Galatsanos,et al.  Simultaneous detection of abrupt cuts and dissolves in videos using support vector machines , 2009, Pattern Recognit. Lett..