Video shot segmentation using fusion of SVD and mutual information features

A new method for detecting shot boundaries in video sequences by fusing features obtained by singular value decomposition (SVD) and mutual information (MI) is proposed. The first method relies on performing singular value decomposition on a matrix created from 3D color histograms of single frames. The method can detect cuts and gradual transitions, such as dissolves, fades and wipes. The second method relies on evaluating mutual information between two consecutive frames. It can detect abrupt cuts, fade-ins and fade-outs with very high accuracy. A combination of features derived from these methods and subsequent processing through a clustering procedure results in very efficient detection of abrupt cuts and gradual transitions, as demonstrated by experiments on the TRECVID2004 video test set containing different types of shots with significant object and camera motion inside the shots.

[1]  Thomas D. C. Little,et al.  A Survey of Technologies for Parsing and Indexing Digital Video1 , 1996, J. Vis. Commun. Image Represent..

[2]  Ioannis Pitas,et al.  Video shot segmentation using singular value decomposition , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[3]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[4]  Alan Hanjalic,et al.  Shot-boundary detection: unraveled and resolved? , 2002, IEEE Trans. Circuits Syst. Video Technol..

[5]  Ioannis Pitas,et al.  Content-based video parsing and indexing based on audio-visual interaction , 2001, IEEE Trans. Circuits Syst. Video Technol..

[6]  C. Metz Basic principles of ROC analysis. , 1978, Seminars in nuclear medicine.

[7]  Ioannis Pitas,et al.  Shot detection in video sequences using entropy based metrics , 2002, Proceedings. International Conference on Image Processing.

[8]  Ze-Nian Li,et al.  Video dissolve and wipe detection via spatio-temporal images of chromatic histogram differences , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[9]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..

[10]  Paul England,et al.  Comparison of automatic video segmentation algorithms , 1996, Other Conferences.

[11]  Rainer Lienhart,et al.  Reliable dissolve detection , 2001, IS&T/SPIE Electronic Imaging.

[12]  Kanti V. Mardia,et al.  Statistics of Directional Data , 1972 .

[13]  Alberto Del Bimbo,et al.  Visual information retrieval , 1999 .

[14]  Rainer Lienhart,et al.  Comparison of automatic shot boundary detection algorithms , 1998, Electronic Imaging.

[15]  Chung-Lin Huang,et al.  A robust scene-change detection method for video segmentation , 2001, IEEE Trans. Circuits Syst. Video Technol..