Unsupervised Detection of Gradual Video Shot Changes with Motion-Based False Alarm Removal

The temporal segmentation of a video into shots is a fundamental prerequisite for video retrieval. There are two types of shot boundaries: abrupt shot changes (“cuts”) and gradual transitions. Several high-quality algorithms have been proposed for detecting cuts, but the reliable detection of gradual transitions remains a surprisingly difficult problem in practice. In this paper, we present an unsupervised approach for detecting gradual transitions. It has several advantages. First, in contrast to alternative approaches, it requires no training stage and hence no training data. Second, no thresholds are needed, since the clustering approach it uses separates the classes of gradual transitions and non-transitions automatically and adaptively for each video. Third, it is a generic approach that does not employ a specialized detector for each transition type. Finally, it addresses the removal of false alarms caused by camera motion: in contrast to related approaches, this step is based not only on low-level features but also on the results of an appropriate camera motion estimation algorithm. Experimental results show that the proposed approach achieves very good performance on TRECVID shot boundary test data.
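
To illustrate how clustering can take the place of a fixed threshold, the following is a minimal sketch and not the method of the paper. The abstract does not name the clustering algorithm or the features used, so the choice of k-means with two clusters, the one-dimensional per-candidate score, and the function name `two_means_1d` are all assumptions made for illustration.

```python
def two_means_1d(values, max_iter=100):
    """Split 1-D scores into a low and a high cluster using k-means with k=2.

    Hypothetical illustration: each value stands for a dissimilarity score of
    one transition candidate. Returns a list of booleans, True for members of
    the high-score cluster (the assumed "gradual transition" class).
    """
    lo, hi = min(values), max(values)
    if lo == hi:
        # All scores are identical, so there is nothing to separate.
        return [False] * len(values)

    labels = []
    for _ in range(max_iter):
        # Assignment step: a value joins the high cluster if it is strictly
        # closer to the high centroid than to the low centroid.
        labels = [abs(v - hi) < abs(v - lo) for v in values]
        low = [v for v, is_high in zip(values, labels) if not is_high]
        high = [v for v, is_high in zip(values, labels) if is_high]

        # Update step: recompute both centroids as cluster means.
        new_lo = sum(low) / len(low)
        new_hi = sum(high) / len(high)
        if new_lo == lo and new_hi == hi:
            break  # Centroids stopped moving: converged.
        lo, hi = new_lo, new_hi
    return labels


# Made-up scores for seven candidates of a single video.
scores = [0.05, 0.07, 0.04, 0.62, 0.58, 0.06, 0.71]
is_transition = two_means_1d(scores)
```

Because the centroids are re-estimated from the scores of each video, the boundary between the two classes adapts per video instead of being fixed in advance, which is the property the abstract attributes to the clustering step.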
