Reducing False Positives in Video Shot Detection Using Learning Techniques

Video has become an interactive medium of daily use today. However, the sheer volume of the data makes it extremely difficult to browse and find required information. Organizing the video and locating required information effectively and efficiently presents a great challenge to the video retrieval community. This demands a tool which would break down the video into smaller and manageable units called shots. Traditional shot detection methods use pixel difference, histograms, or temporal slice analysis to detect hard-cuts and gradual transitions. However, systems need to be robust to sequences that contain dramatic illumination changes, shaky camera effects, and special effects such as fire, explosion, and synthetic screen split manipulations. Traditional systems produce false positives for these cases; i.e., they claim a shot break when there is none. We propose a shot detection system which reduces false positives even if all the above effects are cumulatively present in one sequence. Similarities between successive frames are computed by finding the correlation and is further analyzed using a wavelet transformation. A final filtering step is to use a trained Support Vector Machine (SVM). As a result, we achieve better accuracy (while retaining speed) in detecting shot-breaks when compared with other techniques.

[1]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[2]  Hanqing Lu,et al.  Avoiding false alarms due to illumination variation in shot detection , 2000, 2000 IEEE Workshop on SiGNAL PROCESSING SYSTEMS. SiPS 2000. Design and Implementation (Cat. No.00TH8528).

[3]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[4]  Brian V. Funt,et al.  Color Constant Color Indexing , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Shih-Fu Chang,et al.  A Framework for Sub-Window Shot Detection , 2005, 11th International Multimedia Modelling Conference.

[6]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[7]  Nilesh V. Patel,et al.  Video shot detection and characterization for video databases , 1997, Pattern Recognit..

[8]  Yap-Peng Tan,et al.  An effective post-refinement method for shot boundary detection , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[9]  Mubarak Shah,et al.  Scene detection in Hollywood movies and TV shows , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[10]  Christian Petersohn Fraunhofer HHI at TRECVID 2004: Shot Boundary Detection System , 2004, TRECVID.

[11]  T. Vlachos Cut detection in video sequences using phase correlation , 2000, IEEE Signal Processing Letters.

[12]  Dong-Sik Jang,et al.  Gradual shot boundary detection using localized edge blocks , 2006, Multimedia Tools and Applications.

[13]  Chengcui Zhang,et al.  PixSO: a system for video shot detection , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[14]  Ramesh C. Jain,et al.  Knowledge-guided parsing in video databases , 1993, Electronic Imaging.

[15]  André Zaccarin,et al.  A system for reliable dissolve detection in videos , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[16]  Subutai Ahmad,et al.  Analysis-by-synthesis dissolve detection , 2002, Proceedings. International Conference on Image Processing.

[17]  Chong-Wah Ngo,et al.  Detection of gradual transitions through temporal slice analysis , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[18]  Xu De,et al.  A solution to illumination variation problem in shot detection , 2004, 2004 IEEE Region 10 Conference TENCON 2004..

[19]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[20]  John R. Smith,et al.  IBM Research TRECVID-2009 Video Retrieval System , 2009, TRECVID.