Evaluating and combining digital video shot boundary detection algorithms

The development of standards for video encoding coupled with the increased power of computing mean that content-based manipulation of digital video information is now feasible. Shots are a basic structural building block of digital video and the boundaries between shots need to be determined automatically to allow for content-based manipulation. A shot can be thought of as continuous images from one camera at a time. In this paper we examine a variety of automatic techniques for shot boundary detection that we have implemented and evaluated on a baseline of 720,000 frames (8 hours) of broadcast television. This extends our previous work on evaluating a single technique based on comparing colour histograms. A description of each of our three methods currently working is given along with how they are evaluated. It is found that although the different methods have about the same order of magnitude in terms of effectiveness, different shot boundaries are detected by the different methods. We then look at combining the three shot boundary detection methods to produce one output result and the benefits in accuracy and performance that this brought to our system. Each of the methods were changed from using a static threshold value for three unconnected methods to one using three dynamic threshold values for one connected method. In a final summing up we look at the future directions for this work.

[1]  Ken Yap,et al.  FRANK: trialing a system for remote navigation of film archives , 1997, Other Conferences.

[2]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[3]  Arding Hsu,et al.  Feature management for large video databases , 1993, Electronic Imaging.

[4]  Alan F. Smeaton,et al.  The Fischlar Digital Video Recording, Analysis and Browsing System , 2000, RIAO.

[5]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[6]  Alan F. Smeaton,et al.  Evaluation of automatic shot boundary detection on a large video test suite , 1999 .

[7]  Wei Xiong,et al.  Net comparison: a fast and effective method for classifying image sequences , 1995, Electronic Imaging.

[8]  Stephen W. Smoliar,et al.  Content-based video browsing tools , 1995, Electronic Imaging.

[9]  Alan F. Smeaton Independence of Contributing Retrieval Strategies in Data Fusion for Effective Information Retrieval , 1998, BCS-IRSG Annual Colloquium on IR Research.

[10]  Stéphane Marchand-Maillet,et al.  Towards a Standard Protocol for the Evaluation of Video-to-Shots Segmentation Algorithms , 1999 .

[11]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, J. Electronic Imaging.

[12]  Boon-Lock Yeo,et al.  A unified approach to temporal segmentation of motion JPEG and MPEG compressed video , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.

[13]  John P. Oakley,et al.  Storage and Retrieval for Image and Video Databases , 1993 .

[14]  Alan F. Smeaton,et al.  Implementation and Analysis of Several Keyframe-Based Browsing Interfaces to Digital Video , 2000, ECDL.

[15]  Alan F. Smeaton,et al.  An evaluation of alternative techniques for automatic detection of shot boundaries in digital video , 1999 .

[16]  Patrick Bouthemy,et al.  Scene Segmentation and Image Feature Extraction for Video Indexing and Retrieval , 1999, VISUAL.