Enhanced shot boundary detection using video text information

Shot boundary detection is a pre-requisite process for content-based video indexing and retrieval, an application domain that has attracted much research and consumer attentions due to the explosive growth of video data available and accessible to users worldwide. Currently, a number of edge-based techniques have been proposed in the literature for detecting abrupt shot boundaries to avoid the influence of flashlights common in many video types, such as sports, news, entertainment, and interviews videos. However, these techniques are susceptible to miss and false detections of abrupt shot changes. Our study shows that one main reason for these errors is due to the presence of superimposed texts that are common in various video genres. To address this problem, we present in this paper an efficient method that utilizes the edge type - text-edge (edge in text region) or non-text-edge (edge in non-text region) - to reduce erroneous detection due to the sudden appearance and disappearance of superimposed text. Extensive experiments have been conducted and the results show that our proposed method, as compared with other existing methods, can obtain better performance in detecting abrupt shot boundaries and differentiating them from the effects of superimposed text.

[1]  Ellen K. Hughes,et al.  Video OCR for Digital News Archives , 1998 .

[2]  Robert Burgin,et al.  Performance Standards and Evaluations in IR Test Collections: Cluster-Based Retrieval Models , 1997, Inf. Process. Manag..

[3]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Edward K. Wong,et al.  A new robust algorithm for video text extraction , 2003, Pattern Recognit..

[5]  King Ngi Ngan,et al.  Post shot boundary detection technique: flashlight scene determination , 1999, ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359).

[6]  Riccardo Leonardi,et al.  Scene break detection: a comparison , 1998, Proceedings Eighth International Workshop on Research Issues in Data Engineering. Continuous-Media Databases and Applications.

[7]  Jun Yu,et al.  An efficient method for scene cut detection , 2001, Pattern Recognit. Lett..

[8]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying production effects , 1999, Multimedia Systems.

[9]  Slobodan Ribarić,et al.  Introduction to Pattern Recognition , 1988 .

[10]  Anil K. Jain,et al.  Introduction to Pattern Recognition , 2007 .

[11]  Michael R. Lyu,et al.  A new approach for video text detection , 2002, Proceedings. International Conference on Image Processing.

[12]  Jim R. Parker,et al.  Algorithms for image processing and computer vision , 1996 .

[13]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[14]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[15]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[16]  Azriel Rosenfeld,et al.  Compressed Domain Video Segmentation , 1996 .

[17]  Borko Furht,et al.  Video and Image Processing in Multimedia Systems , 1995 .

[18]  Chung-Lin Huang,et al.  A robust scene-change detection method for video segmentation , 2001, IEEE Trans. Circuits Syst. Video Technol..

[19]  Anil K. Jain,et al.  Locating text in complex color images , 1995, Pattern Recognit..

[20]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[21]  Edward M. Riseman,et al.  TextFinder: An Automatic System to Detect and Recognize Text In Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Ioannis Pitas,et al.  Shot detection in video sequences using entropy based metrics , 2002, Proceedings. International Conference on Image Processing.