Automatic story segmentation of news video based on audio-visual features and text information

In this paper a novel news story automatic segmentation scheme based on audio-visual features and text information is presented. The basic idea is to detect the shot boundaries for news video first, and then the topic-caption frames are identified to get segmentation cues by using text detection algorithm. In the next step, silence clips are detected by using short-time energy and short-time average zero-crossing rate (ZCR) parameters. At last, audio-visual features and text information are integrated to realize automatic story segmentation. On test data with 135, 400 frames, the accuracy rate 85.8% and the recall rate 97.5% are obtained. The experimental results show the approach is valid and robust.

[1]  Young-Min Kim,et al.  Fast Scene Change Detection using Direct Feature Extraction from MPEG Compressed Videos , 2000, IEEE Trans. Multim..

[2]  King Ngi Ngan,et al.  Post shot boundary detection technique: flashlight scene determination , 1999, ISSPA '99. Proceedings of the Fifth International Symposium on Signal Processing and its Applications (IEEE Cat. No.99EX359).

[3]  David Bull,et al.  Scene change detection algorithms for content-based video indexing and retrieval , 2001 .

[4]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[5]  Boon-Lock Yeo,et al.  Extracting story units from long programs for video browsing and navigation , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[6]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.