Scene change detection is an essential step toward automatic, content-based video indexing, retrieval and browsing. In this paper, a robust scene change detection and classification approach is presented. It analyzes audio, visual and textual sources and accounts for their inter-relations and coincidence to semantically identify and classify video scenes. Audio analysis segments the audio stream into four semantic classes: silence, speech, music and environmental sound. Speech segments are then processed further to locate speaker changes. Video analysis partitions the visual stream into shots. Text analysis provides a supplemental source of clues for scene classification and indexing. We integrate the video and audio analysis results to identify video scenes, and use text detected by video OCR or derived from available transcripts to refine scene classification. Segmentation based on a single source is in some cases suboptimal. By combining visual and aural features with the auxiliary text information, scene extraction accuracy is enhanced and more semantically meaningful segmentations are obtained. Experimental results are rather promising.
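The integration of video and audio analysis results can be illustrated with a minimal sketch. The abstract does not specify the fusion rule, so the rule used here is an assumption made for illustration only: a shot cut is kept as a scene change candidate when an audio change point (a class change or a speaker change) falls within a small time window around it. The function name `fuse_boundaries`, the `tolerance` parameter and the sample timestamps are all hypothetical.

```python
from bisect import bisect_left


def fuse_boundaries(shot_cuts, audio_changes, tolerance=0.5):
    """Keep shot cuts that coincide with an audio change.

    Illustrative assumption, not the paper's stated method: a shot cut
    (in seconds) counts as a scene change candidate if any audio change
    point lies within `tolerance` seconds of it.
    """
    audio_sorted = sorted(audio_changes)
    scene_changes = []
    for cut in sorted(shot_cuts):
        # Index of the first audio change at or after (cut - tolerance).
        i = bisect_left(audio_sorted, cut - tolerance)
        # That change coincides with the cut if it is also <= cut + tolerance.
        if i < len(audio_sorted) and audio_sorted[i] <= cut + tolerance:
            scene_changes.append(cut)
    return scene_changes


# Hypothetical timestamps in seconds.
shot_cuts = [10.0, 25.0, 40.0]
audio_changes = [9.8, 31.0, 40.3]
candidates = fuse_boundaries(shot_cuts, audio_changes)
print(candidates)
```

With these sample values, the cut at 25.0 s is dropped because no audio change occurs near it, which mirrors the idea that a single source alone can over-segment. Text cues from OCR or transcripts could then be used to label or refine the surviving candidates.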