Paper information - Audio as a support to scene change detection and characterization of video sequences

Organizing video information is a challenging problem in the construction of video databases. Algorithms that can organize video information according to the semantic content of the data are becoming increasingly important, because they allow tasks such as indexing and retrieval to work more efficiently. Until now, attempts to extract semantic information have relied on video information alone. Because a video sequence is a 2-D projection of a 3-D scene, video processing has shown its limitations, especially in problems such as object identification and object tracking, which reduces the ability to extract semantic characteristics. One way to overcome this problem is to use additional information, and the associated audio signal is the most natural source of it. This paper presents a technique that combines video and audio information for classification and indexing purposes. The classification is performed on the audio signal; a general framework that uses the results of this classification is then proposed for organizing video information.

Riccardo Leonardi | Caterina Saraceno
