On-line adaptive background modelling for audio surveillance

We investigate the problem of automatic audio surveillance. Audio surveillance extends the more widely studied area of video surveillance and can provide information that helps solve many problems in real situations. As in video surveillance, it is necessary to build a background (BG) model so that foreground (FG) events can be detected immediately. To this end, we first introduce the concepts of audio BG and FG in an automated surveillance scenario. We then propose a novel audio BG modelling system that builds an adaptive model of the audio scene BG in real time and promptly detects unexpected FG auditory events. The method is based on probabilistic modelling of the audio data stream using separate sets of adaptive Gaussian mixture models operating on the audio-frequency spectrum. The approach uses only one microphone and works on-line, so it can be used directly in real situations, including in support of a video surveillance system. Preliminary results show that the approach is effective at detecting different FG audio situations.
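To make the idea of per-band adaptive mixtures concrete, the following is a minimal sketch and not the system evaluated here. It assumes one small mixture of 1-D Gaussians per frequency band, fed with the log energy of that band, and it borrows a simplified Stauffer-Grimson-style update from video background modelling as a stand-in for the actual update rule. The names `band_log_energies`, `BandGMM` and `AudioBackgroundModel`, and all parameter values (`k`, `alpha`, `init_var`, `min_var`, `match_sigmas`, `bg_ratio`, `n_bands`), are hypothetical choices made for illustration.

```python
import numpy as np

EPS = 1e-10  # keeps the log finite for silent bands


def band_log_energies(frame, n_bands=8):
    """Log of the mean spectral power in n_bands roughly equal-width bands."""
    power = np.abs(np.fft.rfft(frame)) ** 2
    return np.array([np.log(b.mean() + EPS) for b in np.array_split(power, n_bands)])


class BandGMM:
    """On-line mixture of k 1-D Gaussians for one band (illustrative only)."""

    def __init__(self, k=3, alpha=0.05, init_var=4.0, min_var=0.25,
                 match_sigmas=2.5, bg_ratio=0.7):
        self.alpha, self.init_var, self.min_var = alpha, init_var, min_var
        self.match_sigmas, self.bg_ratio = match_sigmas, bg_ratio
        self.w = np.full(k, 1.0 / k)
        self.mu = None  # set from the first sample
        self.var = np.full(k, init_var)

    def update(self, x):
        """Adapt to one sample; return True if the sample looks like FG."""
        if self.mu is None:
            self.mu = np.full(len(self.w), float(x))
        # Rank components: heavy, narrow components are the most BG-like.
        order = np.argsort(-self.w / np.sqrt(self.var), kind="stable")
        # BG set: top-ranked components whose cumulative weight exceeds bg_ratio.
        cum = np.cumsum(self.w[order])
        n_bg = int(np.searchsorted(cum, self.bg_ratio, side="right")) + 1
        dist = np.abs(x - self.mu[order]) / np.sqrt(self.var[order])
        hits = np.nonzero(dist < self.match_sigmas)[0]
        if hits.size:
            m = order[hits[0]]
            is_fg = bool(hits[0] >= n_bg)  # matched, but not a BG component
            d = x - self.mu[m]
            # Simplification: the same learning rate alpha for all parameters.
            self.mu[m] += self.alpha * d
            self.var[m] = max(self.min_var,
                              (1 - self.alpha) * self.var[m] + self.alpha * d * d)
            matched = np.zeros(len(self.w))
            matched[m] = 1.0
            self.w = (1 - self.alpha) * self.w + self.alpha * matched
        else:
            is_fg = True
            m = order[-1]  # replace the least BG-like component
            self.mu[m], self.var[m], self.w[m] = x, self.init_var, self.alpha
        self.w /= self.w.sum()
        return is_fg


class AudioBackgroundModel:
    """One BandGMM per band; a frame is FG if any band reports FG."""

    def __init__(self, n_bands=8):
        self.n_bands = n_bands
        self.bands = [BandGMM() for _ in range(n_bands)]

    def update(self, frame):
        feats = band_log_energies(frame, self.n_bands)
        flags = [g.update(x) for g, x in zip(self.bands, feats)]
        return any(flags)
```

Under these assumptions, each incoming frame from the single microphone is passed to `AudioBackgroundModel.update`, which both adapts the model and returns the FG decision. A sound that persists long enough gains weight in its band's mixture and is eventually absorbed into the BG, which is the adaptive behaviour the abstract describes.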
