Matching pursuit based robust acoustic event classification for surveillance systems

Acoustic event classification using matching pursuit and random forest.Event based feature extraction using atom time and frequency information.Superior in classification of human scream from other classes. Display Omitted The ability to automatically recognize acoustic events in real world conditions is an important application of the surveillance systems. This paper presents an acoustic event classification (AEC) method which uses the Matching Pursuit to extract the important Gabor atoms from input audio signals. Rather than extracting features in short-time frames, we apply the matching pursuit to the whole duration of an acoustic event. Information from atoms, such as time, frequency, and amplitude are used to construct time-frequency features. These features capture both spectral and temporal information of the sound event, which is analogous to the spectrogram representation. Experiments were performed on an 8-class database including human scream and gunshot. Under noisy and mismatched conditions, the proposed classification method achieves F1-score of 0.814, which is superior to state-of-the-art methods.

[1]  Michel Vacher,et al.  Sound Environment Analysis in Smart Home , 2012, AmI.

[2]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[3]  Sacha Krstulovic,et al.  Mptk: Matching Pursuit Made Tractable , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[4]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[5]  Shrikanth Narayanan,et al.  Environmental Sound Recognition With Time–Frequency Audio Features , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Dan Stowell,et al.  Detection and classification of acoustic scenes and events: An IEEE AASP challenge , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[8]  Asma Rabaoui,et al.  Using One-Class SVMs and Wavelets for Audio Surveillance , 2008, IEEE Transactions on Information Forensics and Security.

[9]  Zvi Kons,et al.  Audio event classification using deep neural networks , 2013, INTERSPEECH.

[10]  Chang-Hong Lin,et al.  Gabor-Based Nonuniform Scale-Frequency Map for Environmental Sound Classification in Home Automation , 2014, IEEE Transactions on Automation Science and Engineering.

[11]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[12]  Jörn Anemüller,et al.  Spectro-Temporal Gabor Filterbank Features for Acoustic Event Detection , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[13]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[14]  Dan Stowell,et al.  Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning , 2014, PeerJ.

[15]  Andrzej Czyzewski,et al.  Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations , 2015, Multimedia Tools and Applications.

[16]  Huy Phan,et al.  Random Regression Forests for Acoustic Event Detection and Classification , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[17]  JongSuk Choi,et al.  An Environmental Sound Source Classification System Based on Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models , 2012 .

[18]  Vincent Fontaine,et al.  AUTOMATIC CLASSIFICATION OF ENVIRONMENTAL NOISE EVENTS BY HIDDEN MARKOV MODELS , 1998 .

[19]  Jingchang Huang,et al.  A Robust Feature Extraction Algorithm for the Classification of Acoustic Targets in Wild Environments , 2015, Circuits Syst. Signal Process..

[20]  Jozef Juhár,et al.  Feature selection for acoustic events detection , 2013, Multimedia Tools and Applications.

[21]  JongSuk Choi,et al.  Sound Detection and Localization in Windy Conditions for Intelligent Outdoor Security Cameras , 2016, Circuits Syst. Signal Process..

[22]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[23]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.