Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features

Abstract The variety of event categories and event boundary information have resulted in limited success for acoustic event detection systems. To deal with this, we propose to utilize the long contextual information, low-dimensional discriminant global bottleneck features and category-specific bottleneck features. By concatenating several adjacent frames together, the use of contextual information makes it easier to cope with acoustic signals with long duration. Global and category-specific bottleneck features can extract the prior knowledge of the event category and boundary, which is ideally matched by the task of an event detection system. Evaluations on the UPC-TALP and ITC-IRST databases of highly variable acoustic events demonstrate the effectiveness of the proposed approaches by achieving a 5.30% and 4.44% absolute error rate improvement respectively compared to the state of art technique.

[1]  Dinggang Shen,et al.  Automatic cystocele severity grading in transperineal ultrasound by random forest regression , 2017, Pattern Recognit..

[2]  Yun Yang,et al.  Bi-weighted ensemble via HMM-based approaches for temporal data clustering , 2018, Pattern Recognit..

[3]  Andrey Temko,et al.  Acoustic event detection in meeting-room environments , 2009, Pattern Recognit. Lett..

[4]  Chloé Clavel,et al.  Events Detection for an Audio-Based Surveillance System , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[5]  T. Andringa,et al.  Sound event recognition through expectancy-based evaluation ofsignal-driven hypotheses , 2010, Pattern Recognit. Lett..

[6]  Roberto Togneri,et al.  Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection , 2017, INTERSPEECH.

[7]  Augusto Sarti,et al.  Scream and gunshot detection and localization for audio-surveillance systems , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[8]  Mark Hasegawa-Johnson,et al.  Acoustic fall detection using Gaussian mixture models and GMM supervectors , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Jun Zhang,et al.  A Target Guided Subband Filter for Acoustic Event Detection in Noisy Environments Using Wavelet Packets , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[10]  Wolf-Joachim Fischer,et al.  Food Intake Monitoring: Automated Chew Event Detection in Chewing Sounds , 2014, IEEE Journal of Biomedical and Health Informatics.

[11]  Stefan Goetze,et al.  Detection and Classification of Acoustic Events for In-Home Care , 2011 .

[12]  Guillaume Lemaitre,et al.  Real-Time Detection of Overlapping Sound Events with Non-Negative Matrix Factorization , 2013 .

[13]  Chng Eng Siong,et al.  Overlapping sound event recognition using local spectrogram features and the generalised hough transform , 2013, Pattern Recognit. Lett..

[14]  Andrey Temko,et al.  Classification of acoustic events using SVM-based clustering schemes , 2006, Pattern Recognit..

[15]  Roberto Togneri,et al.  Random forest regression based acoustic event detection with bottleneck features , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[16]  Jimmy Ludeña-Choez,et al.  Acoustic Event Classification using spectral band selection and Non-Negative Matrix Factorization-based features , 2016, Expert Syst. Appl..

[17]  Nicolai Petkov,et al.  Reliable detection of audio events in highly noisy environments , 2015, Pattern Recognit. Lett..

[18]  Dan Stowell,et al.  A database and challenge for acoustic scene classification and event detection , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[19]  Andrey Temko,et al.  Fuzzy integral based information fusion for classification of highly confusable non-speech sounds , 2008, Pattern Recognit..

[20]  Heikki Huttunen,et al.  Recognition of acoustic events using deep neural networks , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[21]  Jin Zhao,et al.  Discriminant deep belief network for high-resolution SAR image classification , 2017, Pattern Recognit..

[22]  Heikki Huttunen,et al.  Polyphonic sound event detection using multi label deep neural networks , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[23]  Xiaohui Peng,et al.  A novel random forests based class incremental learning method for activity recognition , 2018, Pattern Recognit..

[24]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[25]  Jörn Anemüller,et al.  Automatic acoustic siren detection in traffic noise by part-based models , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[26]  Jian Yu,et al.  On convergence and parameter selection of the EM and DA-EM algorithms for Gaussian mixtures , 2018, Pattern Recognit..

[27]  Li Li,et al.  Adjusted weight voting algorithm for random forests in handling missing values , 2017, Pattern Recognit..

[28]  Roberto Togneri,et al.  Random forest classification based acoustic event detection , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[29]  Reishi Kondo,et al.  Acoustic event detection based on non-negative matrix factorization with mixtures of local dictionaries and activation aggregation , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[30]  Huy Phan,et al.  Random Regression Forests for Acoustic Event Detection and Classification , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[31]  Roberto Togneri,et al.  Confidence Based Acoustic Event Detection , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Tuomas Virtanen,et al.  A report on sound event detection with different binaural features , 2017, ArXiv.

[33]  Giovanni Maria Farinella,et al.  Semantic segmentation of images exploiting DCT based features and random forest , 2016, Pattern Recognit..

[34]  Moncef Gabbouj,et al.  Supervised model training for overlapping sound events based on unsupervised source separation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[35]  Maria E. Niessen,et al.  Hierarchical modeling using automated sub-clustering for sound event recognition , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[36]  Jozef Juhár,et al.  Feature selection for acoustic events detection , 2013, Multimedia Tools and Applications.

[37]  Jörn Anemüller,et al.  Spectro-Temporal Gabor Filterbank Features for Acoustic Event Detection , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[38]  Erik J. Bekkers,et al.  Retinal vessel delineation using a brain-inspired wavelet transform and random forest , 2017, Pattern Recognit..

[39]  Andrey Temko,et al.  CLEAR Evaluation of Acoustic Event Detection and Classification Systems , 2006, CLEAR.

[40]  Sridhar Krishnan,et al.  Log-frequency spectrogram for respiratory sound monitoring , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[41]  Gernot A. Fink,et al.  A Bag-of-Features approach to acoustic event detection , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[42]  Feiping Nie,et al.  SVM based multi-label learning with missing labels for image annotation , 2018, Pattern Recognit..

[43]  Thomas S. Huang,et al.  Real-world acoustic event detection , 2010, Pattern Recognit. Lett..

[44]  Kellie J. Archer,et al.  Empirical characterization of random forest variable importance measures , 2008, Comput. Stat. Data Anal..

[45]  Noel Lopes,et al.  Towards adaptive learning with improved convergence of deep belief networks on graphics processing units , 2014, Pattern Recognit..