The Details That Matter: Frequency Resolution of Spectrograms in Acoustic Scene Classification

This study describes a convolutional neural network model submitted to the acoustic scene classification task of the DCASE 2017 challenge. The model is evaluated with different frequency resolutions of the input spectrogram, showing that a higher number of mel bands improves accuracy with negligible impact on training time. Additionally, whereas the convolutional model focuses solely on the ambient characteristics of the audio scene, a proposed extension with pretrained event detectors shows potential for further exploration.
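To make "frequency resolution" concrete, the sketch below shows how the number of mel bands changes the spacing between band centers. It is an illustration only, not the paper's code: the HTK-style mel formula, the 0 to 11025 Hz range, the band counts (40, 100, 200), and the helper name `mel_band_centers` are all assumptions chosen for the example.

```python
import math


def hz_to_mel(f_hz):
    """Convert Hz to mel using the HTK-style formula (one common convention)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_band_centers(n_mels, fmin=0.0, fmax=11025.0):
    """Center frequencies (Hz) of n_mels bands equally spaced on the mel scale.

    Hypothetical helper: fmin and fmax are assumed values, not taken from the paper.
    """
    lo, hi = hz_to_mel(fmin), hz_to_mel(fmax)
    # n_mels triangular filters need n_mels + 2 edge points;
    # the band centers are the interior points.
    step = (hi - lo) / (n_mels + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_mels)]


if __name__ == "__main__":
    for n_mels in (40, 100, 200):  # assumed band counts for illustration
        centers = mel_band_centers(n_mels)
        low_gap = centers[1] - centers[0]
        high_gap = centers[-1] - centers[-2]
        print(f"{n_mels:>3} bands: {low_gap:.1f} Hz between the lowest centers, "
              f"{high_gap:.1f} Hz between the highest")
```

With more bands, neighboring centers sit closer together across the whole range, so the spectrogram gains rows along the frequency axis and each row covers a narrower slice of the spectrum.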

Karol J. Piczak

[1] Petros Maragos, et al. Improved Dictionary Selection and Detection Schemes in Sparse-CNMF-Based Overlapping Acoustic Event Detection, 2016, DCASE.

[2] Daniele Battaglino, et al. Acoustic Scene Classification Using Convolutional Neural Networks, 2016.

[3] Nobutaka Ono, et al. Acoustic Scene Classification Using Deep Neural Network and Frame-Concatenated Acoustic Feature, 2016.

[4] Kyogu Lee, et al. Convolutional Neural Network with Multiple-Width Frequency-Delta Data Augmentation for Acoustic Scene Classification, 2016.

[5] Tuomas Virtanen, et al. Detection and Classification of Acoustic Scenes and Events, 2018.

[6] Björn Schuller, et al. Recognising Acoustic Scenes with Large-Scale Audio Feature Extraction and SVM, 2013.

[7] Gerhard Widmer, et al. CP-JKU Submissions for DCASE-2016: A Hybrid Approach Using Binaural I-Vectors and Deep Convolutional Neural Networks, 2016.

[8] Ariel Habshush, et al. IEEE AASP Scene Classification Challenge Using Hidden Markov Models and Frame Based Classification, 2013.

[9] Franz Pernkopf, et al. Gated Recurrent Networks Applied to Acoustic Scene Classification, 2016, DCASE.

[10] Hanseok Ko, et al. Deep Neural Network Bottleneck Feature for Acoustic Scene Classification, 2016.

[11] P. Herrera, et al. Recurrence Quantification Analysis Features for Auditory Scene Classification, 2013.