论文信息 - Animal Sound Classification Using A Convolutional Neural Network

Animal Sound Classification Using A Convolutional Neural Network

In this paper, we investigate the problem of animal sound classification using deep learning and propose a system based on convolutional neural network architecture. As the input to the network, sound files were preprocessed to extract Mel Frequency Cepstral Coefficients (MFCC) using LibROSA library. To train and test the system we have collected 875 animal sound samples from an online sound source site for 10 different animal types. We report classification confusion matrices and the results obtained by different gradient descent optimizers. The best accuracy of 75% was obtained by Nesterov-accelerated Adaptive Moment Estimation (Nadam).

F. Boray Tek | Emre Şaşmaz | F. Tek | F. Tek | Emre Sasmaz

[1] Timothy Dozat,et al. Incorporating Nesterov Momentum into Adam , 2016 .

[2] Sunil L. Tade,et al. Identification & Detection System for Animals from their Vocalization , 2013 .

[3] S. Squartini,et al. DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks , 2016, DCASE.

[4] Karol J. Piczak. Environmental sound classification with convolutional neural networks , 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).

[5] A. K. Santra,et al. Genetic Algorithm and Confusion Matrix for Document Clustering , 2012 .

[6] Lonce L. Wyse,et al. Audio Spectrogram Representations for Processing with Convolutional Neural Networks , 2017, ArXiv.

[7] Colin Raffel,et al. librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.

[8] Izzet Kale,et al. Robust localization and identification of African clawed frogs in digital images , 2014, Ecol. Informatics.

[9] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10] Heiga Zen,et al. WaveNet: A Generative Model for Raw Audio , 2016, SSW.