Automatic classification of frogs calls based on fusion of features and SVM

This paper presents a new approach for the acoustic classification of frogs' calls using a novel fusion of features: Mel Frequency Cepstral Coefficients (MFCCs), Shannon entropy and syllable duration. First, the audio recordings of different frogs' species are segmented in syllables. For each syllable, each feature is extracted and the cepstral features (MFCC) are computed and evaluated separately as in previous works. Finally, the data fusion is used to train a multiclass Support Vector Machine (SVM) classifier. In our experiment, the results show that our novel feature fusion increase the classification accuracy; achieving an average of 94.21% ± 8,04 in 18 frog's species.

[2]  M. Torralva,et al.  Understanding of the impact of chemicals on amphibians: a meta-analytic review , 2012, Ecology and evolution.

[3]  Miguel Angel Ferrer-Ballester,et al.  Sign language to text by SVM , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[4]  Andrew Taylor,et al.  Monitoring Frog Communities: An Application of Machine Learning , 1996, AAAI/IAAI, Vol. 2.

[5]  L. Trueb,et al.  Biology of Amphibians , 1986 .

[6]  Chun-Cheng Lin,et al.  Automatic recognition of frog calls using a multi-stage average spectrum , 2012, Comput. Math. Appl..

[7]  Héctor Corrada Bravo,et al.  Automated classification of bird and amphibian calls using machine learning: A comparison of methods , 2009, Ecol. Informatics.

[8]  Dzati Athiar Ramli,et al.  Frog Sound Identification System for Frog Species Recognition , 2012, ICCASA.

[9]  Ross A. Alford,et al.  Global Amphibian Declines: A Problem in Applied Ecology , 1999 .

[10]  Paul Roe,et al.  Acoustic classification of Australian anurans using syllable features , 2015, 2015 IEEE Tenth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP).

[11]  T. S. Brandes,et al.  Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  V. Tiwari MFCC and its applications in speaker recognition , 2010 .

[13]  Sandrine Pavoine,et al.  Rapid Acoustic Survey for Biodiversity Appraisal , 2008, PloS one.

[14]  Danoush Hosseinzadeh,et al.  Combining Vocal Source and MFCC Features for Enhanced Speaker Recognition Performance Using GMMs , 2007, 2007 IEEE 9th Workshop on Multimedia Signal Processing.