An automatic classification of bird species using audio feature extraction and support vector machines

Automatic identification of bird species based on the chirping sounds of birds was experimented using feature extraction method and classification based on support vector machines (SVMs). The proposed technique followed the extraction of cepstral features on mel scale of each audio recording from the collected standard database. Extracted mel frequency cepstral coefficients (MFCCs) formed a feature matrix. This feature matrix was then trained and tested for efficient recognition of audio events from audio test signals. 70% of the whole database was used for training purpose while the reamaining 30% for testing of samples. The classifier achieved upto 89.4% accuracy on a data set containing four species, commonly found in India.

[1]  Karthikeyan Umapathy,et al.  Audio Signal Feature Extraction and Classification Using Local Discriminant Bases , 2004, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Panu Somervuo,et al.  Classification of the harmonic structure in bird vocalization , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Lie Lu,et al.  Content analysis for audio classification and segmentation , 2002, IEEE Trans. Speech Audio Process..

[4]  Aki Härmä Automatic identification of bird species based on sinusoidal modeling of syllables , 2003, ICASSP.

[5]  Michael J. Carey,et al.  A comparison of features for speech, music discrimination , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[6]  J A Kogan,et al.  Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. , 1998, The Journal of the Acoustical Society of America.

[7]  C.-C. Jay Kuo,et al.  Environmental sound recognition using MP-based features , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[9]  H. C. Card,et al.  Birdsong recognition using backpropagation and multivariate statistics , 1997, IEEE Trans. Signal Process..

[10]  Mohammed Bennamoun,et al.  Text-independent speaker identification in birds , 2006, INTERSPEECH.

[11]  D Margoliash,et al.  Template-based automatic recognition of birdsong syllables from continuous recordings. , 1996, The Journal of the Acoustical Society of America.