Bird Species Recognition Using Support Vector Machines

This paper studies automatic identification of bird species from their vocalizations. Bird sounds are described with two parametric representations, (i) mel-cepstrum parameters and (ii) a set of low-level signal parameters, both of which have previously been found useful for bird species recognition. Recognition is performed by a decision tree with a support vector machine (SVM) classifier at each node, where each classifier discriminates between two species. The method is tested on two sets of bird species that have previously been used to evaluate alternative methods. The results suggest that the proposed method performs as well as or better than the existing reference methods.
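The idea of a decision tree whose nodes are two-species SVM classifiers can be illustrated with a minimal sketch. The abstract does not specify the tree layout, kernel, or training procedure, so everything below is an assumption made for illustration: the tree is arranged as a pairwise elimination sequence (each node compares two candidate species and discards the loser), the SVMs are linear and trained by plain stochastic subgradient descent on the hinge loss, and the 2-D feature vectors and species names are invented toy data standing in for real mel-cepstrum or low-level signal parameters. All function names are hypothetical.

```python
import random
from itertools import combinations


def train_linear_svm(X, y, lam=0.001, lr=0.05, epochs=300, seed=0):
    """Linear SVM via stochastic subgradient descent on the L2-regularized
    hinge loss. X is a list of feature vectors, y holds labels in {-1, +1}.
    Returns (w, b). Illustrative only, not the paper's training method."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    b = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            score = sum(wj * xj for wj, xj in zip(w, X[i])) + b
            # Weight decay from the L2 penalty (bias is not regularized).
            w = [(1.0 - lr * lam) * wj for wj in w]
            if y[i] * score < 1.0:  # sample is inside the margin
                w = [wj + lr * y[i] * xj for wj, xj in zip(w, X[i])]
                b += lr * y[i]
    return w, b


def train_pairwise_nodes(data):
    """Train one binary SVM per pair of species.
    data maps species name -> list of feature vectors."""
    classes = sorted(data)
    nodes = {}
    for a, c in combinations(classes, 2):
        X = data[a] + data[c]
        y = [1] * len(data[a]) + [-1] * len(data[c])
        nodes[(a, c)] = train_linear_svm(X, y)
    return classes, nodes


def classify(classes, nodes, x):
    """Walk the tree: each node compares the first and last remaining
    candidates and eliminates the one its SVM votes against."""
    candidates = list(classes)
    while len(candidates) > 1:
        a, c = candidates[0], candidates[-1]
        w, b = nodes[(a, c)]
        score = sum(wj * xj for wj, xj in zip(w, x)) + b
        if score >= 0:
            candidates.pop()    # SVM prefers a, so c is eliminated
        else:
            candidates.pop(0)   # SVM prefers c, so a is eliminated
    return candidates[0]


# Toy, well-separated 2-D "features" for three hypothetical species.
TOY_DATA = {
    "species_a": [[0.0, 0.0], [0.5, 0.0], [0.0, 0.5], [0.5, 0.5]],
    "species_b": [[3.0, 3.0], [3.5, 3.0], [3.0, 3.5], [3.5, 3.5]],
    "species_c": [[6.0, 0.0], [6.5, 0.0], [6.0, 0.5], [6.5, 0.5]],
}

CLASSES, NODES = train_pairwise_nodes(TOY_DATA)
```

With N species this arrangement trains N(N-1)/2 binary classifiers but evaluates only N-1 of them per test sample, e.g. `classify(CLASSES, NODES, [3.25, 3.25])` visits two nodes. A real system would likely replace the linear SVM with a kernel SVM from an established library.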
