Automatic Detection and Recognition of Tonal Bird Sounds in Noisy Environments

This paper studies the automatic detection and recognition of tonal bird sounds in noisy environments. Spectro-temporal regions containing tonal bird vocalisations are detected by exploiting spectral shape to identify sinusoidal components in the short-time spectrum. The detection method yields a tonal-based feature representation that is then used for automatic bird recognition. The recognition system uses Gaussian mixture models to model 165 different bird syllables produced by 95 bird species. Both standard models and models that compensate for the effect of noise are employed. Experiments are performed on bird sound recordings corrupted by white noise and by real-world environmental noise. The proposed method detects tonal components of bird sounds with high accuracy. The tonal-based features give significantly higher recognition accuracy than Mel-frequency cepstral coefficients with both standard and noise-compensated models, and they are strongly robust to mismatch between training and testing conditions.
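The idea of using spectral shape to find sinusoidal components can be illustrated with a minimal sketch. It is not the paper's algorithm. It assumes a Hamming analysis window, takes the window's own magnitude response as the shape a stationary sinusoid would leave around a spectral peak, and scores each local peak by cosine similarity to that shape. All function names, the similarity measure, and the 0.95 threshold are hypothetical choices made for illustration.

```python
import cmath
import math


def hamming(n):
    """Hamming analysis window of length n."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def magnitude_spectrum(samples, n_fft):
    """Magnitudes of DFT bins 0..n_fft/2 (direct DFT, implicit zero padding)."""
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * n / n_fft)
                for n, x in enumerate(samples)))
        for k in range(n_fft // 2 + 1)
    ]


def sinusoid_template(window, n_fft, half_width):
    """Main-lobe shape expected around the peak of a stationary sinusoid."""
    w_mag = magnitude_spectrum(window, n_fft)
    # The window's magnitude response is symmetric about bin 0.
    return [w_mag[abs(j)] for j in range(-half_width, half_width + 1)]


def tonal_score(mags, k, template):
    """Cosine similarity between the spectrum around bin k and the template."""
    h = len(template) // 2
    local = mags[k - h:k + h + 1]
    dot = sum(a * b for a, b in zip(local, template))
    norm = (math.sqrt(sum(a * a for a in local))
            * math.sqrt(sum(b * b for b in template)))
    return dot / norm if norm > 0 else 0.0


def detect_tonal_bins(frame, n_fft=256, half_width=3, threshold=0.95):
    """Return the spectral-peak bins whose local shape resembles a sinusoid.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    window = hamming(len(frame))
    mags = magnitude_spectrum([x * w for x, w in zip(frame, window)], n_fft)
    template = sinusoid_template(window, n_fft, half_width)
    hits = []
    for k in range(half_width, len(mags) - half_width):
        is_peak = mags[k] > mags[k - 1] and mags[k] >= mags[k + 1]
        if is_peak and tonal_score(mags, k, template) >= threshold:
            hits.append(k)
    return hits


# Usage: a 128-sample frame holding a pure tone centred on bin 40 of a
# 256-point spectrum, and a single click whose spectrum is flat.
tone = [math.cos(2 * math.pi * 40 * n / 256) for n in range(128)]
click = [0.0] * 128
click[64] = 1.0
print(detect_tonal_bins(tone), detect_tonal_bins(click))
```

Cosine similarity over non-negative magnitudes is a weak discriminator, and peaks in windowed noise can also resemble the main lobe, so a practical detector would need a more careful distance measure and threshold than this sketch uses.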
