Automatic Segmentation of Audio Signals for Bird Species Identification

The identification of bird species from their audio recorded songs are nowadays used in several important applications, such as to monitor the quality of the environment and to prevent bird-plane collisions near airports. The complete identification cycle involves the use of: (a) recording devices to acquire the songs, (b) audio processing techniques to remove the noise and to select the most representative elements of the signal, (c) feature extraction procedures to obtain relevant characteristics, and (d) decision procedures to make the identification. The decision procedures can be obtained by Machine Learning (ML) algorithms, considering the problem in a standard classification scenario. One key element is this cycle is the selection of the most relevant segments of the audio for identification purposes. In this paper we show that the use of short audio segments with high amplitude - called pulses in our work - outperforms the use of the complete audio records in the species identification task. We also show how these pulses can be automatically obtained, based on measurements performed directly on the audio signal. The employed classifiers are trained using a previously labeled database of bird songs. We use a database that contains bird song recordings from 75 species which appear in the Southern Atlantic Coast of South America. Obtained results show that the use of automatically obtained pulses and a SVM classifier produce the best results, all the necessary procedures can be installed in a dedicated hardware, allowing the construction of a specific bird identification device.

[1]  Alessandro Lameiras Koerich,et al.  Feature set comparison for automatic bird species identification , 2011, 2011 IEEE International Conference on Systems, Man, and Cybernetics.

[2]  Chih-Hsun Chou,et al.  On the Studies of Syllable Segmentation and Improving MFCCs for Automatic Birdsong Recognition , 2008, 2008 IEEE Asia-Pacific Services Computing Conference.

[3]  Panu Somervuo,et al.  Parametric Representations of Bird Sounds for Automatic Species Recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  T. Scott Brandes,et al.  Automated sound recording and analysis techniques for bird surveys and conservation , 2008, Bird Conservation International.

[5]  Chang-Hsing Lee,et al.  Bird Species Recognition by Comparing the HMMs of the Syllables , 2007, Second International Conference on Innovative Computing, Informatio and Control (ICICIC 2007).

[6]  Xiaoli Z. Fern,et al.  Audio Classification of Bird Species: A Statistical Manifold Approach , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[7]  Seppo Ilmari Fagerlund,et al.  Bird Species Recognition Using Support Vector Machines , 2007, EURASIP J. Adv. Signal Process..

[8]  Vincent M. Stanford,et al.  Bird classification algorithms: theory and experimental results , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Celso A. A. Kaestner,et al.  Spectral Noise Gate Technique Applied to Birdsong Preprocessing on Embedded Unit , 2012, 2012 IEEE International Symposium on Multimedia.

[10]  Charles E. Taylor,et al.  Data Mining Applied to Acoustic Bird Species Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[11]  Chih-Hsun Chou,et al.  Bird Species Recognition by Wavelet Transformation of a Section of Birdsong , 2009, 2009 Symposia and Workshops on Ubiquitous, Autonomic and Trusted Computing.

[12]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[13]  Celso A. A. Kaestner,et al.  Hierarchical Classification of Bird Species Using Their Audio Recorded Songs , 2013, 2013 IEEE International Conference on Systems, Man, and Cybernetics.

[14]  E. B. Newman,et al.  A Scale for the Measurement of the Psychological Magnitude Pitch , 1937 .

[15]  Alessandro Lameiras Koerich,et al.  A Machine Learning Approach to Automatic Music Genre Classification , 2008, Journal of the Brazilian Computer Society.

[16]  Frank Kurth,et al.  Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring , 2010, Pattern Recognit. Lett..

[17]  Andreas Stolcke,et al.  Bird species recognition combining acoustic and sequence modeling , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[18]  Alessandro Lameiras Koerich,et al.  Automatic music genre classification using ensemble of classifiers , 2007, 2007 IEEE International Conference on Systems, Man and Cybernetics.

[19]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[20]  Chin-Chuan Han,et al.  Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Xiaoli Z. Fern,et al.  Time-frequency segmentation of bird song in noisy acoustic environments , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Jinhai Cai,et al.  Sensor Network for the Monitoring of Ecosystem: Bird Species Recognition , 2007, 2007 3rd International Conference on Intelligent Sensors, Sensor Networks and Information.

[23]  Alessandro Lameiras Koerich,et al.  Automatic Bird Species Identification for Large Number of Species , 2011, 2011 IEEE International Symposium on Multimedia.