Content-Based Classification and Retrieval of Wild Animal Sounds Using Feature Selection Algorithm

Automatic animal sound classification and retrieval is very helpful for bioacoustic and audio retrieval applications. In this paper we propose a system to define and extract a set of acoustic features from all archived wild animal sound recordings that is used in subsequent feature selection, classification and retrieval tasks. The database consisted of sounds of six wild animals. The Fractal Dimension analysis based segmentation was selected due to its ability to select the right portion of signal for extracting the features. The feature vectors of the proposed algorithm consist of spectral, temporal and perceptual features of the animal vocalizations. The minimal Redundancy, Maximal Relevance (mRMR) feature selection analysis was exploited to increase the classification accuracy at a compact set of features. These features were used as the inputs of two neural networks, the k-Nearest Neighbor (kNN), the Multi-Layer Perceptron (MLP) and its fusion. The proposed system provides quite robust approach for classification and retrieval purposes, especially for the wild animal sounds.

[1]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[2]  S. Gunasekaran,et al.  Fractal dimension analysis of audio signals for Indian musical instrument recognition , 2008, 2008 International Conference on Audio, Language and Image Processing.

[3]  Volker B Deecke,et al.  Automated categorization of bioacoustic signals: avoiding perceptual pitfalls. , 2005, The Journal of the Acoustical Society of America.

[4]  Christian Breiteneder,et al.  Discrimination and retrieval of animal sounds , 2006, 2006 12th International Multi-Media Modelling Conference.

[5]  James Theiler,et al.  Estimating fractal dimension , 1990 .

[6]  Lie Lu,et al.  Content analysis for audio classification and segmentation , 2002, IEEE Trans. Speech Audio Process..

[7]  P. Maragos,et al.  Fractal dimensions of speech sounds: computation and application to automatic speech recognition. , 1999, The Journal of the Acoustical Society of America.

[8]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Shiu Yin Yuen,et al.  Fractal dimension estimation and noise filtering using Hough transform , 2004, Signal Process..

[10]  Brian Litt,et al.  A comparison of waveform fractal dimension algorithms , 2001 .

[11]  S. Gunasekaran,et al.  Recognition of Indian Musical Instruments with Multi-Classifier Fusion , 2008, 2008 International Conference on Computer and Electrical Engineering.

[12]  Hervé Bourlard,et al.  Hybrid HMM/ANN and GMM combination for user-customized password speaker verification , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..