Automatic classification and speaker identification of African elephant (Loxodonta africana) vocalizations.

A hidden Markov model (HMM) system is presented for automatically classifying African elephant vocalizations. The development of the system is motivated by successful models from human speech analysis and recognition. Classification features include frequency-shifted Mel-frequency cepstral coefficients (MFCCs) and log energy, spectrally motivated features which are commonly used in human speech processing. Experiments, including vocalization type classification and speaker identification, are performed on vocalizations collected from captive elephants in a naturalistic environment. The system classified vocalizations with accuracies of 94.3% and 82.5% for type classification and speaker identification classification experiments, respectively. Classification accuracy, statistical significance tests on the model parameters, and qualitative analysis support the effectiveness and robustness of this approach for vocalization analysis in nonhuman species.

[1]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[2]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[3]  Tobias Riede,et al.  The relationship between acoustic structure and semantic information in Diana monkey alarm vocalization. , 2003, The Journal of the Acoustical Society of America.

[4]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[5]  T. Moon The expectation-maximization algorithm , 1996, IEEE Signal Process. Mag..

[6]  Michael T. Johnson,et al.  Application of speech recognition to African elephant (Loxodonta africana) vocalizations , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[7]  Douglas A. Reynolds,et al.  An overview of automatic speaker recognition technology , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  K. Payne,et al.  African Elephants Respond to Distant Playbacks of Low-Frequency Conspecific Calls , 1991 .

[9]  F. Galimberti,et al.  BIOACOUSTICS OF SOUTHERN ELEPHANT SEALS. II. INDIVIDUAL AND GEOGRAPHICAL VARIATION IN MALE AGGRESSIVE VOCALISATIONS , 2000 .

[10]  B. Sjare,et al.  The vocal repertoire of white whales, Delphinapterus leucas, summering in Cunningham Inlet, Northwest Territories , 1986 .

[11]  K. McComb,et al.  Unusually extensive networks of vocal recognition in African elephants , 2000, Animal Behaviour.

[12]  J R Potter,et al.  Marine mammal call discrimination using artificial neural networks. , 1994, The Journal of the Acoustical Society of America.

[13]  R. Seyfarth,et al.  The acoustic features of vowel-like grunt calls in chacma baboons (Papio cyncephalus ursinus): implications for production processes and functions. , 1997, The Journal of the Acoustical Society of America.

[14]  A. Ortolani,et al.  The use of low-frequency vocalizations in African elephant(Loxodonta africana) reproductive strategies , 2003, Hormones and Behavior.

[15]  Dafydd Gibbon,et al.  1 User’s guide , 1998 .

[16]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[17]  Thomas G. Smith,et al.  The relationship between behavioral activity and underwater vocalizations of the white whale, Delphinapterus leucas , 1986 .

[18]  David A. Helweg,et al.  Acoustic identification of female Steller sea lions , 2000 .

[19]  H. Heffner,et al.  Hearing in the elephant (Elephas maximus): absolute sensitivity, frequency discrimination, and sound localization. , 1982, Journal of comparative and physiological psychology.

[20]  William A. Watkins,et al.  Characterizing acoustic features of marine animal sounds , 1992 .

[21]  Charles T. Snowdon,et al.  The Complex Vocal Repertoire of the Adult Cotton‐top Tamarin (Saguinus oedipus oedipus)1) , 2010 .

[22]  Sven Anderson Speech recognition meets bird song: A comparison of statistics‐based and template‐based techniques , 1999 .

[23]  Christopher W. Clark,et al.  Bioacoustic transient detection by image convolution , 1993 .

[24]  Michael Picheny,et al.  Large-Vocabulary Speech Recognition Algorithms , 2002, Computer.

[25]  David A Helweg,et al.  Acoustic identification of female Steller sea lions (Eumetopias jubatus). , 2002, The Journal of the Acoustical Society of America.

[26]  A. Ortolani,et al.  QUANTIFYING ACOUSTIC AND TEMPORAL CHARACTERISTICS OF VOCALIZATIONS FOR A GROUP OF CAPTIVE AFRICAN ELEPHANTS LOXODONTA AFRICANA , 2003 .

[27]  Cynthia J. Moss,et al.  The social contexts of some very low frequency calls of African elephants , 1988, Behavioral Ecology and Sociobiology.

[28]  E. D. Chesmore,et al.  Application of time domain signal coding and artificial neural networks to passive acoustical identification of animals , 2001 .