Hidden Markov and Gaussian mixture models for automatic call classification.

Automatic methods of classification of animal sounds offer many advantages including speed and consistency in processing massive quantities of data. Calculations have been carried out on a set of 75 calls of Northern Resident killer whales, previously classified perceptually (human classification) into seven call types, using, hidden Markov models (HMMs) and Gaussian mixture models (GMMs). Neither of these methods has been used previously for classification of marine mammal call types. With cepstral coefficients as features both HMMs and GMMs give over 90% agreement with the perceptual classification, with the HMM over 95% for some cases.

[1]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[2]  Stephen A. Dyer,et al.  Digital signal processing , 2018, 8th International Multitopic Conference, 2004. Proceedings of INMIC 2004..

[3]  J A Kogan,et al.  Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. , 1998, The Journal of the Acoustical Society of America.

[4]  Thomas W. Parks,et al.  Isolating biological acoustic transient signals , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  David Reby,et al.  Cepstral coefficients and hidden Markov models reveal idiosyncratic voice characteristics in red deer (Cervus elaphus) stags. , 2006, The Journal of the Acoustical Society of America.

[6]  J C Brown Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. , 1999, The Journal of the Acoustical Society of America.

[7]  Judith C Brown,et al.  Automatic classification of killer whale vocalizations using dynamic time warping. , 2007, The Journal of the Acoustical Society of America.

[8]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[9]  David G. Stork,et al.  Pattern Classification , 1973 .

[10]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[11]  M.T. Johnson,et al.  Automatic Song-Type Classification and Speaker Identification of Norwegian Ortolan Bunting (Emberiza Hortulana) Vocalizations , 2005, 2005 IEEE Workshop on Machine Learning for Signal Processing.

[12]  J C Brown,et al.  Feature dependence in the automatic identification of musical woodwind instruments. , 2001, The Journal of the Acoustical Society of America.

[13]  D K Mellinger,et al.  Recognizing transient low-frequency whale sounds by spectrogram correlation. , 2000, The Journal of the Acoustical Society of America.

[14]  S. Datta,et al.  Dolphin whistle classification for determining group identities , 2002, Signal Process..

[15]  Michael T. Johnson,et al.  Acoustic model adaptation for ortolan bunting (Emberiza hortulana L.) song-type classification. , 2008, The Journal of the Acoustical Society of America.

[16]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[17]  Patrick J Clemins,et al.  Automatic classification of animal vocalizations , 2005 .

[18]  J. Hildebrand,et al.  Gaussian mixture model classification of odontocetes in the Southern California Bight and the Gulf of California. , 2007, The Journal of the Acoustical Society of America.

[19]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[20]  Alex Bateman,et al.  An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.

[21]  Patrick J Clemins,et al.  Automatic classification and speaker identification of African elephant (Loxodonta africana) vocalizations. , 2003, The Journal of the Acoustical Society of America.

[22]  Patrick J Clemins,et al.  Generalized perceptual linear prediction features for animal vocalization analysis. , 2006, The Journal of the Acoustical Society of America.