Acoustic detection and classification of Microchiroptera using machine learning: lessons learned from automatic speech recognition.

Current automatic acoustic detection and classification of microchiroptera utilize global features of individual calls (i.e., duration, bandwidth, frequency extrema), an approach that stems from expert knowledge of call sonograms. This approach parallels the acoustic phonetic paradigm of human automatic speech recognition (ASR), which relied on expert knowledge to account for variations in canonical linguistic units. ASR research eventually shifted from acoustic phonetics to machine learning, primarily because of the superior ability of machine learning to account for signal variation. To compare machine learning with conventional methods of detection and classification, nearly 3000 search-phase calls were hand labeled from recordings of five species: Pipistrellus bodenheimeri, Molossus molossus, Lasiurus borealis, L. cinereus semotus, and Tadarida brasiliensis. The hand labels were used to train two machine learning models: a Gaussian mixture model (GMM) for detection and classification and a hidden Markov model (HMM) for classification. The GMM detector produced 4% error compared to 32% error for a baseline broadband energy detector, while the GMM and HMM classifiers produced errors of 0.6 +/- 0.2% compared to 16.9 +/- 1.1% error for a baseline discriminant function analysis classifier. The experiments showed that machine learning algorithms produced errors an order of magnitude smaller than those for conventional methods.

[1]  Gareth Jones,et al.  Identification of twenty‐two bat species (Mammalia: Chiroptera) from Italy by analysis of time‐expanded recordings of echolocation calls , 2002 .

[2]  Mark D Skowronski,et al.  Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition. , 2004, The Journal of the Acoustical Society of America.

[3]  M. Fenton,et al.  Bats of Kootenay, Glacier, and Mount Revelstoke national parks in Canada: identification by echolocation calls, distribution, and biology , 1983 .

[4]  S. Parsons,et al.  Acoustic identification of twelve species of echolocating bat by discriminant function analysis and artificial neural networks. , 2000, The Journal of experimental biology.

[5]  Y. Chan,et al.  A parameter estimation approach to estimation of frequencies of sinusoids , 1981 .

[6]  B. Atal Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.

[7]  M. Obrist Flexible bat echolocation: the influence of individual, habitat and conspecifics on sonar signal design , 1995, Behavioral Ecology and Sociobiology.

[8]  James A. Simmons,et al.  The structure of echolocation sounds used by the big brown bat Eptesicus fuscus : some consequences for echo processing , 1991 .

[9]  Alan V. Oppenheim,et al.  Discrete-Time Signal Pro-cessing , 1989 .

[10]  M. Obrist,et al.  Variability in echolocation call design of 26 Swiss bat species: consequences, limits and options for automated field identification with a synergetic pattern recognition approach , 2004 .

[11]  C. Weinstein,et al.  A system for acoustic-phonetic analysis of continuous speech , 1975 .

[12]  Stephen M. Dawson,et al.  Echolocation Calls of the Long-Tailed Bat: A Quantitative Analysis of Types of Calls , 1997 .

[13]  M. Fenton,et al.  TIME-EXPANSION AND ZERO-CROSSING PERIOD METER SYSTEMS PRESENT SIGNIFICANTLY DIFFERENT VIEWS OF ECHOLOCATION CALLS OF BATS , 2001 .

[14]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[15]  S. Furui,et al.  Automatic recognition and understanding of spoken language - a first step toward natural human-machine communication , 2000, Proceedings of the IEEE.

[16]  Bruce W. Miller,et al.  Qualitative Identification of Free-Flying Bats Using the Anabat Detector , 1999 .

[17]  S. Furui,et al.  Cepstral analysis technique for automatic speaker verification , 1981 .

[18]  M. Fenton,et al.  Recognition of Species of Insectivorous Bats by Their Echolocation Calls , 1981 .

[19]  Stuart Parsons,et al.  ADVANTAGES AND DISADVANTAGES OF TECHNIQUES FOR TRANSFORMING AND ANALYZING CHIROPTERAN ECHOLOCATION CALLS , 2000 .

[20]  Stephanie Seneff,et al.  Two-stage continuous speech recognition using feature-based models: a preliminary study , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[21]  John G. Harris,et al.  Automatic detection of microchiroptera echolocation calls from field recordings using machine learning algorithms , 2005 .