Using image processing to detect and classify narrow-band cricket and frog calls.

An automatic call recognition (ACR) process is described that uses image processing techniques on spectrogram images to detect and classify constant-frequency cricket and frog calls recorded amidst a background of evening sounds found in a lowland Costa Rican rainforest. This process involves using image blur filters along with thresholding filters to isolate likely calling events. Features of these events, notably the event's central frequency, duration and bandwidth, along with the type of blur filter applied, are used with a Bayesian classifier to make identifications of the different calls. Of the 22 distinct sonotypes (calls presumed to be species-specific) recorded in the study site, 17 of them were recorded in high enough numbers to both train and test the classifier. The classifier approaches 100% true-positive accuracy for these 17 sonotypes, but also has a high false-negative rate (over 50% for 4 sonotypes). The very high true-positive accuracy of this process enables its use for monitoring singing crickets (and some frog species) in tropical forests.

[1]  Friedhelm Schwenker,et al.  Classification of bioacoustic time series based on the combination of global and local decisions , 2004, Pattern Recognit..

[2]  J A Kogan,et al.  Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. , 1998, The Journal of the Acoustical Society of America.

[3]  T. J. Walker,et al.  Cryptic Species Among Sound-Producing Ensiferan Orthoptera (Gryllidae and Tettigoniidae) , 1964, The Quarterly Review of Biology.

[4]  Stuart Parsons,et al.  Identification of New Zealand bats (Chalinolobus tuberculatus and Mystacina tuberculata) in flight from analysis of echolocation calls by artificial neural networks , 2001 .

[5]  Changsheng Xu,et al.  Automatic music classification and summarization , 2005, IEEE Transactions on Speech and Audio Processing.

[6]  F. Fischer,et al.  QUANTITATIVE ASSESSMENT OF GRASSLAND QUALITY: ACOUSTIC DETERMINATION OF POPULATION SIZES OF ORTHOPTERAN INDICATOR SPECIES , 1997 .

[7]  Andrew Taylor,et al.  Monitoring Frog Communities: An Application of Machine Learning , 1996, AAAI/IAAI, Vol. 2.

[8]  Edwin Diday,et al.  The dynamic clusters method in nonhierarchical clustering , 1973, International Journal of Computer & Information Sciences.

[9]  D Margoliash,et al.  Template-based automatic recognition of birdsong syllables from continuous recordings. , 1996, The Journal of the Acoustical Society of America.

[10]  T. J. Walker,et al.  FACTORS RESPONSIBLE FOR INTRASPECIFIC VARIATION IN THE CALLING SONGS OF CRICKETS , 1962 .

[11]  K. Riede Acoustic monitoring of Orthoptera and its potential for conservation , 1998, Journal of Insect Conservation.

[12]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[13]  S. Harris,et al.  Identification of British bat species by multivariate analysis of echolocation call parameters , 1997 .

[14]  S. Parsons,et al.  Acoustic identification of twelve species of echolocating bat by discriminant function analysis and artificial neural networks. , 2000, The Journal of experimental biology.

[15]  E. D. Chesmore,et al.  Application of time domain signal coding and artificial neural networks to passive acoustical identification of animals , 2001 .

[16]  D. R. Ragge,et al.  The Songs of the Grasshoppers and Crickets of Western Europe , 1998 .