Monitoring and Mining Animal Sounds in Visual Space

Monitoring animals by the sounds they produce is an important and challenging task, whether the application is outdoors in a natural habitat, or in the controlled environment of a laboratory setting. In the former case, the density and diversity of animal sounds can act as a measure of biodiversity. In the latter case, researchers often create control and treatment groups of animals, expose them to different interventions, and test for different outcomes. One possible manifestation of different outcomes may be changes in the bioacoustics of the animals. With such a plethora of important applications, there have been significant efforts to build bioacoustic classification tools. However, we argue that most current tools are severely limited. They often require the careful tuning of many parameters (and thus huge amounts of training data), are either too computationally expensive for deployment in resource-limited sensors, specialized for a very small group of species, or are simply not accurate enough to be useful. In this work we introduce a novel bioacoustic recognition/classification framework that mitigates or solves all of the above problems. We propose to classify animal sounds in the visual space, by treating the texture of their sonograms as an acoustic fingerprint using a recently introduced parameter-free texture measure as a distance measure. We further show that by searching for the most representative acoustic fingerprint, we can significantly outperform other techniques in terms of speed and accuracy.

[1]  Jill L. Deppe,et al.  Using soundscape recordings to estimate bird species abundance, richness, and composition , 2009 .

[2]  J A Kogan,et al.  Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: a comparative study. , 1998, The Journal of the Acoustical Society of America.

[3]  Geoffrey A. Williamson,et al.  Comparison of methods for automated recognition of avian nocturnal flight calls , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Zeev Zalevsky,et al.  Cleaning and Quality Classification of Optically Recorded Voice Signals , 2010 .

[5]  T. Holy,et al.  Ultrasonic Songs of Male Mice , 2005, PLoS biology.

[6]  Dah-Jye Lee,et al.  Converting non-parametric distance-based classification to anytime algorithms , 2008, Pattern Analysis and Applications.

[7]  Wu-chi Feng,et al.  RHA: A robust hybrid architecture for information processing in wireless sensor networks , 2010, 2010 Sixth International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[8]  Li Wei,et al.  Compression-based data mining of sequential data , 2007, Data Mining and Knowledge Discovery.

[9]  Christian Breiteneder,et al.  Discrimination and retrieval of animal sounds , 2006, 2006 12th International Multi-Media Modelling Conference.

[10]  Bin Ma,et al.  The similarity metric , 2001, IEEE Transactions on Information Theory.

[11]  Eamonn J. Keogh,et al.  Scaling and time warping in time series querying , 2005, The VLDB Journal.

[12]  Klaus Riede,et al.  High background noise shapes selective auditory filters in a tropical cricket , 2011, Journal of Experimental Biology.

[13]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[14]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[15]  Amy Roda,et al.  Perspective and Promise: a Century of Insect Acoustic Detection and Monitoring , 2011 .

[16]  R. Sequeira,et al.  Potential Effect of Anoplophora glabripennis (Coleoptera: Cerambycidae) on Urban Trees in the United States , 2001, Journal of economic entomology.

[17]  F. Schwenker,et al.  Automated annotation of Orthoptera songs: first results from analysing the DORSA sound repository , 2006 .

[18]  Jean-Jacques E. Slotine,et al.  Audio classification from time-frequency texture , 2008, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  Andreas M. Ali,et al.  Acoustic monitoring in terrestrial environments using microphone arrays: applications, technological considerations and prospectus , 2011 .

[20]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[21]  H. Gerhardt,et al.  Temperature Effects on the Temporal Properties of Calling Songs in the Crickets Gryllus fultoni and G. vernalis: Implications for Reproductive Isolation in Sympatric Populations , 2007, Journal of Insect Behavior.

[22]  Garet P. Lahvis,et al.  Affiliative Behavior, Ultrasonic Communication and Social Reward Are Influenced by Genetic Variation in Adolescent Mice , 2007, PloS one.

[23]  N. Han,et al.  Acoustic classification of Australian anurans based on hybrid spectral-entropy approach , 2011 .

[24]  Günther Palm,et al.  Classification of Time Series Utilizing Temporal and Decision Fusion , 2001, Multiple Classifier Systems.

[25]  Francesco Bianconi,et al.  Evaluation of the effects of Gabor filter parameters on texture classification , 2007, Pattern Recognit..

[26]  J. Sloper Chicken and egg , 2006, British Journal of Ophthalmology.

[27]  Eamonn J. Keogh,et al.  A compression‐based distance measure for texture , 2010, Stat. Anal. Data Min..

[28]  Rolf Bardeli,et al.  Similarity Search in Animal Sound Databases , 2009, IEEE Transactions on Multimedia.

[29]  D K Mellinger,et al.  Recognizing transient low-frequency whale sounds by spectrogram correlation. , 2000, The Journal of the Acoustical Society of America.

[30]  L. Desutter‐Grandcolas First Analysis of a Disturbance Stridulation in Crickets, Brachytrupes tropicus (Orthoptera: Grylloidea: Gryllidae) , 2004, Journal of Insect Behavior.

[31]  Paris Smaragdis,et al.  Hidden Markov and Gaussian mixture models for automatic call classification. , 2009, The Journal of the Acoustical Society of America.