Unsupervised blue whale call detection using multiple time-frequency features

In the context of bio-acoustic sciences, call detection is a critical task for understanding the behaviour of marine mammals such as the blue whale species (Balaeonoptera musculus) considered in this work. In this paper we present an approach to blue whale call detection from an unsupervised perspective. To achieve this, we use temporal and spectral features of audio acquired with a marine autonomous recording unit. The features considered are 46-dimensional and include the mel frequency ceptrum coefficients, chromagrams, and other scalar quantities; these features were then grouped via two different clustering algorithms. Our findings confirm the suitability of the proposed approach for isolating blue whale calls from other environmental sounds (as validated by a bio-acoustic specialist). This is a clear contribution for the annotation of blue whales calls, where the search for calls can now be performed by analysing the clusters identified instead of the entire recordings, thus saving time and effort for practitioners in bio-acoustics.

[1]  Honglak Lee,et al.  Unsupervised feature learning for audio classification using convolutional deep belief networks , 2009, NIPS.

[2]  Nikos Fakotakis,et al.  An Overview of Automatic Audio Segmentation , 2014 .

[3]  N. Wiener Generalized harmonic analysis , 1930 .

[4]  Meinard Mller,et al.  Fundamentals of Music Processing: Audio, Analysis, Algorithms, Applications , 2015 .

[5]  Whitlow W. L. Au,et al.  Principles of marine bioacoustics , 2008 .

[6]  M. Tasker Marine Mammals and Noise, by W.J. Richardson, C.R. Greene, C.I. Malme and D.H. Thomson. Academic Press, London and San Diego, 1998, 576 pp. Price: £29.95. ISBN 0 12588 441 9. , 2000 .

[7]  Haru Matsumoto,et al.  An Overview of Fixed Passive Acoustic Observation Methods for Cetaceans , 2007 .

[8]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[9]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[10]  P. O. Thompson,et al.  Underwater Sounds from the Blue Whale, Balaenoptera musculus , 1971 .

[11]  E. B. Newman,et al.  A Scale for the Measurement of the Psychological Magnitude Pitch , 1937 .

[12]  Jay Barlow,et al.  ESTIMATES OF SPERM WHALE ABUNDANCE IN THE NORTHEASTERN TEMPERATE PACIFIC FROM A COMBINED ACOUSTIC AND VISUAL SURVEY , 2005 .

[13]  Stephen Brooks,et al.  A two phase method for general audio segmentation , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[14]  L. May-Collado,et al.  A characterization of Guyana dolphin (Sotalia guianensis) whistles from Costa Rica: the importance of broadband recording systems. , 2009, The Journal of the Acoustical Society of America.

[15]  Richard E. Turner,et al.  Learning Stationary Time Series using Gaussian Processes with Nonparametric Kernels , 2015, NIPS.

[16]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[17]  T. Akamatsu,et al.  Seasonal changes in the local distribution of Yangtze finless porpoises related to fish presence , 2012 .

[18]  C. Clark,et al.  Acoustic masking in marine ecosystems: intuitions, analysis, and implication , 2009 .

[19]  Felipe A. Tobar,et al.  Spectral Mixture Kernels for Multi-Output Gaussian Processes , 2017, NIPS.

[20]  Luke Rendell,et al.  A new song recorded from blue whales in the Corcovado Gulf, Southern Chile, and an acoustic link to the Eastern Tropical Pacific , 2014 .

[21]  Peter L. Tyack,et al.  A digital acoustic recording tag for measuring the response of wild marine mammals to sound , 2003 .

[22]  Thang D. Bui,et al.  Design of Covariance Functions using Inter-Domain Inducing Variables , 2015 .

[23]  A. Khintchine Korrelationstheorie der stationären stochastischen Prozesse , 1934 .

[24]  Whitlow W L Au,et al.  Common humpback whale (Megaptera novaeangliae) sound types for passive acoustic monitoring. , 2011, The Journal of the Acoustical Society of America.

[25]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[26]  Kevin P. Murphy,et al.  Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.