Feature Learning and Automatic Segmentation for Dolphin Communication Analysis

The study of dolphin cognition involves intensive analysis of animal vocalizations recorded in the field. We address the automated analysis of audible dolphin communication and propose a system that automatically discovers patterns in dolphin signals. These patterns are invariant to frequency shifts and time-warping transformations. The discovery algorithm is based on feature learning and unsupervised time series segmentation using hidden Markov models. Researchers can inspect the patterns visually and interactively run comparative statistics on the distributions of dolphin signals in different behavioral contexts. Our results indicate that the system provides meaningful patterns to marine biologists and that the comparative statistics agree with the biologists' domain knowledge.
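
To make the segmentation step concrete, the following is a minimal sketch of how a hidden Markov model can split a feature sequence into segments via Viterbi decoding. It is not the system described above: the one-dimensional Gaussian emissions, the hand-set means and standard deviations, the `stay_prob` parameter, and the function names `viterbi_segment` and `to_segments` are all assumptions made for illustration. In an unsupervised setting the model parameters would be estimated from data (for example with Baum-Welch) rather than fixed by hand.

```python
import math


def viterbi_segment(obs, means, stds, stay_prob=0.9):
    """Return the most likely hidden state for each observation.

    Assumes at least two states, 1-D Gaussian emissions, a uniform
    initial distribution, and a transition matrix that keeps the
    current state with probability `stay_prob` (all illustrative).
    """
    n_states = len(means)
    switch_prob = (1.0 - stay_prob) / (n_states - 1)

    def log_gauss(x, mu, sd):
        return -0.5 * math.log(2 * math.pi * sd * sd) - (x - mu) ** 2 / (2 * sd * sd)

    def log_trans(i, j):
        return math.log(stay_prob if i == j else switch_prob)

    # Initialisation with a uniform prior over states.
    score = [
        math.log(1.0 / n_states) + log_gauss(obs[0], means[s], stds[s])
        for s in range(n_states)
    ]
    back = []  # back[t][j] = best predecessor of state j at step t + 1

    # Recursion: keep the best-scoring path into every state.
    for x in obs[1:]:
        prev, score, pointers = score, [], []
        for j in range(n_states):
            best = max(range(n_states), key=lambda i: prev[i] + log_trans(i, j))
            pointers.append(best)
            score.append(prev[best] + log_trans(best, j) + log_gauss(x, means[j], stds[j]))
        back.append(pointers)

    # Backtracking from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def to_segments(path):
    """Collapse a state path into (state, start, end_exclusive) runs."""
    segments = []
    start = 0
    for t in range(1, len(path) + 1):
        if t == len(path) or path[t] != path[start]:
            segments.append((path[start], start, t))
            start = t
    return segments


# Toy feature track: low values, a burst of high values, then low again.
toy = [0.1, -0.2, 0.0, 5.1, 4.9, 5.2, 0.1, 0.0]
toy_path = viterbi_segment(toy, means=[0.0, 5.0], stds=[1.0, 1.0])
toy_segments = to_segments(toy_path)
```

On the toy track, the decoded path switches state where the values jump, so `toy_segments` marks three runs. Each run would correspond to one candidate unit that a researcher could then inspect or compare across contexts.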
