Automated extraction of odontocete whistle contours.

Many odontocetes produce frequency modulated tonal calls known as whistles. The ability to automatically determine time × frequency tracks corresponding to these vocalizations has numerous applications including species description, identification, and density estimation. This work develops and compares two algorithms on a common corpus of nearly one hour of data collected in the Southern California Bight and at Palmyra Atoll. The corpus contains over 3000 whistles from bottlenose dolphins, long- and short-beaked common dolphins, spinner dolphins, and melon-headed whales that have been annotated by a human, and released to the Moby Sound archive. Both algorithms use a common signal processing front end to determine time × frequency peaks from a spectrogram. In the first method, a particle filter performs Bayesian filtering, estimating the contour from the noisy spectral peaks. The second method uses an adaptive polynomial prediction to connect peaks into a graph, merging graphs when they cross. Whistle contours are extracted from graphs using information from both sides of crossings. The particle filter was able to retrieve 71.5% (recall) of the human annotated tonals with 60.8% of the detections being valid (precision). The graph algorithm's recall rate was 80.0% with a precision of 76.9%.

[1]  William F. Perrin,et al.  Evidence for two species of common dolphins (genus Delphinus) from the eastern North Pacific , 1994, Contributions in science.

[2]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Anna Scaglione,et al.  Product high-order ambiguity function for multicomponent polynomial-phase signal modeling , 1998, IEEE Trans. Signal Process..

[4]  Daniel P. W. Ellis,et al.  Call detection and extraction using Bayesian inference , 2006 .

[5]  N. Gordon,et al.  Novel approach to nonlinear/non-Gaussian Bayesian state estimation , 1993 .

[6]  Chao Wang,et al.  A versatile pitch tracking algorithm: from human speech to killer whale vocalizations. , 2009, The Journal of the Acoustical Society of America.

[7]  Paul Dierckx,et al.  Curve and surface fitting with splines , 1994, Monographs on numerical analysis.

[8]  P L Tyack,et al.  A quantitative measure of similarity for tursiops truncatus signature whistles. , 1993, The Journal of the Acoustical Society of America.

[9]  Cornel Ioana,et al.  Analysis of underwater mammal vocalisations using time–frequency-phase tracker , 2010 .

[10]  Nando de Freitas,et al.  An Introduction to Sequential Monte Carlo Methods , 2001, Sequential Monte Carlo Methods in Practice.

[11]  Christian Musso,et al.  Improving Regularised Particle Filters , 2001, Sequential Monte Carlo Methods in Practice.

[12]  Nando de Freitas,et al.  Sequential Monte Carlo Methods in Practice , 2001, Statistics for Engineering and Information Science.

[13]  Olivier Adam Segmentation of Killer Whale Vocalizations Using the Hilbert-Huang Transform , 2008, EURASIP J. Adv. Signal Process..

[14]  Fred N. Spiess,et al.  Flip‐Floating Instrument Platform , 1963 .

[15]  W. Au,et al.  The broadband social acoustic signaling behavior of spinner and spotted dolphins. , 2003, The Journal of the Acoustical Society of America.

[16]  Judith C Brown,et al.  Automatic classification of killer whale vocalizations using dynamic time warping. , 2007, The Journal of the Acoustical Society of America.

[17]  David K. Mellinger,et al.  Ishmael : 1.0 user's guide ; Ishmael : integrated system for holistic multi-channel acoustic exploration and localization , 2002 .

[18]  A. Thode,et al.  PAMGUARD : SEMIAUTOMATED , OPEN SOURCE SOFTWARE FOR REAL-TIME ACOUSTIC DETECTION AND LOCALISATION OF CETACEANS , 2008 .

[19]  Kristine L. Bell,et al.  A Tutorial on Particle Filters for Online Nonlinear/NonGaussian Bayesian Tracking , 2007 .

[20]  Paul R. White,et al.  Introduction to particle filters for tracking applications in the passive acoustic monitoring of cetaceans , 2008 .

[21]  Thippur V. Sreenivas,et al.  IF estimation using higher order TFRs , 2002, Signal Process..

[22]  William H. Press,et al.  The Art of Scientific Computing Second Edition , 1998 .

[23]  Mandar Chitre,et al.  Spectrogram denoising and automated extraction of the fundamental frequency variation of dolphin whistles. , 2008, The Journal of the Acoustical Society of America.

[24]  Len Thomas,et al.  Estimating cetacean population density using fixed passive acoustic sensors: an example with Blainville's beaked whales. , 2009, The Journal of the Acoustical Society of America.

[25]  Jay Barlow,et al.  ACOUSTIC IDENTIFICATION OF NINE DELPHINID SPECIES IN THE EASTERN TROPICAL PACIFIC OCEAN , 2000 .

[26]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[27]  Julie N Oswald,et al.  A tool for real-time acoustic species identification of delphinid whistles. , 2005, The Journal of the Acoustical Society of America.

[28]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[29]  William A. Watkins,et al.  The harmonic interval : fact or artifact in spectral analysis of pulse trains , 1968 .

[30]  Neil J. Gordon,et al.  Editors: Sequential Monte Carlo Methods in Practice , 2001 .

[31]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[32]  M. Goldstein,et al.  Multivariate Analysis: Methods and Applications , 1984 .

[33]  S. Datta,et al.  Dolphin whistle classification for determining group identities , 2002, Signal Process..

[34]  G. Kitagawa Monte Carlo Filter and Smoother for Non-Gaussian Nonlinear State Space Models , 1996 .

[35]  Yu Shi,et al.  Spectrogram-based formant tracking via particle filters , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..