Active contour-based detection of estuarine dolphin whistles in spectrogram images

Abstract An algorithm for detecting tonal vocalizations from estuarine dolphin (Sotalia guianensis) specimens without interference of a human operator was developed. The raw audio data collected from a passive monitoring sensor in the Cananeia underwater soundscape was converted to spectrogram images. Detection is a four-step method: first, ridge maps are obtained from the spectrogram images; second, a probabilistic Hough transform algorithm is applied to detect ridges similar to thick line segments, referred to as line-like, which are adjusted to the geometry of the whistles in the images via an active contour algorithm; third, feature vectors are built from the geometry of each detected whistle, with 9 descriptive features; and fourth, the detections are fed to a random forest classifier to parse out mistakes by the detection process. We developed a system for classifying the characteristic patterns detected as Sotalia guianensis whistles or random empty detections. We obtained accuracy of 0.977 and F1-score of 0.981.

[1]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[2]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[3]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[4]  Hussein Hussein,et al.  Acoustic Event Classification Using Convolutional Neural Networks , 2017, GI-Jahrestagung.

[5]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[6]  Tess Gridley,et al.  Automatic classification of whistles from coastal dolphins of the southern African subregion. , 2017, The Journal of the Acoustical Society of America.

[7]  Arik Kershenbaum,et al.  An image processing based paradigm for the extraction of tonal sounds in cetacean communications. , 2013, The Journal of the Acoustical Society of America.

[8]  Jiri Matas,et al.  Progressive probabilistic Hough transform for line detection , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[9]  L R Padovese,et al.  Underwater soundscape of marine protected areas in the south Brazilian coast. , 2016, Marine pollution bulletin.

[10]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[11]  Alejandro F. Frangi,et al.  Muliscale Vessel Enhancement Filtering , 1998, MICCAI.

[12]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[13]  Len Thomas,et al.  A method for detecting whistles, moans, and other frequency contour sounds. , 2011, The Journal of the Acoustical Society of America.

[14]  C. Bazúa-Durán Differences in the whistle characteristics and repertoire of Bottlenose and Spinner Dolphins. , 2004, Anais da Academia Brasileira de Ciencias.

[15]  S. Elwen,et al.  Identification and Characteristics of Signature Whistles in Wild Bottlenose Dolphins (Tursiops truncatus) from Namibia , 2014, PloS one.

[16]  John A. Hildebrand,et al.  The behavioral context of common dolphin (Delphinus sp.) vocalizations , 2012 .

[17]  Marielle Malfante,et al.  Automatic fish sounds classification. , 2016, The Journal of the Acoustical Society of America.

[18]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[19]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[20]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[21]  Douglas Gillespie,et al.  Detection and classification of right whale calls using an 'edge' detector operating on a smoothed spectrogram , 2004 .

[22]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.