Deep Machine Learning Techniques for the Detection and Classification of Sperm Whale Bioacoustics

We implemented Machine Learning (ML) techniques to advance the study of sperm whale (Physeter macrocephalus) bioacoustics. This entailed employing Convolutional Neural Networks (CNNs) to construct an echolocation click detector designed to classify spectrograms generated from sperm whale acoustic data according to the presence or absence of a click. The click detector achieved 99.5% accuracy in classifying 650 spectrograms. The successful application of CNNs to clicks reveals the potential of future studies to train CNN-based architectures to extract finer-scale details from cetacean spectrograms. Long short-term memory and gated recurrent unit recurrent neural networks were trained to perform classification tasks, including (1) “coda type classification” where we obtained 97.5% accuracy in categorizing 23 coda types from a Dominica dataset containing 8,719 codas and 93.6% accuracy in categorizing 43 coda types from an Eastern Tropical Pacific (ETP) dataset with 16,995 codas; (2) “vocal clan classification” where we obtained 95.3% accuracy for two clan classes from Dominica and 93.1% for four ETP clan types; and (3) “individual whale identification” where we obtained 99.4% accuracy using two Dominica sperm whales. These results demonstrate the feasibility of applying ML to sperm whale bioacoustics and establish the validity of constructing neural networks to learn meaningful representations of whale vocalizations.

[1]  V B Deecke,et al.  Quantifying complex patterns of bioacoustic variation: use of a neural network to compare killer whale (Orcinus orca) dialects. , 1999, The Journal of the Acoustical Society of America.

[2]  H. Whitehead,et al.  Individualized social preferences and long-term social fidelity between social units of sperm whales , 2015, Animal Behaviour.

[3]  Christian Rutz,et al.  Animal cultures matter for conservation , 2019, Science.

[4]  H. Whitehead,et al.  Kinship influences sperm whale social organization within, but generally not among, social units , 2018, Royal Society Open Science.

[5]  W E Schevill,et al.  Underwater Listening to the White Porpoise (Delphinapterus leucas). , 1949, Science.

[6]  S. Mesnick Genetic relatedness in sperm whales: Evidence and cultural implications , 2001, Behavioral and Brain Sciences.

[7]  J. Mann,et al.  Social evolution in toothed whales. , 1998, Trends in ecology & evolution.

[8]  S Dawson,et al.  Vocal behavior of male sperm whales: why do they click? , 2001, The Journal of the Acoustical Society of America.

[9]  Peter Stone,et al.  Reinforcement learning , 2019, Scholarpedia.

[10]  H. Whitehead,et al.  Cultural turnover among Galápagos sperm whales , 2016, Royal Society Open Science.

[11]  Ammie K. Kalan,et al.  Human impact erodes chimpanzee behavioral diversity , 2019, Science.

[12]  H. Whitehead,et al.  Indications of fitness differences among vocal clans of sperm whales , 2007, Behavioral Ecology and Sociobiology.

[13]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[14]  J. Hildebrand,et al.  Comparison of machine learning techniques for the classification of echolocation clicks from three species of odontocetes , 2008 .

[15]  Dong Yu,et al.  Exploring convolutional neural network structures and optimization techniques for speech recognition , 2013, INTERSPEECH.

[16]  Cetacean Behavior: Mechanisms and Functions, Louis M. Herman (Ed.). John Wiley, New York (1980), xiii , 1981 .

[17]  H. Whitehead,et al.  Multilevel Societies of Female Sperm Whales (Physeter macrocephalus) in the Atlantic and Pacific: Why Are They So Different? , 2012, International Journal of Primatology.

[18]  Andrea Vedaldi,et al.  Cross Pixel Optical Flow Similarity for Self-Supervised Learning , 2018, ACCV.

[19]  Sperm whale feeding variation by location, year, social group and clan: evidence from stable isotopes , 2007 .

[20]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[21]  L. Miller,et al.  Sperm whale clicks: directionality and source level revisited. , 2000, The Journal of the Acoustical Society of America.

[22]  Vahid Mirjalili,et al.  Python machine learning : machine learning and deep learning with Python, scikit-learn, and TensorFlow , 2017 .

[23]  H. Whitehead,et al.  Sperm whale social units : variation and change , 1998 .

[24]  Mark P. Johnson,et al.  Sperm whale behaviour indicates the use of echolocation click buzzes ‘creaks’ in prey capture , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[25]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[26]  H. Whitehead,et al.  Vocal clans in sperm whales (Physeter macrocephalus) , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[27]  Wilfried A M Beslin,et al.  Automatic acoustic estimation of sperm whale size distributions achieved through machine recognition of on-axis clicks. , 2018, The Journal of the Acoustical Society of America.

[28]  Hal Whitehead,et al.  Distinctive vocalizations from mature male sperm whales (Physeter macrocephalus) , 1988 .

[29]  H. Whitehead,et al.  Group-specific dialects and geographical variation in coda repertoire in South Pacific sperm whales , 1997, Behavioral Ecology and Sociobiology.

[30]  H. Whitehead,et al.  Population Structure of Female and Immature Sperm Whales (Physeter macrocephalus) off the Galápagos Islands , 1992 .

[31]  H L Roitblat,et al.  The neural network classification of false killer whale (Pseudorca crassidens) vocalizations. , 1998, The Journal of the Acoustical Society of America.

[32]  Pierre Vandergheynst,et al.  Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[33]  P. Madsen,et al.  Sperm whale codas may encode individuality as well as clan identity. , 2016, The Journal of the Acoustical Society of America.

[34]  L. V. Worthington,et al.  Underwater Sounds heard from Sperm Whales , 1957, Nature.

[35]  Michel André,et al.  Neural network-based sperm whale click classification , 2007 .

[36]  Lukás Burget,et al.  Strategies for training large scale neural network language models , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[37]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[38]  L. Marino Cetacean brain evolution: Multiplication generates complexity. , 2004 .

[40]  L. Lefebvre,et al.  Cetaceans Have Complex Brains for Complex Cognition , 2007, PLoS biology.

[41]  G. Roth,et al.  Evolution of the brain and intelligence , 2005, Trends in Cognitive Sciences.

[42]  Paris Smaragdis,et al.  Automatic identification of individual killer whales. , 2010, The Journal of the Acoustical Society of America.

[43]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[44]  Differences in sperm whale codas between two waters off Japan: possible geographic separation of vocal clans , 2014 .

[45]  Nobuyuki Miyazaki,et al.  Diel diving behavior of sperm whales off Japan , 2007 .

[46]  P. Madsen,et al.  Socially segregated, sympatric sperm whale clans in the Atlantic Ocean , 2016, Royal Society Open Science.

[47]  Peter L. Tyack,et al.  Cetacean societies : field studies of dolphins and whales , 2001 .

[48]  Juan Carlos Fernández,et al.  Multiobjective evolutionary algorithms to identify highly autocorrelated areas: the case of spatial distribution in financially compromised farms , 2014, Ann. Oper. Res..

[49]  Tyler M. Schulz,et al.  Individually distinctive acoustic features in sperm whale codas , 2011, Animal Behaviour.

[50]  H. Whitehead,et al.  Movements, habitat use and feeding success of cultural clans of South Pacific sperm whales , 2004 .

[51]  Navdeep Jaitly,et al.  Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.

[52]  Louis Ranjard,et al.  A hidden Markov model approach to indicate Bryde’s whale acoustics , 2018 .

[53]  Ming Yang,et al.  DeepFace: Closing the Gap to Human-Level Performance in Face Verification , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  H. Whitehead,et al.  How does social behavior differ among sperm whale clans , 2015 .

[55]  William A. Watkins,et al.  Sperm whale codas , 1977 .

[56]  P. Tyack,et al.  Behavior and social structure of the sperm whales of Dominica, West Indies , 2014 .

[57]  H. Whitehead,et al.  Individual, unit and vocal clan level identity cues in sperm whale codas , 2016, Royal Society Open Science.

[58]  Zenghui Wang,et al.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review , 2017, Neural Computation.