Multispecies bioacoustic classification using transfer learning of deep convolutional neural networks with pseudo-labeling

Abstract In this study, we evaluated deep convolutional neural networks for classifying the calls of 24 birds and amphibian species detected in ambient field recordings from the tropical mountains of Puerto Rico. Training data were collected using a template-based detection algorithm followed by a manual validation process. As preparing sufficient training data is a major challenge for many deep learning applications, we propose a novel approach that combines transfer learning of a pre-trained deep convolutional neural network (CNN) model and a semi-supervised pseudo-labeling method with a custom loss function to meet this challenge. Our proposed methodology enables the network to be trained in a supervised fashion with labeled and unlabeled data simultaneously, which effectively increases the size of training set and thus boosts the model performance. In classifying a test set of manually validated positive and negative template-based detections, our proposed model achieves 97.7% sensitivity (true positive rate), 96.4% specificity (true negative rate) and 99.5% Area Under a Curve (AUC). This multi-label multi-species classification methodology and its framework can be easily adopted by other acoustic classification problems.

[1]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[2]  Willem Bouten,et al.  A flexible GPS tracking system for studying bird behaviour at multiple scales , 2012, Journal of Ornithology.

[3]  Ian Goodfellow,et al.  Generative adversarial networks , 2020, Commun. ACM.

[4]  T. Mitchell Aide,et al.  Real-time bioacoustics monitoring and automated species identification , 2013, PeerJ.

[5]  Charles M. Bishop,et al.  Novel GPS tracking of sea turtles as a tool for conservation management , 2007 .

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  M. Linkie,et al.  Estimating overlap of daily activity patterns from camera trap data , 2009 .

[8]  Dan Stowell,et al.  Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning , 2014, PeerJ.

[9]  Eske Willerslev,et al.  Environmental DNA - An emerging tool in conservation for monitoring past and present biodiversity , 2015 .

[10]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[12]  Colin Raffel,et al.  librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.

[13]  Douglas W. Yu,et al.  Environmental DNA for wildlife biology and biodiversity monitoring. , 2014, Trends in ecology & evolution.

[14]  J. Ahumada,et al.  Community structure and diversity of tropical forest mammals: data from a global camera trap network , 2011, Philosophical Transactions of the Royal Society B: Biological Sciences.

[15]  James Christie,et al.  Solo: an open source, customizable and inexpensive audio recorder for bioacoustic research , 2017 .

[16]  Margaret Kosmala,et al.  Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning , 2017, Proceedings of the National Academy of Sciences.

[17]  Klaus Riede,et al.  Automatic bird sound detection in long real-field recordings: Applications and tools , 2014 .

[18]  Ronald M. Summers,et al.  Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning , 2016, IEEE Transactions on Medical Imaging.

[19]  Peter Jancovic,et al.  Automatic Detection and Recognition of Tonal Bird Sounds in Noisy Environments , 2011, EURASIP J. Adv. Signal Process..

[20]  Xiaoli Z. Fern,et al.  Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. , 2012, The Journal of the Acoustical Society of America.

[21]  Hervé Glotin,et al.  Automatic acoustic detection of birds through deep learning: The first Bird Audio Detection challenge , 2018, Methods in Ecology and Evolution.

[22]  Alex Rogers,et al.  AudioMoth: Evaluation of a smart open acoustic device for monitoring biodiversity and the environment , 2018 .