A new survey method using convolutional neural networks for automatic classification of bird calls

Abstract Habitat and reproduction surveys of raptors are common components of environmental impact assessments. Raptors are few in number and widely dispersed; therefore, raptor surveys require greater survey frequency compared to surveys for many other bird species, which, in turn, may result in extra costs as well as negative impact on raptors caused by observers. In this research, we propose a new, efficient, method for surveying raptors which is low-impact on the raptors themselves. The target species of this study was the Northern goshawk (Accipiter gentilis), categorized as “Near Threatened” in the Japanese Red List, its population estimated at 5010–8950 in 2008. We developed a system which can automatically classify five classes of sounds, including goshawk calls, using a convolutional neural network. To establish the system's applicability as a survey method, we then additionally verified three factors; (1) applicability of the method in different locales, (2) optimal distance from the nest to recording device, and (3) the ability to gauge reproductive states. We report an overall accuracy of 97.0% for this system. This system could classify, with high accuracy, goshawk calls “kek-kek-kek” and “whee-oo” across different locales. This system could classify sounds collected by recorders placed far from nests and within the forest, not placed on the nest itself. Some limitations of the system, notably limitation in data for verification, remain to be improved through further studies. This survey method can be used to judge whether an area was inhabited by goshawks or not, and their approximate reproductive state, based on a three-hour recording of environmental sounds, with required fieldwork limited to the placing and collection of recorders, and minimal additional human input, due to the system classifying the sounds automatically.

[1]  Takayuki Okatani Deep Learning(Technical Survey) , 2014 .

[2]  Hanna Pamula,et al.  Towards the Acoustic Monitoring of Birds Migrating at Night , 2019, Biodiversity Information Science and Standards.

[3]  H. Southern,et al.  Handbook of the Birds of Europe, the Middle East and North Africa; the Birds of the Western Palearctic , 1978 .

[4]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[6]  Mario Lasseck,et al.  Audio-based Bird Species Identification with Deep Convolutional Neural Networks , 2018, CLEF.

[7]  J. S. Marks,et al.  Northern Goshawk (Accipiter gentilis) , 2020 .

[8]  Paul Roe,et al.  3D convolution recurrent neural networks for bird sound detection , 2018 .

[9]  Kenneth L. Dudley,et al.  Listening to ecosystems: data-rich acoustic monitoring through landscape-scale sensor networks , 2017, Ecological Research.

[10]  V. Bretagnolle Calls of Wilson's Storm Petrel: Functions, Individual and Sexual Recognitions, and Geographic Variation , 1989 .

[11]  V. Penteriani The annual and diel cycles of Goshawk vocalizations at nest sites , 2001 .

[12]  Aren Jansen,et al.  CNN architectures for large-scale audio classification , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  S. Higashi,et al.  NEST-SITE SELECTION OF NORTHERN GOSHAWKS AND EURASIAN SPARROWHAWKS IN A FRAGMENTED LANDSCAPE IN NORTHERN JAPAN , 2007 .

[14]  Jay H. Schnell Nesting Behavior and Food Habits of Goshawks in the Sierra Nevada of California , 1958 .

[15]  Edward Carney,et al.  Course notes and workbook for introduction to sound : acoustics for the hearing and speech sciences , 1992 .

[16]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[17]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[18]  Sanyam Shukla,et al.  Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification , 2016, 2016 IEEE 6th International Conference on Advanced Computing (IACC).

[19]  V. Penteriani Dawn and Morning Goshawk Courtship Vocalizations as a Method for Detecting Nest Sites , 1999 .

[20]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[21]  So Kazama,et al.  WATERSHED ENVIRONMENT EVALUATION FROM THE VIEWPOINT OF BIODIVERSITY USING HSI , 2007 .