Bird detection in audio: A survey and a challenge

Many biological monitoring projects rely on acoustic detection of birds. Despite increasingly large datasets, this detection is often manual or semi-automatic, requiring manual tuning/postprocessing. We review the state of the art in automatic bird sound detection, and identify a widespread need for tuning-free and species-agnostic approaches. We introduce new datasets and an IEEE research challenge to address this need, to make possible the development of fully automatic algorithms for bird sound detection.

[1]  Stephen J. Roberts,et al.  Detecting bird sound in unknown acoustic background using crowdsourced training data , 2015, ArXiv.

[2]  DeLiang Wang,et al.  Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[3]  Abraham L Borker,et al.  Vocal Activity as a Low Cost and Scalable Index of Seabird Colony Size , 2014, Conservation biology : the journal of the Society for Conservation Biology.

[4]  Nadia Pieretti,et al.  Acoustic Indices for Biodiversity Assessment and Landscape Investigation , 2014 .

[5]  Richard L. Hutto,et al.  Humans versus autonomous recording units: a comparison of point-count results , 2009 .

[6]  Paul Roe,et al.  A toolbox for animal call recognition , 2012 .

[7]  Stephen R. Baillie,et al.  Species traits explain variation in detectability of UK birds , 2014 .

[8]  Mathieu Lagrange,et al.  Detection of overlapping acoustic events using a temporally-constrained probabilistic model , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Seppo Ilmari Fagerlund,et al.  Bird Species Recognition Using Support Vector Machines , 2007, EURASIP J. Adv. Signal Process..

[10]  Paul E. Allen,et al.  Random Forest for improved analysis efficiency in passive acoustic monitoring , 2014, Ecol. Informatics.

[11]  Francesco Piazza,et al.  A Deep Neural Network approach for Voice Activity Detection in multi-room domestic scenarios , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[12]  Peter Jan DETECTION OF SINUSOIDAL SIGNALS IN NOISE BY PROBABILISTIC MODELLING OF THE SPECTRAL MAGNITUDE SHAPE AND PHASE CONTINUITY , 2011 .

[13]  Louis Ranjard,et al.  Unsupervised bird song syllable classification using evolving neural networks. , 2008, The Journal of the Acoustical Society of America.

[14]  Todor Ganchev,et al.  Bird acoustic activity detection based on morphological filtering of the spectrogram , 2015 .

[15]  Jordi Bonada,et al.  Bird Song Synthesis Based on Hidden Markov Models , 2016, INTERSPEECH.

[16]  H. C. Card,et al.  Birdsong recognition using backpropagation and multivariate statistics , 1997, IEEE Trans. Signal Process..

[17]  Dan Stowell,et al.  Acoustic event detection for multiple overlapping similar sources , 2015, 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[18]  D. MacKenzie Occupancy Estimation and Modeling: Inferring Patterns and Dynamics of Species Occurrence , 2005 .

[19]  R. Ranft Natural sound archives: past, present and future. , 2004, Anais da Academia Brasileira de Ciencias.

[20]  Dan Stowell,et al.  An Open Dataset for Research on Audio Field Recording Archives: freefield1010 , 2013, Semantic Audio.

[21]  P. Tyack,et al.  Estimating animal population density using passive acoustics , 2012, Biological reviews of the Cambridge Philosophical Society.

[22]  Michael Towsey,et al.  A practical comparison of manual and autonomous methods for acoustic monitoring , 2013 .

[23]  Andreas Stolcke,et al.  Bird species recognition combining acoustic and sequence modeling , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24]  Michael W. Towsey,et al.  Visualization of Long-duration Acoustic Recordings of the Environment , 2014, ICCS.

[25]  Dan Stowell,et al.  Automatic large-scale classification of bird sounds is strongly improved by unsupervised feature learning , 2014, PeerJ.

[26]  Xiaoli Z. Fern,et al.  A Syllable-Level Probabilistic Framework for Bird Species Identification , 2009, 2009 International Conference on Machine Learning and Applications.

[27]  Xiaoli Z. Fern,et al.  Time-frequency segmentation of bird song in noisy acoustic environments , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[28]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[29]  Juan Manuel Górriz,et al.  Voice Activity Detection. Fundamentals and Speech Recognition System Robustness , 2007 .

[30]  Rachel T. Buxton,et al.  Measuring nocturnal seabird activity and status using acoustic recording devices: applications for island restoration , 2012 .

[31]  Peter Jancovic,et al.  Detection of sinusoidal signals in noise by probabilistic modelling of the spectral magnitude shape and phase continuity , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Peter Jancovic,et al.  Automatic Detection and Recognition of Tonal Bird Sounds in Noisy Environments , 2011, EURASIP J. Adv. Signal Process..

[33]  D Margoliash,et al.  Template-based automatic recognition of birdsong syllables from continuous recordings. , 1996, The Journal of the Acoustical Society of America.

[34]  Paul Roe,et al.  Timed Probabilistic Automaton: A Bridge between Raven and Song Scope for Automatic Species Recognition , 2013, IAAI.

[35]  B. Furnas,et al.  Using automated recorders and occupancy models to monitor common forest birds across a large geographic region , 2015 .

[36]  VirtanenTuomas,et al.  Detection and Classification of Acoustic Scenes and Events , 2018 .