NIPS4Bplus: a richly annotated birdsong audio dataset

Recent advances in birdsong detection and classification have approached a limit due to the lack of fully annotated recordings. In this paper, we present NIPS4Bplus, the first richly annotated birdsong audio dataset, that is comprised of recordings containing bird vocalisations along with their active species tags plus the temporal annotations acquired for them. Statistical information about the recordings, their species specific tags and their temporal annotations are presented along with example uses. NIPS4Bplus could be used in various ecoacoustic tasks, such as training models for bird population monitoring, species classification, birdsong vocalisation detection and classification.

[1]  THE SINGING LIFE OF BIRDS: THE ART AND SCIENCE OF LISTENING TO BIRDSONG , 2006 .

[2]  David A. Luther,et al.  Signaller: receiver coordination and the timing of communication in Amazonian birds , 2008, Biology Letters.

[3]  Effects of Vegetation and Background Noise on the Detection Process in Auditory Avian Point-Count Surveys , 2008 .

[4]  T. Scott Brandes,et al.  Automated sound recording and analysis techniques for bird surveys and conservation , 2008, Bird Conservation International.

[5]  Murray G. Efford,et al.  Bird population density estimated from acoustic signals , 2009 .

[6]  David A Luther,et al.  Production and perception of communicatory signals in a noisy environment , 2009, Biology Letters.

[7]  Xiaoli Z. Fern,et al.  Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. , 2012, The Journal of the Acoustical Society of America.

[8]  P. Tyack,et al.  Estimating animal population density using passive acoustics , 2012, Biological reviews of the Cambridge Philosophical Society.

[9]  Stan G. Sovern,et al.  Barred owls and landscape attributes influence territory occupancy of northern spotted owls , 2014, The Journal of wildlife management.

[10]  Kathryn T. A. Lambert,et al.  A low-cost, yet simple and highly repeatable system for acoustically surveying cryptic species , 2014 .

[11]  Charles E. Taylor,et al.  Bird-DB: A database for annotated bird song sequences , 2015, Ecol. Informatics.

[12]  Germán Castellanos-Domínguez,et al.  Multiple Instance Learning-Based Birdsong Classification Using Unsupervised Recording Segmentation , 2015, IJCAI.

[13]  Matthew D. Frey,et al.  Using digital recordings and sonogram analysis to obtain counts of yellow rails , 2016 .

[14]  Bhiksha Raj,et al.  Audio Event Detection using Weakly Labeled Data , 2016, ACM Multimedia.

[15]  Hervé Glotin,et al.  LifeCLEF Bird Identification Task 2016: The arrival of Deep learning , 2016, CLEF.

[16]  Jan Schlüter,et al.  Learning to Pinpoint Singing Voice from Weakly Labeled Examples , 2016, ISMIR.

[17]  Tuomas Virtanen,et al.  Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network , 2017, ArXiv.

[18]  Ilyas Potamitis,et al.  Deep Networks tag the location of bird vocalisations on audio spectrograms , 2017, ArXiv.

[19]  Erin M. Bayne,et al.  Recommendations for acoustic recognizer performance assessment with application to five common automated signal recognition programs , 2017 .

[20]  Justin Salamon,et al.  Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification , 2016, IEEE Signal Processing Letters.

[21]  Thomas Pellegrini,et al.  Densely connected CNNs for bird audio detection , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[22]  Andreas Rauber,et al.  LifeCLEF Bird Identification Task 2017 , 2017, CLEF.

[23]  Tuomas Virtanen,et al.  Stacked convolutional and recurrent neural networks for bird audio detection , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[24]  Dan Stowell,et al.  Data-efficient weakly supervised learning for low-resource audio event detection using deep learning , 2018, DCASE.

[25]  Hervé Glotin,et al.  Unsupervised Bioacoustic Segmentation by Hierarchical Dirichlet Process Hidden Markov Model , 2018, Multimedia Tools and Applications for Environmental & Biodiversity Informatics.

[26]  Dan Stowell,et al.  Deep Learning for Audio Event Detection and Tagging on Low-Resource Datasets , 2018, Applied Sciences.

[27]  Vincent Lostanlen,et al.  Birdvox-Full-Night: A Dataset and Benchmark for Avian Flight Call Detection , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).