Improving biodiversity assessment via unsupervised separation of biological sounds from long-duration recordings

Investigating the dynamics of biodiversity via passive acoustic monitoring is a challenging task, owing to the difficulty of identifying different animal vocalizations. Several indices have been proposed to measure acoustic complexity and to predict biodiversity. Although these indices perform well under low-noise conditions, they may be biased when environmental and anthropogenic noises are involved. In this paper, we propose a periodicity coded non-negative matrix factorization (PC-NMF) for separating different sound sources from a spectrogram of long-term recordings. The PC-NMF first decomposes a spectrogram into two matrices: spectral basis matrix and encoding matrix. Next, on the basis of the periodicity of the encoding information, the spectral bases belonging to the same source are grouped together. Finally, distinct sources are reconstructed on the basis of the cluster of the basis matrix and the corresponding encoding information, and the noise components are then removed to facilitate more accurate monitoring of biological sounds. Our results show that the PC-NMF precisely enhances biological choruses, effectively suppressing environmental and anthropogenic noises in marine and terrestrial recordings without a need for training data. The results may improve behaviour assessment of calling animals and facilitate the investigation of the interactions between different sound sources within an ecosystem.

[1]  Israel Cohen,et al.  Speech enhancement using a noncausal a priori SNR estimator , 2004, IEEE Signal Processing Letters.

[2]  Almo Farina,et al.  Ecoacoustics: the Ecological Investigation and Interpretation of Environmental Sound , 2015, Biosemiotics.

[3]  Luis J. Villanueva-Rivera,et al.  Soundscape Ecology: The Science of Sound in the Landscape , 2011 .

[4]  Rino,et al.  BRIDGING BIODIVERSITY DATA GAPS : RECOMMENDATIONS TO MEET USERS ’ DATA NEEDS , 2013 .

[5]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[6]  Pascal Scalart,et al.  Speech enhancement based on a priori signal to noise estimation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  Yu Tsao,et al.  Speech enhancement using segmental nonnegative matrix factorization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Junfeng Li,et al.  Two-stage binaural speech enhancement with Wiener filter for high-quality speech communication , 2011, Speech Commun..

[9]  Haru Matsumoto,et al.  An Overview of Fixed Passive Acoustic Observation Methods for Cetaceans , 2007 .

[10]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[11]  Jason Wimmer,et al.  Acoustic sensing: Roles and applications in monitoring avian biodiversity , 2015 .

[12]  Eric Plourde,et al.  Auditory-Based Spectral Amplitude Estimators for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Patrik O. Hoyer,et al.  Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[14]  Julie N. Oswald,et al.  A review and inventory of fixed autonomous recorders for passive acoustic monitoring of marine mammals: 2013 state-of-the-industry , 2013, 2013 IEEE/OES Acoustics in Underwater Geosciences Symposium.

[15]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[16]  Richard S. Sutton,et al.  A Unified View , 1998 .

[17]  Yu Tsao,et al.  Generalized maximum a posteriori spectral amplitude estimation for speech enhancement , 2016, Speech Commun..

[18]  Sandrine Pavoine,et al.  Author's Personal Copy Ecological Indicators Monitoring Animal Diversity Using Acoustic Indices: Implementation in a Temperate Woodland , 2022 .

[19]  A Kumar,et al.  Biodiversity and Climate Change , 2018 .

[20]  John H. L. Hansen,et al.  Speech Enhancement Based on Generalized Minimum Mean Square Error Estimators and Masking Properties of the Auditory System , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Sandrine Pavoine,et al.  Rapid Acoustic Survey for Biodiversity Appraisal , 2008, PloS one.

[22]  Joseph Vignola,et al.  Dynamics of soundscape in a shallow water marine environment: a study of the habitat of the Indo-Pacific humpback dolphin. , 2015, The Journal of the Acoustical Society of America.

[23]  Stuart H. Gage,et al.  Connecting soundscape to landscape: Which acoustic index best describes landscape configuration? , 2015 .

[24]  P. Leadley,et al.  Impacts of climate change on the future of biodiversity. , 2012, Ecology letters.

[25]  Melvin L. Warren,et al.  DYNAMICS IN SPECIES COMPOSITION OF STREAM FISH ASSEMBLAGES: ENVIRONMENTAL VARIABILITY AND NESTED SUBSETS , 2001 .

[26]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[27]  Mikkel N. Schmidt,et al.  Sparse Non-negative Matrix Factor 2-D Deconvolution , 2006 .

[28]  Paris Smaragdis,et al.  Static and Dynamic Source Separation Using Nonnegative Factorizations: A unified view , 2014, IEEE Signal Processing Magazine.

[29]  Bryan C. Pijanowski,et al.  A primer of acoustic analysis for landscape ecologists , 2011, Landscape Ecology.

[30]  R. McAulay,et al.  Speech enhancement using a soft-decision noise suppression filter , 1980 .

[31]  Almo Farina,et al.  A new methodology to infer the singing activity of an avian community: The Acoustic Complexity Index (ACI) , 2011 .

[32]  Yang Lu,et al.  A geometric approach to spectral subtraction , 2008, Speech Commun..

[33]  Morten Mørup,et al.  Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation , 2006, ICA.

[34]  Haesun Park,et al.  Sparse Nonnegative Matrix Factorization for Clustering , 2008 .

[35]  Nick T. Shears,et al.  Ecoacoustic indices as proxies for biodiversity on temperate reefs , 2016 .

[36]  Peter L Tyack,et al.  Measuring acoustic habitats , 2015, Methods in ecology and evolution.