Estimation of the Perceived Time of Presence of Sources in Urban Acoustic Environments Using Deep Learning Techniques

The impact of urban sound on human beings has often been studied from a negative point of view (noise pollution). In the two last decades, the interest of studying its positive impact has been revealed with the soundscape approach (resourcing spaces). The literature shows that the recognition of sources plays a great role in the way humans are affected by sound environments. There is thus a need for characterizing urban acoustic environments not only with sound pressure measurements but also with source-specific attributes such as their perceived time of presence, dominance or volume. This paper demonstrates, on a controlled dataset, that machine learning techniques based on state of the art neural architectures can predict the perceived time of presence of several sound sources at a sufficient accuracy. To validate this assertion, a corpus of simulated sound scenes is first designed. Perceptual attributes corresponding to those stimuli are gathered through a listening experiment. From the contributions of the individual sound sources available for the simulated corpus, a physical indicator approximating the perceived time of presence of sources is computed and used to train and evaluate a multi-label source detection model. This model predicts the presence of simultaneously active sources from fast third octave spectra, allowing the estimation of perceptual attributes such as pleasantness in urban sound environments at a sufficient degree of precision.

[1]  Pierre Aumond,et al.  Modeling Soundscape Pleasantness Using perceptual Assessments and Acoustic Measurements Along Paths in Urban Context , 2017 .

[2]  Justin Salamon,et al.  Scaper: A library for soundscape synthesis and augmentation , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[3]  Nicolai Petkov,et al.  Reliable detection of audio events in highly noisy environments , 2015, Pattern Recognit. Lett..

[4]  Catherine Lavandier,et al.  Sound quality indicators for urban places in Paris cross-validated by Milan data. , 2015, The Journal of the Acoustical Society of America.

[5]  Bert De Coensel,et al.  Machine Listening for Park Soundscape Quality Assessment , 2018 .

[6]  Jin Yong Jeon,et al.  Relationship between spatiotemporal variability of soundscape and urban morphology in a multifunctional urban area: A case study in Seoul, Korea , 2017 .

[7]  B. Berglund,et al.  Soundscape quality in suburban green areas and city parks. , 2006 .

[8]  R. K. Reddy,et al.  Categorization of environmental sounds , 2009, Biological Cybernetics.

[9]  Jean-Rémy Gloaguen,et al.  Creation of a corpus of realistic urban sound scenes with controlled acoustic properties , 2017 .

[10]  G. Brambilla,et al.  Responses to noise in urban parks and in rural quiet areas , 2006 .

[11]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[12]  Félix Gontier,et al.  Towards perceptual soundscape characterization using event detection algorithms. , 2018 .

[13]  Jian Kang,et al.  Soundscape descriptors and a conceptual framework for developing predictive soundscape models , 2016 .

[14]  Arnaud Can,et al.  Capturing urban traffic noise dynamics through relevant descriptors , 2008 .

[15]  Dick Botteldooren,et al.  Classification of soundscapes of urban public open spaces , 2019, Landscape and Urban Planning.

[16]  A. F. Ramos-Ridao,et al.  Application of a methodology for categorizing and differentiating urban soundscapes using acoustical descriptors and semantic-differential attributes. , 2013, The Journal of the Acoustical Society of America.

[17]  B. Berglund,et al.  A principal components model of soundscape perception. , 2010, The Journal of the Acoustical Society of America.

[18]  Jian Kang,et al.  Towards standardization in soundscape preference assessment , 2011 .

[19]  Henrik Møller Fundamentals of binaural technology , 1991 .

[20]  Ute Jekosch Basic concepts and terms of quality, reconsidered in the context of product-sound quality , 2004 .

[21]  S. Squartini,et al.  DCASE 2016 Acoustic Scene Classification Using Convolutional Neural Networks , 2016, DCASE.

[22]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[23]  Justin Salamon,et al.  A Dataset and Taxonomy for Urban Sound Research , 2014, ACM Multimedia.

[24]  Sanjeev Khudanpur,et al.  Librispeech: An ASR corpus based on public domain audio books , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  Paul Jennings,et al.  The development and application of the emotional dimensions of a soundscape , 2013 .

[26]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[27]  C. Lavandier,et al.  An Efficient Audio Coding Scheme for Quantitative and Qualitative Large Scale Acoustic Monitoring Using the Sensor Grid Approach , 2017, Sensors.

[28]  Catherine Lavandier,et al.  Measurements of acoustic environments for urban soundscapes: choice of homogeneous periods, optimization of durations, and selection of indicators. , 2013, The Journal of the Acoustical Society of America.

[29]  Justin Salamon,et al.  Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification , 2016, IEEE Signal Processing Letters.

[30]  B. Berglund,et al.  Experimental investigation of noise annoyance caused by high-speed trains. , 2007 .

[31]  Emmanuel Vincent,et al.  Sound Event Detection in the DCASE 2017 Challenge , 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[32]  Brian Gygi,et al.  Similarity and categorization of environmental sounds , 2007, Perception & psychophysics.

[33]  Michael Vorländer,et al.  The ITA-Toolbox: An Open Source MATLAB Toolbox for Acoustic Measurements and Signal Processing , 2017 .

[34]  François Pachet,et al.  The bag-of-frames approach to audio pattern recognition: a sufficient model for urban soundscapes but not for polyphonic music. , 2007, The Journal of the Acoustical Society of America.

[35]  Jian Kang,et al.  Modeling subjective evaluation of soundscape quality in urban open spaces: An artificial neural network approach. , 2009, The Journal of the Acoustical Society of America.

[36]  Catherine Lavandier,et al.  Urban soundscape maps modelled with geo-referenced data , 2016 .

[37]  Dan Stowell,et al.  Acoustic Scene Classification: Classifying environments from the sounds they produce , 2014, IEEE Signal Processing Magazine.

[38]  Östen Axelsson,et al.  On urban soundscape mapping : A computer can predict the outcome of soundscape assessments , 2016 .

[40]  Aren Jansen,et al.  Audio Set: An ontology and human-labeled dataset for audio events , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[41]  Heikki Huttunen,et al.  Polyphonic sound event detection using multi label deep neural networks , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[42]  Justin Salamon,et al.  Adaptive Pooling Operators for Weakly Labeled Sound Event Detection , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[43]  J. Pratt Remarks on Zeros and Ties in the Wilcoxon Signed Rank Procedures , 1959 .

[44]  C. Lavandier,et al.  The contribution of sound source characteristics in the assessment of urban soundscapes , 2006 .

[45]  Axel Röbel,et al.  A Morphological Model for Simulating Acoustic Scenes and Its Application to Sound Event Detection , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[46]  D. Botteldooren,et al.  19th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 ACOUSTIC INDICATORS OF SOUNDSCAPE QUALITY AND NOISE ANNOYANCE IN OUTDOOR URBAN AREAS , 2007 .

[47]  Justin Salamon,et al.  The Implementation of Low-cost Urban Acoustic Monitoring Devices , 2016, ArXiv.

[48]  Catherine Lavandier,et al.  Influence of loudness of noise events on perceived sound quality in urban context , 2014 .

[49]  Ankit Shah,et al.  DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System , 2017, DCASE.

[50]  Catherine Lavandier,et al.  A cross-national comparison in assessment of urban park soundscapes in France, Korea, and Sweden through laboratory experiments , 2018 .

[51]  Mario Lasseck Acoustic bird detection with deep convolutional neural networks , 2018, DCASE.

[52]  Seiichiro Namba,et al.  Measurement of habituation to noise using the method of continuous judgment by category , 1988 .

[53]  Tuomas Virtanen,et al.  Sound event detection using spatial features and convolutional recurrent neural network , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).