IoT based Urban Noise Monitoring in Deep Learning using Historical Reports

In this paper, we propose a new Internet of things (IoT) solution, called the Urban Noise Monitoring (UNM) system, which can classify real-time environmental audio sound using an embedded system such as Raspberry pi 4 and log the data in the Google Cloud. The reported events will be available for future usage, i.e., selection of the safe area for living. The real-time audio classification has been a big challenge for deep learning in environmental sounds due to the high noisy nature of sound. We have implemented a real-time IoT system for urban sound classification and monitored the historical reports generated. We have developed an advanced fusion method using normalization techniques such as peak, RMS, and EBU and an efficient data augmentation method using various factors, including time stretch, pitch shifting, and dynamic range compression. Further, we have integrated the normalization and the augmentation methods into 2D Convolutional Neural Network (CNN) with the TensorFlow framework on Raspberry pi 4 for urban sound classification. Our classification model outperformed the state of the art performance: 95% accuracy with the Urban sound dataset. The outstanding performance confirmed the effectiveness of the proposed method on the IoT system for urban noise monitoring.

[1]  Justin Salamon,et al.  A Dataset and Taxonomy for Urban Sound Research , 2014, ACM Multimedia.

[2]  Nasser Kehtarnavaz,et al.  Real-time hierarchical classification of sound signals for hearing improvement devices , 2018 .

[3]  Yugyung Lee,et al.  Smart 311 Request System with Automatic Noise Detection for Safe Neighborhood , 2018, 2018 IEEE International Smart Cities Conference (ISC2).

[4]  Navarro,et al.  On the application of big data techniques to noise monitoring of smart cities , 2022 .

[5]  Chng Eng Siong,et al.  Overlapping sound event recognition using local spectrogram features and the generalised hough transform , 2013, Pattern Recognit. Lett..

[6]  Yugyung Lee,et al.  Audio IoT Analytics for Home Automation Safety , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[7]  Colin Raffel,et al.  librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.

[8]  Nithya Davis,et al.  Environmental Sound Classification Using Deep Convolutional Neural Networks and Data Augmentation , 2018, 2018 IEEE Recent Advances in Intelligent Computational Systems (RAICS).

[9]  Walid Mahdi,et al.  Improving speech recognition using data augmentation and acoustic model fusion , 2017, KES.

[10]  Justin Joseph,et al.  Low-Cost Computing Using Raspberry Pi 2 Model B , 2018, J. Comput..

[11]  Preeth Raguraman,et al.  LibROSA Based Assessment Tool for Music Information Retrieval Systems , 2019, 2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).

[12]  Laurence Moroney Cloud Storage for Firebase , 2017 .

[13]  Huaimin Wang,et al.  Sample Mixed-Based Data Augmentation for Domestic Audio Tagging , 2018, DCASE.

[14]  L. Rafael Aguiar,et al.  Exploring Data Augmentation to Improve Music Genre Classification with ConvNets , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[15]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[16]  Nicholas D. Lane,et al.  DeepEar: robust smartphone audio sensing in unconstrained acoustic environments using deep learning , 2015, UbiComp.

[17]  Justin Salamon,et al.  Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification , 2016, IEEE Signal Processing Letters.

[18]  Takao Kobayashi,et al.  Harmonics tracking and pitch extraction based on instantaneous frequency , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[19]  Karol J. Piczak Environmental sound classification with convolutional neural networks , 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).

[20]  Gerald Penn,et al.  Convolutional Neural Networks for Speech Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[21]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.