RETRACTED: Urban Sound Classification Using Convolutional Neural Network Model

The programmed content-based order of urban sound classes is a significant part of different developing methods and applications, for example, observation, urban soundscape comprehension and commotion source distinguishing proof, along these lines the exploration subject has increased a great deal of consideration lately. The objective of this paper is to create a proficient AI based plan for urban sound classification. Ongoing fruitful utilizations of convolutional neural systems (CNNs) to sound order and discourse acknowledgment have spurred the quest for better information portrayals for progressively proficient preparation. Visual presentations of a sound signal, through different time-recurrence portrayals, for example, spectrograms offer a very good representation of the worldly picture of the original signal. Utilizing a spectrogram picture of the sound and afterward changing over the equivalent to information focuses (As is accomplished for pictures). This is effortlessly done utilizing mel_spectogram a function of Librosa. At the approval stage, we lead tests on Urban Sound 8K database which comprises 10 classes of urban sound happenings with 8732 real-world sound clips. As a result, we see how convolutional neural network (CNN) frameworks with raw sound waveforms improve the exactness in urban sound classification and clearly shows the structure concerning the number of parameters.

[1]  Shrikanth Narayanan,et al.  Environmental Sound Recognition With Time–Frequency Audio Features , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Haiyan Shu,et al.  Using deep convolutional neural network to classify urban sounds , 2017, TENCON 2017 - 2017 IEEE Region 10 Conference.

[3]  Benjamin Schrauwen,et al.  End-to-end learning for music audio , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4]  Hermann Ney,et al.  Convolutional neural networks for acoustic modeling of raw time signal in LVCSR , 2015, INTERSPEECH.

[5]  Jürgen T. Geiger,et al.  Improving event detection for audio surveillance using Gabor filterbank features , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[6]  Onur Dikmen,et al.  Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Hermann Ney,et al.  Acoustic modeling with deep neural networks using raw time signal for LVCSR , 2014, INTERSPEECH.

[8]  Wei Dai,et al.  Very deep convolutional neural networks for raw waveforms , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Hemant A. Patil,et al.  Novel TEO-based Gammatone features for environmental sound classification , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[10]  Feng Liu,et al.  Learning Environmental Sounds with Multi-scale Convolutional Neural Network , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[11]  Soomyung Park,et al.  Convolutional Recurrent Neural Networks for Urban Sound Classification Using Raw Waveforms , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[12]  Gaël Richard,et al.  Acoustic scene classification with matrix factorization for unsupervised feature learning , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[13]  Justin Salamon,et al.  Feature learning with deep scattering for urban sound analysis , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[14]  Mathieu Lagrange,et al.  Detection of overlapping acoustic events using a temporally-constrained probabilistic model , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Lars Lundberg,et al.  Classifying environmental sounds using image recognition networks , 2017, KES.

[16]  Nicolai Petkov,et al.  Audio Surveillance of Roads: A System for Detecting Anomalous Sounds , 2016, IEEE Transactions on Intelligent Transportation Systems.

[17]  Karol J. Piczak Environmental sound classification with convolutional neural networks , 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).

[18]  Justin Salamon,et al.  Unsupervised feature learning for urban sound classification , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[19]  Justin Salamon,et al.  A Dataset and Taxonomy for Urban Sound Research , 2014, ACM Multimedia.

[20]  Justin Salamon,et al.  Deep Convolutional Neural Networks and Data Augmentation for Environmental Sound Classification , 2016, IEEE Signal Processing Letters.

[21]  Ramesh C. Poonia,et al.  A Literature Review on Dedicated Short Range Communication for Intelligent Transport , 2013 .

[22]  Sridhar Krishnan,et al.  Time–Frequency Matrix Feature Extraction and Classification of Environmental Audio Signals , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  C.-C. Jay Kuo,et al.  Environmental sound recognition: A survey , 2013, 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference.