Acoustic Traffic Event Detection in Long Tunnels Using Fast Binary Spectral Features

In this paper, we study traffic event detection from audio signals. Real-world data are collected in a long tunnel, and the audio samples are labeled by traffic event type: tire friction sounds, vehicle percussion sounds, and other background sounds. Efficient spectral features are proposed for fast classification of audio events. To model the acoustic characteristics, a deep neural network approach is adopted. Several state-of-the-art algorithms are used for comparison, including an LSTM neural network and Gaussian mixture models with Mel-frequency cepstral coefficients. A novel convolutional neural network architecture that processes the input audio data in an end-to-end fashion is adopted for our traffic event detection application. Furthermore, we use time delay estimation algorithms to locate the sound source when an incident occurs in the tunnel. Compared with the state-of-the-art audio detection methods, the proposed spectral features are shown to be both more accurate and more efficient in detecting audio events related to traffic incidents.
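As a rough illustration of the time delay estimation step, the sketch below estimates the arrival-time difference between two microphones by plain cross-correlation and converts it to a position along the tunnel. The abstract does not say which estimator or microphone layout the authors use, so everything here is an assumption made for illustration: the two-microphone, one-dimensional geometry, the hypothetical function names `estimate_delay` and `locate_source`, and the speed of sound of 343 m/s.

```python
import math
import random


def estimate_delay(ref, sig, max_lag):
    """Return the lag (in samples) that maximizes the cross-correlation.

    A positive lag means the event reaches `sig` later than `ref`.
    Plain brute-force cross-correlation; an assumption, not the paper's method.
    """
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, r in enumerate(ref):
            j = i + lag
            if 0 <= j < len(sig):
                score += r * sig[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def locate_source(lag_samples, sample_rate, mic_spacing, speed_of_sound=343.0):
    """Distance (m) of the source from microphone A.

    Assumes the source lies on the line between microphone A (at 0 m) and
    microphone B (at `mic_spacing` m), and that `lag_samples` is the arrival
    time at B minus the arrival time at A.
    """
    tau = lag_samples / sample_rate
    # t_A = x / c and t_B = (D - x) / c, so tau = (D - 2x) / c.
    return (mic_spacing - speed_of_sound * tau) / 2.0


if __name__ == "__main__":
    # Synthetic example: a noise burst that reaches microphone B 117 samples late.
    rng = random.Random(0)
    burst = [rng.gauss(0.0, 1.0) for _ in range(64)]
    mic_a = [0.0] * 512
    mic_b = [0.0] * 512
    mic_a[50:50 + len(burst)] = burst
    mic_b[167:167 + len(burst)] = burst

    lag = estimate_delay(mic_a, mic_b, max_lag=300)
    print(lag, locate_source(lag, sample_rate=1000, mic_spacing=100.0))
```

In a reverberant tunnel, plain cross-correlation is likely to be unreliable, and a weighted variant such as GCC-PHAT would be a natural substitute for `estimate_delay`; the geometry in `locate_source` would stay the same.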
