Traffic density state estimation based on acoustic fusion

In this paper, we propose an acoustic fusion based approach to classify the traffic density states. In particular, we combine the information from mel-frequency cepstral coefficients (MFCC) based classifier, which models the cumulative road side signal and honk event based classifier. Honk based classifier is obtained by modeling the honk statistics for each traffic class, viz., Jam, Medium and Free. We study in detail the discriminative capabilities of honk information based classifier. Decisions from MFCC and honk classifier are then combined in probabilistic framework with an appropriate fusion strategy. We also propose to use prior honk information in-order to further improve the classification results. Classification results show good performance even with 10s of audio data.

[1]  Shivkumar Kalyanaraman,et al.  Vehicular Traffic Density State Estimation Based on Cumulative Road Acoustics , 2012, IEEE Transactions on Intelligent Transportation Systems.

[2]  Belur V. Dasarathy,et al.  Sensor fusion potential exploitation-innovative architectures and illustrative applications , 1997, Proc. IEEE.

[3]  Bhaskaran Raman,et al.  RoadSoundSense: Acoustic sensing based road congestion monitoring in developing regions , 2011, 2011 8th Annual IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks.

[4]  Bhaskaran Raman,et al.  Horn-ok-please , 2010, MobiSys '10.

[5]  Trent W. Lewis,et al.  Sensor Fusion Weighting Measures in Audio-Visual Speech Recognition , 2004, ACSC.

[6]  Cheol Hoon Park,et al.  Adaptive Decision Fusion for Audio-Visual Speech Recognition , 2008 .

[7]  R. D. Bretherton,et al.  Optimizing networks of traffic signals in real time-the SCOOT method , 1991 .

[8]  Fan Ye,et al.  Mobile crowdsensing: current state and future challenges , 2011, IEEE Communications Magazine.

[9]  Tanzeem Choudhury,et al.  Multimodal person recognition using unconstrained audio and video , 1998 .

[10]  Ramachandran Ramjee,et al.  Nericell: rich monitoring of road and traffic conditions using mobile smartphones , 2008, SenSys '08.

[11]  Jian Huang,et al.  A Traffic Congestion Estimation Approach from Video Using Time-Spatial Imagery , 2008, 2008 First International Conference on Intelligent Networks and Intelligent Systems.

[12]  Manohar Das,et al.  An efficient technique for modeling and synthesis of automotive engine sounds , 2001, IEEE Trans. Ind. Electron..