Vehicle acoustic classification in netted sensor systems using Gaussian mixture models

Acoustic vehicle classification is a difficult problem because the signals are non-stationary and, in particular, because most civilian vehicles with highly muffled exhausts lack strong harmonic structure. Acoustic signatures also vary considerably with speed, acceleration, gear position, and even the aspect angle of the sensor. The problem becomes more complicated when the deployed acoustic sensors have less-than-ideal characteristics, both in the frequency response of the transducers and in the hardware capabilities that determine resolution and dynamic range. In a hierarchical network topology, less capable Tier 1 sensors can be tasked with reasonably sophisticated signal processing and classification algorithms, reducing energy-expensive communications with the upper layers. At Tier 2, however, more sophisticated classification algorithms that exceed the Tier 1 sensor/processor capabilities can be deployed. This paper investigates a Gaussian mixture model (GMM) based classification approach for these upper nodes. The use of GMMs is motivated by their ability to model arbitrary distributions, which is highly relevant for motor vehicles with varying operating modes and engines. Tier 1 sensors acquire the acoustic signal and transmit computed feature vectors to Tier 2 processors for maximum-likelihood classification using GMMs. In a binary light-vs-heavy vehicle classification task, the GMM-based approach achieves a 7% equal error rate, an approximate error reduction of 49% relative to Tier 1-only approaches.
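The Tier 2 decision rule described above, maximum-likelihood classification of received feature vectors under per-class GMMs, can be illustrated with a minimal sketch. This is not the paper's implementation: the two-dimensional features, the diagonal covariances, the hand-picked mixture parameters in `MODELS`, and the assumption that frames are scored independently are all hypothetical choices made for illustration. In practice the mixture parameters would be estimated from training data, typically with the EM algorithm.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(x, gmm):
    """Log p(x | class) for a GMM given as a list of (weight, mean, var)."""
    terms = [math.log(w) + log_gaussian_diag(x, mu, var) for w, mu, var in gmm]
    peak = max(terms)  # log-sum-exp trick for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in terms))


def classify(frames, models):
    """Return the class whose GMM maximizes the summed frame log-likelihood.

    Summing over frames assumes the feature vectors are independent,
    which is an assumption of this sketch.
    """
    scores = {
        label: sum(gmm_log_likelihood(x, gmm) for x in frames)
        for label, gmm in models.items()
    }
    return max(scores, key=scores.get), scores


# Hypothetical 2-D features and hand-picked parameters, for illustration only.
MODELS = {
    "light": [(0.6, [0.0, 0.0], [1.0, 1.0]), (0.4, [2.0, 1.0], [0.5, 0.5])],
    "heavy": [(0.5, [5.0, 4.0], [1.0, 1.0]), (0.5, [7.0, 6.0], [2.0, 2.0])],
}

if __name__ == "__main__":
    # Feature vectors as they might arrive from a Tier 1 node (made-up values).
    frames = [[5.2, 4.1], [6.5, 5.8], [4.9, 3.7]]
    label, scores = classify(frames, MODELS)
    print(label, scores)
```

Because only the feature vectors are needed at the upper node, the raw audio never has to leave the Tier 1 sensor. Comparing the difference of the two class scores against a threshold, rather than taking the maximum, gives the operating-point trade-off from which an equal error rate is measured.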
