Bayesian subspace methods for acoustic signature recognition of vehicles

Vehicles may be recognized from the sound they make when moving, i.e., from their acoustic signature. Characteristic patterns may be extracted from the Fourier description of the signature and used for recognition. This paper compares conventional methods used for speaker recognition, namely, systems based on Mel-frequency cepstral coefficients (MFCC) and either Gaussian mixture models (GMM) or hidden Markov models (HMM), with Bayesian subspace method based on the short term Fourier transform (STFT) of the vehicles' acoustic signature. A probabilistic subspace classifier achieves a 11.7% error for the ACIDS database, outperforming conventional MFCC-GMM- and MFCC-HMM-based systems by 50%.

[1]  P. Khosla,et al.  Vehicle sound signature recognition by frequency vector principal component analysis , 1998, IMTC/98 Conference Proceedings. IEEE Instrumentation and Measurement Technology Conference. Where Instrumentation is Going (Cat. No.98CH36222).

[2]  Somkiat Sampan,et al.  Neural Fuzzy Techniques In Vehicle Acoustic Signal Classification , 1997 .

[3]  Alex Pentland,et al.  Probabilistic visual learning for object detection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[4]  J. Baras Ground Vehicle Acoustic Signal Processing Based on Biological Hearing Models , 1999 .

[5]  Kuansan Wang,et al.  Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.

[6]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[7]  ChiWei Che,et al.  Speaker recognition using HMM with experiments on the yoho database , 1995, EUROSPEECH.

[8]  Ea-Ee Jan,et al.  Microphone arrays and speaker identification , 1994, IEEE Trans. Speech Audio Process..

[9]  Baback Moghaddam,et al.  Principal Manifolds and Probabilistic Subspaces for Visual Recognition , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Grant R. Gerhart,et al.  Wavelet-based ground vehicle recognition using acoustic signals , 1996, Defense + Commercial Sensing.

[11]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[12]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..