Ground Vehicle Acoustic Signal Processing Based on Biological Hearing Models

Abstract : This thesis presents a prototype vehicle acoustic signal classification system with low classification error and short processing delay. To analyze the spectrum of the vehicle acoustic signal, we adopt biologically motivated feature extraction models - cochlear filter and A1-cortical wavelet transform. The multi-resolution representation obtained from these two models is used in the later classification system. Different VQ based clustering algorithms are implemented and tested for real world vehicle acoustic signals. Among them, Learning VQ achieves the optimal Bayes classification performance, but its long search and training time make it not suitable for real time implementation. TSVQ needs a logarithmic search time and its tree structure naturally imitates the aggressive hearing in biological hearing systems, but it has a higher classification error. Finally, a high performance parallel TSVQ (PTSVQ) is introduced, which has classification performance close to the optimal LVQ, while maintains logarithmic search time. Experiments on ACIDS database show that both PTSVQ and LVQ achieve high classification rate. PTSVQ has additional advantages such as easy online training and insensitivity to initial conditions. All these features make PTSVQ the most promising candidate for practical system implementation.

[1]  F. Bilsen Pitch of noise signals: evidence for a "central spectrum". , 1977, The Journal of the Acoustical Society of America.

[2]  Kuansan Wang,et al.  Auditory representations of acoustic signals , 1992, IEEE Trans. Inf. Theory.

[3]  Mieczyslaw M. Kokar,et al.  Model-based fusion for multisensor target recognition , 1996, Defense, Security, and Sensing.

[4]  Grant R. Gerhart,et al.  Wavelet-based ground vehicle recognition using acoustic signals , 1996, Defense + Commercial Sensing.

[5]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[6]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[7]  R. Gray,et al.  Combining Image Compression and Classification Using Vector Quantization , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  John S. Baras,et al.  Convergence of a neural network classifier , 1990, 29th IEEE Conference on Decision and Control.

[9]  John Lazzaro,et al.  Analog VLSI model of binaural hearing , 1991, IEEE Trans. Neural Networks.

[10]  J R Cohen,et al.  Application of an auditory model to speech recognition. , 1989, The Journal of the Acoustical Society of America.

[11]  Martin P. DeSimio,et al.  Phoneme recognition with binaural cochlear models and the stereausis representation , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[13]  Gert Cauwenberghs,et al.  An analog VLSI architecture for auditory based feature extraction , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  H S Colburn,et al.  Theory of binaural interaction based on auditory-nerve data. I. General strategy and preliminary results on interaural discrimination. , 1973, The Journal of the Acoustical Society of America.

[15]  Somkiat Sampan,et al.  Neural Fuzzy Techniques In Vehicle Acoustic Signal Classification , 1997 .

[16]  Stephen W. Kercel,et al.  Wavelet-based acoustic recognition of aircraft , 1994, Defense, Security, and Sensing.

[17]  John S. Baras,et al.  Efficient organization of large ship radar databases using wavelets and structured vector quantization , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[18]  John S. Baras,et al.  Combined compression and classification with learning vector quantization , 1999, IEEE Trans. Inf. Theory.

[19]  S. Seneff A joint synchrony/mean-rate model of auditory processing , 1988 .

[20]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[21]  Richard F. Lyon A computational model of binaural localization and separation , 1983, ICASSP.

[22]  John S. Baras,et al.  Wavelet-based hierarchical organization of large image databases: ISAR and face recognition , 1998, Defense, Security, and Sensing.

[23]  A. LaVigna Nonparametric Classification Using Learning Vector Quantization , 1990 .

[24]  John S. Baras,et al.  Wavelet-based progressive classification of high-range-resolution radar returns , 1994, Defense, Security, and Sensing.

[25]  Grant R. Gerhart,et al.  Wavelet analysis of ground vehicle acoustic signatures , 1995, Defense, Security, and Sensing.

[26]  S A Shamma,et al.  Stereausis: binaural processing without neural delays. , 1989, The Journal of the Acoustical Society of America.

[27]  Gert Cauwenberghs,et al.  An analog VLSI chip with asynchronous interface for auditory feature extraction , 1998 .

[28]  Emily A. Scott Recognition of aerospace acoustic sources using advanced pattern recognition techniques , 1991 .

[29]  John S. Baras,et al.  Hierarchical Wavelet Representations of Ship Radar Returns , 1993 .

[30]  Kuansan Wang,et al.  Spectral shape analysis in the central auditory system , 1995, IEEE Trans. Speech Audio Process..