Robust target feature extraction based on modified cochlear filter analysis model

The performance of underwater target recognition systems depends on the consistency and adaptation of target feature in complex conditions during the training and testing stages. In this paper, we investigated the target feature extraction problems in a complex underwater environment and proposed a novel approach based on modified cochlear filter analysis model of the human auditory system. The frequency responses and distributions of the modified cochlear filter bank are similar to that of the basilar membrane in the cochlea. The fast forward and inverse transforms of modified cochlear filter bank are also presented for discrete-time signals to save the computational load. The experimental results using real ship radiated noise data verified that the modified cochlear filter analysis model is robust in background noise.

[1]  R. S. Holambe,et al.  Comparison of Speech Recognition of Isolated Words Using Linear Predictive Coding (Lpc), Linear Predictive Cepstral Coefficient (Lpcc) & Perceptual Linear Prediction (Plp) and the Effect of Variation of Model Order on Speech Recognition Rate , 2011 .

[2]  Yixin Yang,et al.  Target Feature Extraction for Passive Sonar Based on Two Cepstrums , 2008, 2008 2nd International Conference on Bioinformatics and Biomedical Engineering.

[3]  Hynek Hermansky,et al.  RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..

[4]  Francesc Alías,et al.  Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification , 2012, IEEE Transactions on Multimedia.

[5]  T. Irino,et al.  A compressive gammachirp auditory filter for both physiological and psychophysical data. , 2001, The Journal of the Acoustical Society of America.

[6]  Nikos Fakotakis,et al.  On acoustic surveillance of hazardous situations , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[8]  Roy D. Patterson,et al.  A Dynamic Compressive Gammachirp Auditory Filterbank , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  DeLiang Wang,et al.  An auditory-based feature for robust speech recognition , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  P. Coleman,et al.  Experiments in hearing , 1961 .

[11]  J. Allen,et al.  Cochlear modeling , 1985, IEEE ASSP Magazine.

[12]  Thaweesak Yingthawornsuk,et al.  Speech Recognition using MFCC , 2012 .

[13]  Mark A Gregory,et al.  A novel approach for MFCC feature extraction , 2010, 2010 4th International Conference on Signal Processing and Communication Systems.