On the Efficient Speech Feature Extraction Based on Independent Component Analysis

A new efficient code for speech signals is proposed. To represent speech signals with minimum redundancy we use independent component analysis to adapt features (basis vectors) that efficiently encode the speech signals. The learned basis vectors are sparsely distributed and localized in both time and frequency. Time-frequency analysis of basis vectors shows the property similar with the critical bandwidth of human auditory system. Our results suggest that the obtained codes of speech signals are sparse and biologically plausible.

[1]  E. Zwicker,et al.  Analytical expressions for critical‐band rate and critical bandwidth as a function of frequency , 1980 .

[2]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[3]  Daryl R. Kipke,et al.  Functional connectivity in auditory cortex using chronic, multichannel unit recordings , 1999, Neurocomputing.

[4]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[5]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[6]  E. Wigner Quantum-Mechanical Distribution Functions Revisited , 1997 .

[7]  Te-Won Lee,et al.  Independent Component Analysis , 1998, Springer US.

[8]  U.K. Laine,et al.  Time-frequency And Multiple-resolution Representations In Auditory Modeling , 1991, Final Program and Paper Summaries 1991 IEEE ASSP Workshop on Applications of Signal Processing to Audio and Acoustics.

[9]  Shun-ichi Amari,et al.  Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[10]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[11]  Christian Jutten,et al.  Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture , 1991, Signal Process..

[12]  Bruno A. Olshausen,et al.  PROBABILISTIC FRAMEWORK FOR THE ADAPTATION AND COMPARISON OF IMAGE CODES , 1999 .

[13]  Daniel P. W. Ellis,et al.  Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .

[14]  Ingrid Daubechies,et al.  The wavelet transform, time-frequency localization and signal analysis , 1990, IEEE Trans. Inf. Theory.