An adaptive KLT approach for speech enhancement

An adaptive Karhunen-Loeve transform (KLT) tracking-based algorithm is proposed for enhancement of speech degraded by colored additive interference. This algorithm decomposes noisy speech into its components along the axes of a KLT-based vector space of clean speech. It is observed that the noise energy is disparately distributed along each eigenvector. These energies are obtained from noise samples gathered from silence intervals between speech samples. To obtain these silence intervals, we proposed an efficient voice activity detector based on outputs of the principle component eigenfilter; the greatest eigenvalue of speech KLT. Enhancement is performed by modifying each KLT component due to its noise and clean speech energies. The objective is to minimize the produced distortion when residual noise power is limited to a specific level. At the end, the inverse KLT is performed and an estimation of the clean signal is synthesized. Our listening tests indicated that 71% of our subjects preferred the enhanced speech by the above method over former methods of enhancement of speech degraded by computer generated white Gaussian noise. Our method was preferred by 80% of our subjects when we processed real samples of noisy speech gathered from various environments.

[1]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[2]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[3]  Eric Moulines,et al.  An algorithm for maximum likelihood estimation of hidden Markov models with unknown state-tying , 1998, IEEE Trans. Speech Audio Process..

[4]  C.E. Mokbel,et al.  Automatic word recognition in cars , 1995, IEEE Trans. Speech Audio Process..

[5]  Bin Yang,et al.  Projection approximation subspace tracking , 1995, IEEE Trans. Signal Process..

[6]  Jae Lim,et al.  Evaluation of a correlation subtraction method for enhancing speech degraded by additive white noise , 1978 .

[7]  Hamid Sheikhzadeh,et al.  HMM-based strategies for enhancement of speech signals embedded in nonstationary noise , 1998, IEEE Trans. Speech Audio Process..

[8]  Tony Gustafsson,et al.  Instrumental variable subspace tracking using projection approximation , 1998, IEEE Trans. Signal Process..

[9]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[10]  Victor Solo,et al.  Performance analysis of adaptive eigenanalysis algorithms , 1998, IEEE Trans. Signal Process..

[11]  Ehud Weinstein,et al.  Iterative and sequential Kalman filter-based speech enhancement algorithms , 1998, IEEE Trans. Speech Audio Process..

[12]  Y. Ephraim Statistical model-based speech enhancement systems , 1988 .

[13]  George S. Kang,et al.  Quality improvement of LPC-processed noisy speech by using spectral subtraction , 1989, IEEE Trans. Acoust. Speech Signal Process..

[14]  I. Richer,et al.  Long-range communications at extremely low frequencies , 1974 .

[15]  Søren Holdt Jensen,et al.  Reduction of broad-band noise in speech by truncated QSVD , 1995, IEEE Trans. Speech Audio Process..

[16]  Biing-Hwang Juang,et al.  The past, present, and future of speech processing , 1998, IEEE Signal Process. Mag..