We view the speech enhancement task in two aspects: reduction of the perceptual noise level in degraded speech and reconstruction of the degraded information, which may result in improvement of speech intelligibility. We are also very interested in noiseindependent speech enhancement where test noise environments could differ in intensity from those of algorithm development. To this end, we have developed in this paper an algorithm called Noise-Independent Statistical Spectral Mapping (NISSM) to estimate a speech enhancement Wiener filter. NISSM consists of a noise-resistant transformation, which converts noisy speech to a set of noise-resistant features, and a spectral mapping function, which maps the features to autoregressive spectra of clean speech. We will show that the proposed algorithm effectively reduces noise intensity. When the noise intensity of training differs from that of testing, NISSM outperforms significantly a conventional spectral mapping. The algorithm operates frame-by-frame and is designed for real-time application. The noise interference could be stationary or non-stationary white noise with variable intensity.
[1]
Biing-Hwang Juang,et al.
On the application of hidden Markov models for enhancing noisy speech
,
1988,
ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.
[2]
F. Itakura,et al.
Minimum prediction residual principle applied to speech recognition
,
1975
.
[3]
Alan V. Oppenheim,et al.
All-pole modeling of degraded speech
,
1978
.
[4]
Biing-Hwang Juang,et al.
Signal restoration by spectral mapping
,
1987,
ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[5]
Douglas D. O'Shaughnessy,et al.
Speech enhancement based conceptually on auditory evidence
,
1991,
IEEE Trans. Signal Process..
[6]
R. Gray,et al.
Speech coding based upon vector quantization
,
1980,
ICASSP.