Increased noise immunity in large vocabulary speech recognition with the aid of spectral subtraction
暂无分享,去创建一个
This paper presents several ways of making the signal processing in the IBM speech recognition system more robust with respect to variations in the background noise level. The underlying problem is that the speech recognition system trains on the specific noise circumstances of the training session. A simple solution lays in the controlled addition of noise. The level of noise that has to be added in to effectively mask all background noise is rather high and causes a significant reduction in accuracy. Spectral subtraction does a better job in a limited number of cases, but the thresholding in spectral subtraction often leads to training problems in the hidden Markov model based recognition system. The best results were obtained by reintroducing a semi-natural background by adding noise after applying spectral subtraction.