Robust Speech Recognition with MSC / DRA Feature Extraction on Modulation Spectrum Domain

This report introduces noise robust speech recognition and proposes advanced speech analysis techniques named MSC (Modulation Spectrum Control)/DRA (Dynamic Range Adjustment). The dynamic range of cepstrum obtained from noisy speech is usually smaller than that from the same speech without noise since some speech features are hidden in noise. This difference may cause recognition errors. Therefore the adjustment of dynamic range can realize the accurate extraction of speech features. The proposed techniques DRA and MSC focus on the speech feature adjustment. DRA normalizes dynamic ranges and MSC eliminates the noise corruption of speech feature parameters. The experiments on isolated word recognition were carried out using 40 male and female speakers for training and 5 male and female speakers for recognition. The result of recognition rate improving from 17% to 64% versus running car noise at -10dB SNR is shown as an example.