Robust speech recognition using missing feature theory in the cepstral or LDA domain
暂无分享,去创建一个
When applying Missing Feature Theory to noise robus t speech recognition, spectral features are labeled a s either reliable or unreliable in the time-frequency plane. The acoustic model evaluation of the unreliable feature s is modified to express that their clean values are unk nown or confined within bounds. Classically, MFT requires a n assumption of statistical independence in the spect ral domain, which deteriorates the accuracy on clean speech. In t is paper, MFT is expressed in any domain that is a linear tra nsform of (log-)spectra, for example for cepstra and their ti mederivatives. The acoustic model evaluation is recas t as a nonnegative least squares problem. Approximate solutio ns are proposed and the success of the method is shown thr oug experiments on the AURORA-2 database.
[1] Mikael Adlers,et al. Topics in Sparse Least Squares Problems , 2000 .
[2] Phil D. Green,et al. Robust automatic speech recognition with missing and unreliable acoustic data , 2001, Speech Commun..
[3] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.
[4] Juha Häkkinen,et al. On the Use of Missing Feature Theory with Cepstral Features , 2022 .