论文信息 - Robust speech recognition using missing feature theory in the cepstral or LDA domain

Robust speech recognition using missing feature theory in the cepstral or LDA domain

When applying Missing Feature Theory to noise robus t speech recognition, spectral features are labeled a s either reliable or unreliable in the time-frequency plane. The acoustic model evaluation of the unreliable feature s is modified to express that their clean values are unk nown or confined within bounds. Classically, MFT requires a n assumption of statistical independence in the spect ral domain, which deteriorates the accuracy on clean speech. In t is paper, MFT is expressed in any domain that is a linear tra nsform of (log-)spectra, for example for cepstra and their ti mederivatives. The acoustic model evaluation is recas t as a nonnegative least squares problem. Approximate solutio ns are proposed and the success of the method is shown thr oug experiments on the AURORA-2 database.

Hugo Van hamme

[1] Mikael Adlers,et al. Topics in Sparse Least Squares Problems , 2000 .

[2] Phil D. Green,et al. Robust automatic speech recognition with missing and unreliable acoustic data , 2001, Speech Commun..

[3] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[4] Juha Häkkinen,et al. On the Use of Missing Feature Theory with Cepstral Features , 2022 .