论文信息 - Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases

Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases

We investigate a highly effective and extremely simple noiserobust front end based on novel post-processing of standard MFCC features on the Aurora databases. It performs remarkably well on both the Aurora 2.0 and Aurora 3.0 databases without requiring any increase in model complexity. Our experiments on Aurora 2.0 have been reported in [1]. In this paper, we evaluate this technique on the Aurora 3.0 corpus, and present updated results on Aurora 2.0. Results in the past have shown that endpointing (i.e., presegmentation) on Aurora 3.0 can yield significant improvements. Our experiments reported herein show that our approach integrates well with this endpointing, namely we obtain additional significant improvements. Overall, on Aurora 3.0 we obtain a 47.17% improvement over the segmented baseline. Also, our most recent Aurora 2.0 results show an overall improvement of 41.09% over the baseline for the matched training conditions, and 65.07% for the mis-matched conditions.

Karim Filali | Jeff A. Bilmes | Chia-Ping Chen

[1] Jeff A. Bilmes,et al. Low-resource noise-robust feature post-processing on Aurora 2.0 , 2002, INTERSPEECH.

[2] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[3] Lou Boves,et al. Feature vector selection to improve ASR robustness in noisy conditions , 2001, INTERSPEECH.

[4] Jon Barker,et al. Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise , 2001, INTERSPEECH.

[5] Geoffrey Zweig,et al. The graphical models toolkit: An open source software system for speech and time-series processing , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] Daniel P. W. Ellis,et al. Investigations into tandem acoustic modeling for the Aurora task , 2001, INTERSPEECH.