论文信息 - Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

Exploiting Linguistic Knowledge in Language Modeling of Czech Spontaneous Speech

In our paper, we present a method for incorporating available linguistic information into a statistical language model that is used in ASR system for transcribing spontaneous speech. We employ the class-based language model paradigm and use the morphological tags as the basis for world-to-class mapping. Since the number of different tags is at least by one order of magnitude lower than the number of words even in the tasks with moderately-sized vocabularies, the tag-based model can be rather robustly estimated using even the relatively small text corpora. Unfortunately, this robustness goes hand in hand with restricted predictive ability of the class-based model. Hence we apply the two-pass recognition strategy, where the first pass is performed with the standard word-based n-gram and the resulting lattices are rescored in the second pass using the aforementioned class-based model. Using this decoding scenario, we have managed to moderately improve the word error rate in the performed ASR experiments.

Josef Psutka | Pavel Ircing | Jan Hoidekr

[1] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[2] Jan Hajic. Disambiguation of Rich Inflection - Computational Morphology of Czech , 2004 .

[3] Roger K. Moore. Computer Speech and Language , 1986 .

[4] William J. Byrne,et al. Issues in Annotation of the Czech Spontaneous Speech Corpus in the MALACH project , 2004, LREC.

[5] Bhuvana Ramabhadran,et al. Automatic recognition of spontaneous speech for access to multilingual oral history archives , 2004, IEEE Transactions on Speech and Audio Processing.

[6] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[7] Josef Psutka,et al. Fitting class-based language models into weighted finite-state transducer framework , 2003, INTERSPEECH.

[8] Steve Young,et al. The HTK book , 1995 .

[9] Fernando Pereira,et al. Weighted finite-state transducers in speech recognition , 2002, Comput. Speech Lang..

[10] William J. Byrne,et al. Large vocabulary ASR for spontaneous czech in the MALACH project , 2003, INTERSPEECH.