Context-dependent search in a context-independent network

This paper introduces a search algorithm for continuous speech recognition, working on a network that integrates both lexical and linguistic constraints. It differs from traditional Viterbi beam-search in that it does not assume that the network includes any information regarding context dependency of the acoustic models. Phonetic context dependency is instead taken into account by the search procedure itself, in a way that uniformly deals with within-word and cross-word contexts. In the paper the algorithm is described in detail, and results are given on two representative tasks: American English dictation and Italian broadcast news.

[1]  Fabio Brugnara Model agglomeration for context-dependent acoustic modeling , 2001, INTERSPEECH.

[2]  Xavier L. Aubert,et al.  An overview of decoding techniques for large vocabulary continuous speech recognition , 2002, Comput. Speech Lang..

[3]  Andrej Ljolje,et al.  Full expansion of context-dependent networks in large vocabulary speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).