Paper information - Fast LM look-ahead for large vocabulary continuous speech recognition using perfect hashing

Fast LM look-ahead for large vocabulary continuous speech recognition using perfect hashing

In this paper, we present a fast method to implement a language model (LM) look-ahead algorithm in a Viterbi-based, single-lexical-tree speech recognizer. We have used three different mechanisms to speed up the calculation: a cache memory attached to each node of the network, a pre-calculation of the probabilities of the active contexts, and an organization of the LM using perfect hashing. These enhancements make it possible to use the full trigram LM to compute the look-ahead with better overall results, both in terms of recognition rate and computation time, than using a unigram- or bigram-based approximation.
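As a rough illustration of the first mechanism, the sketch below computes an LM look-ahead score on a toy lexical prefix tree and stores the result in a cache attached to each node. It is not the authors' implementation: the `Node` class, the `build_tree` and `lookahead` functions, the probability floor, and the toy lexicon and trigram values are all assumptions made for this example. An ordinary Python dictionary stands in for the perfect-hash LM table described in the abstract, and no back-off is modeled.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of a lexical prefix tree (hypothetical structure)."""
    children: dict = field(default_factory=dict)  # subword symbol -> Node
    words: list = field(default_factory=list)     # words that end at this node
    cache: dict = field(default_factory=dict)     # LM history -> look-ahead score


def build_tree(lexicon):
    """Build a prefix tree from a mapping {word: sequence of subword symbols}."""
    root = Node()
    for word, symbols in lexicon.items():
        node = root
        for symbol in symbols:
            node = node.children.setdefault(symbol, Node())
        node.words.append(word)
    return root


def lookahead(node, history, lm, floor=1e-10):
    """Return the best LM probability of any word reachable from `node`.

    `history` is a two-word tuple (trigram context). `lm` maps
    (history, word) to a probability; unseen entries get `floor`
    (an assumption, since a real LM would back off instead).
    """
    if history in node.cache:  # reuse the score stored on this node
        return node.cache[history]
    best = max((lm.get((history, w), floor) for w in node.words), default=0.0)
    for child in node.children.values():
        best = max(best, lookahead(child, history, lm, floor))
    node.cache[history] = best
    return best


# Toy data, invented for illustration only.
lexicon = {"cat": "kat", "cab": "kab", "dog": "dog"}
history = ("the", "black")
lm = {
    (history, "cat"): 0.5,
    (history, "dog"): 0.3,
    (history, "cab"): 0.1,
}
root = build_tree(lexicon)
score_k = lookahead(root.children["k"], history, lm)  # best of "cat" and "cab"
score_d = lookahead(root.children["d"], history, lm)  # only "dog" is reachable
```

In this sketch the cache is keyed by the LM history, so a second request for the same node and context returns immediately without revisiting the subtree. A decoder would typically combine such a score with the acoustic score when pruning partial hypotheses inside the tree.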

Carmen García-Mateo | Antonio Cardenal López | Javier Dieguez-Tirado
