Paper Information - Efficient, High-Performance Algorithms for N-Best Search

Efficient, High-Performance Algorithms for N-Best Search

We present two efficient search algorithms for real-time spoken language systems. The first, the Word-Dependent N-Best algorithm, is an improved algorithm for finding the top N sentence hypotheses. It is shown to perform as well as the previously presented Exact Sentence-Dependent algorithm while requiring an order of magnitude less computation. The second, the Forward-Backward Search, is a fast match scheme for continuous speech recognition. This algorithm, directly motivated by the Baum-Welch Forward-Backward training algorithm, has been shown to reduce the computation of a time-synchronous beam search by a factor of 40 with no additional search errors.
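The abstract does not spell out how forward and backward scores are combined, so the following is only a toy sketch of the general idea, under stated assumptions. A forward Viterbi pass stores the best score reaching each state at each time. A reverse-time pass then keeps a state active only if its forward score plus its backward score is within a beam of the best complete-path score. The function names (`forward_scores`, `backward_search`), the state-level trellis, and the toy numbers are all hypothetical illustrations, not the authors' implementation. A real fast match would presumably use a cheaper approximate model in the first pass, which is not modeled here.

```python
import math

def forward_scores(log_init, log_trans, log_emit):
    """Viterbi forward pass.

    fwd[t][s] is the best log score of any path that ends in state s
    at time t, including the emission at time t.
    """
    n_states = len(log_init)
    fwd = [[log_init[s] + log_emit[0][s] for s in range(n_states)]]
    for t in range(1, len(log_emit)):
        prev = fwd[-1]
        fwd.append([
            max(prev[p] + log_trans[p][s] for p in range(n_states))
            + log_emit[t][s]
            for s in range(n_states)
        ])
    return fwd

def backward_search(fwd, log_trans, log_emit, beam):
    """Reverse-time pass pruned with the stored forward scores.

    A state stays active at time t only if fwd[t][s] + bwd[s], the best
    complete path through (t, s), is within `beam` of the best total score.
    Returns the set of active states for each time step.
    Assumes beam >= 0, so the best path always survives.
    """
    n_states = len(fwd[0])
    n_frames = len(fwd)
    best_total = max(fwd[-1])
    bwd = [0] * n_states  # nothing remains to be scored after the last frame
    active = [{s for s in range(n_states)
               if fwd[-1][s] + bwd[s] >= best_total - beam}]
    for t in range(n_frames - 2, -1, -1):
        successors = active[-1]
        new_bwd = [-math.inf] * n_states
        survivors = set()
        for s in range(n_states):
            # Only extend backward through states that survived at t + 1.
            new_bwd[s] = max(
                log_trans[s][n] + log_emit[t + 1][n] + bwd[n]
                for n in successors
            )
            if fwd[t][s] + new_bwd[s] >= best_total - beam:
                survivors.add(s)
        bwd = new_bwd
        active.append(survivors)
    active.reverse()
    return active

# Hypothetical toy trellis: 3 states, 3 frames, integer log scores.
LOG_INIT = [0, -5, -5]
LOG_TRANS = [[-1, -1, -4], [-4, -1, -1], [-4, -4, -1]]
LOG_EMIT = [[0, -3, -6], [-3, 0, -3], [-6, -3, 0]]

if __name__ == "__main__":
    fwd = forward_scores(LOG_INIT, LOG_TRANS, LOG_EMIT)
    for beam in (0, 3, 100):
        print(beam, backward_search(fwd, LOG_TRANS, LOG_EMIT, beam))
```

In this sketch a narrower beam leaves fewer states to expand in the second pass, which is where a computational saving would come from. The toy uses integer scores so that the forward and backward sums compare exactly; with floating-point scores a small tolerance would be needed.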

Richard M. Schwartz | Steve Austin

[1] John Makhoul, et al. Context-dependent modeling for acoustic-phonetic recognition of continuous speech, 1985, ICASSP '85, IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] Richard M. Schwartz, et al. The N-Best Algorithm: Efficient Procedure for Finding Top N Sentence Hypotheses, 1989, HLT.

[3] Richard M. Schwartz, et al. Toward a Real-Time Spoken Language System Using Commercial Hardware, 1990, HLT.

[4] Pietro Laface, et al. Very large vocabulary isolated utterance recognition: a comparison between one pass and two pass strategies, 1988, ICASSP-88, International Conference on Acoustics, Speech, and Signal Processing.

[5] Richard M. Schwartz, et al. A Simple Statistical Class Grammar for Measuring Speech Recognition Performance, 1989, HLT.

[6] Hy Murveit, et al. Integrating natural language constraints into HMM-based speech recognition, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[7] S. Roucos, et al. Statistical language modeling using a small corpus from an application domain, 1988, ICASSP-88, International Conference on Acoustics, Speech, and Signal Processing.

[8] Dimitri Kanevsky, et al. Constructing groups of acoustically confusable words, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[9] Volker Steinbiss, et al. Sentence-hypotheses generation in a continuous-speech recognition system, 1989, EUROSPEECH.