Voice assimilation phenomenon and its implementation in LVCSR system with lexical tree and bigram language model

This paper proposes an LVCSR system that implements the Czech voice assimilation phenomenon. The recognition system uses lexical trees and a bigram language model. The first part of the article describes the voice assimilation phenomenon, the construction of a triphone lexical tree, and the impact of voice assimilation on LVCSR system performance. The second part outlines a lexical tree decoding algorithm based on Viterbi search with pruning. Different methods of implementing voice assimilation are discussed.

Key-Words: voice assimilation, LVCSR system, lexical tree, bigram language model
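To make the lexical tree idea concrete, the following is a minimal sketch of a pronunciation prefix tree, not the system described in the paper. It assumes context-independent phones rather than the triphones the paper uses, and the toy lexicon, the transcriptions, and the names `Node`, `build_lexical_tree`, and `count_nodes` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node of the prefix tree; each arc is labeled with a phone."""
    children: dict = field(default_factory=dict)  # phone -> child Node
    words: list = field(default_factory=list)     # words whose pronunciation ends here


def build_lexical_tree(lexicon):
    """Build a pronunciation prefix tree from a {word: [phones]} mapping."""
    root = Node()
    for word, phones in lexicon.items():
        node = root
        for phone in phones:
            # Reuse the existing arc if another word already shares this prefix.
            node = node.children.setdefault(phone, Node())
        node.words.append(word)
    return root


def count_nodes(node):
    """Count all nodes in the tree, including the root."""
    return 1 + sum(count_nodes(child) for child in node.children.values())


# Hypothetical toy lexicon (assumed transcriptions). "led" is transcribed with
# a final voiceless /t/ to illustrate how a voicing change can make two
# entries share a complete path in the tree.
lexicon = {
    "led": ["l", "e", "t"],
    "les": ["l", "e", "s"],
    "let": ["l", "e", "t"],
}
tree = build_lexical_tree(lexicon)
```

In this toy example the three words share the prefix `l e`, so the tree needs five nodes instead of the ten a flat list of separate pronunciations would use, and the node reached by `l e t` carries both `led` and `let`.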
