论文信息 - Spoken word recognition based on top-down phoneme segmentation

Spoken word recognition based on top-down phoneme segmentation

This paper describes a new spoken word recognition approach based on the top-down phoneme segmentation. Fourteen phoneme recognition functions are introduced to deal with various coarticulations. This new approach has two advantages. First, a precise phoneme recognition can be achieved because of the phoneme level top-down verification. Second, only phoneme symbol sequences are required for the vocabulary knowledge source, because the coarticulation knowledge is included in phoneme level knowledge sources. Experimental recognition results for 100 city names uttered by 50 speakers indicate that the phoneme concatenations showing strong coarticulation must be segmented as a unit to achieve a high recognition rate.

K. Aikawa | K. Shikano | M. Sugiyama

[1] Shozo Makino,et al. A speaker independent word recognition system based on phoneme recognition for a large size (212 words) vocabulary , 1984, ICASSP.

[2] Kiyohiro Shikano. Acoustic processing in the conversational speech recognition system , 1981, ICASSP.

[3] Lalit R. Bahl,et al. A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.