论文信息 - Multilingual speech recognition with language identification

Multilingual speech recognition with language identification

This paper presents a new approach to multilingual speech recognition. The proposed algorithm combines both language identification (LID) and speech recognition into a single process. It is shown to be effective for multilingual grammarbased speech recognition where the language information is not available prior to recognition. The idea is to make use of acoustic-phonetic and lexical information in each language to reduce possible mismatch caused by potential difference in acoustic and recording conditions when the training utterances for each language were collected. By doing so, it is shown that, with the help of LID information, the word error rate of a mixed Mandarin and English speech recognition system is greatly reduced. The same formulation can also be used to enhance language identification accuracy.

Bin Ma | Haizhou Li | Chin-Hui Lee | Cuntai Guan

[1] Yonghong Yan,et al. Development of an approach to automatic language identification based on phone recognition , 1996, Comput. Speech Lang..

[2] Ji R Navrr. Spoken Language Recognition -a Step towards Multilinguality in Speech Processing , 2001 .

[3] Chin-Hui Lee,et al. Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition , 1998 .

[4] Timothy J. Hazen,et al. Segment-based automatic language identification , 1997 .

[5] Chin-Hui Lee,et al. Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition , 1996, IEEE Trans. Speech Audio Process..

[6] Tanja Schultz,et al. LVCSR-based language identification , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7] Marc A. Zissman,et al. Comparison of : Four Approaches to Automatic Language Identification of Telephone Speech , 2004 .

[8] Marc A. Zissman,et al. Automatic language identification , 2001, Speech Commun..