论文信息 - Language identification incorporating lexical information

Language identification incorporating lexical information

In this paper we explore the use of lexical information for language identification (LID). Our reference LID system uses language-dependent acoustic phone models and phone-based bigram language models. For each language, lexical information is introduced by augmenting the phone vocabulary with the N most frequent words in the training data. Combined phone and word bigram models are used to provide linguistic constraints during acoustic decoding. Experiments were carried out on a 4-language telephone speech corpus. Using lexical information achieves a relative error reduction of about 20% on spontaneous and read speech compared to the reference phone-based system. Identification rates of 92%, 96% and 99% are achieved for spontaneous, read and task-specific speech segments respectively, with prior speech detection.

Jean-Luc Gauvain | Lori Lamel | Driss Matrouf | Martine Adda-Decker

[1] Jean-Luc Gauvain,et al. A Multilingual Corpus for Language Identification , 1998 .

[2] Marc A. Zissman,et al. Predicting, diagnosing and improving automatic language identification performance , 1997, EUROSPEECH.

[3] Marc A. Zissman,et al. Comparison of : Four Approaches to Automatic Language Identification of Telephone Speech , 2004 .

[4] Jean-Luc Gauvain,et al. Language identification with language-independent acoustic models , 1997, EUROSPEECH.

[5] Ronald A. Cole,et al. The OGI multi-language telephone speech corpus , 1992, ICSLP.

[6] Tanja Schultz,et al. Fast bootstrapping of LVCSR systems with multilingual phoneme sets , 1997, EUROSPEECH.

[7] Larry Gillick,et al. Language Identification via Large Vocabulary Speaker Independent Continuous Speech Recognition , 1994, HLT.

[8] Jean-Luc Gauvain,et al. Identifying non-linguistic speech features , 1993, EUROSPEECH.