Automatic language identification using large vocabulary continuous speech recognition

We have developed a highly accurate automatic language identification system based on large vocabulary continuous speech recognition (LVCSR). Each test utterance is recognized in a number of languages, and the language ID decision is based on the probability of the output word sequence reported by each recognizer. Recognizers were implemented for this test in English, Japanese, and Spanish, using the Ricardo corpus of telephone monologues. When tested on the OGI corpus of digitally recorded telephone speech, we obtained error rates of 3% or lower on 2-way and 3-way closed-set classification of ten-second and one-minute speech segments.
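The decision rule described above can be illustrated with a minimal sketch. It assumes each language's recognizer has already returned a log-probability for its best word sequence on the same utterance. The function name `identify_language`, the optional per-language `bias` offsets, and the example scores are hypothetical and are not taken from the paper, which may normalize or calibrate the scores differently.

```python
from typing import Dict, Optional


def identify_language(
    log_probs: Dict[str, float],
    bias: Optional[Dict[str, float]] = None,
) -> str:
    """Pick the language whose recognizer scored the utterance highest.

    log_probs maps a language name to the log-probability that its
    recognizer reported for its output word sequence.
    bias is an assumed, optional per-language offset that compensates for
    recognizers whose scores are not directly comparable.
    """
    if not log_probs:
        raise ValueError("need at least one recognizer score")
    offsets = bias or {}
    # Closed-set decision: the candidate set is exactly the set of recognizers.
    return max(log_probs, key=lambda lang: log_probs[lang] + offsets.get(lang, 0.0))


# Illustrative scores only; these are not results from the paper.
example_scores = {"english": -4210.5, "japanese": -4388.2, "spanish": -4301.7}
print(identify_language(example_scores))
```

A 2-way test uses the same rule with only two entries in `log_probs`.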
