Toward new language adaptation for language identification

Abstract We study the adaptation of all existing language-identification system to new languages using a limited amount of training data. The platform used for this study is the system recently developed ( Yan and Barnard, 1995a , Yan and Barnard, 1995b ) to exploit phonotactic constraints based on language-dependent phone recognition. Using the proposed language model re-estimation technique based on probabilistic gradient descent, two new approaches and their combination are proposed and tested. These approaches all modify the phonotactic language models, so that they no longer equal the conventional maximum-likelihood estimate. The difference of these methods can be viewed as different information resampling on the same amount of data. Experiments were conducted using the standard OGI_TS database ( Muthusamy et al., 1992 ). For comparison, the baseline system (with traditional model estimation) was also subjected to the same set of tests. Systems trained with different amounts of training data in the new languages were evaluated. Compared with the conventional model estimation, the results demonstrate that the new methods improve adaptation to new languages. The success of the discriminative model shows that conventional model estimation is not optimal for language identification, so that improvements can be obtained by modifying the maximum-likelihood estimates of the language models.

[1]  Seiichi Nakagawa,et al.  Three language identification methods based on HMMs , 1994, ICSLP.

[2]  Keikichi Hirose,et al.  Recognized phoneme-based N-gram modeling in automatic language identification , 1995, EUROSPEECH.

[3]  Yonghong Yan,et al.  Development of an approach to automatic language identification based on phone recognition , 1996, Comput. Speech Lang..

[4]  Ronald A. Cole,et al.  The OGI multi-language telephone speech corpus , 1992, ICSLP.

[5]  Yonghong Yan,et al.  An approach to automatic language identification based on language-dependent phone recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Jean-Luc Gauvain,et al.  Language identification using phone-based acoustic likelihoods , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Shubha Kadambe,et al.  Language identification with phonological and lexical models , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Marc A. Zissman,et al.  Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Herbert Gish,et al.  Two novel language model estimation techniques for statistical language identification , 1995, EUROSPEECH.

[10]  Padma Ramesh,et al.  Language identification with embedded word models , 1994, ICSLP.

[11]  Paul Dalsgaard,et al.  On the use of data-driven clustering technique for identification of poly- and mono-phonemes for four European languages , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[12]  Marc A. Zissman Language identification using phoneme recognition and phonotactic language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[13]  Kung-Pu Li Experimental improvements of a language Id system , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[14]  Y.K. Muthusamy,et al.  Reviewing automatic language identification , 1994, IEEE Signal Processing Magazine.

[15]  Itahashi Shuichi,et al.  Language identification based on speech fundamental frequency , 1995, EUROSPEECH.