A COMPARISON OF NEURAL NET AND LINEAR CLASSIFIER AS THE PATTERN RECOGNIZER IN AUTOMATIC LANGUAGE IDENTIFICATION

The goal of language identification (LID) is to quickly and accurately identify the language being spoken in a given test utterance. Recent research has shown the importance of acoustic, phonotactic, and prosodic information for language identification. How to combine these multiple information sources to give the final result is still an open research issue. Traditional ways of combining multiple scores were similar to a linear classifier. In this paper, experiments were conducted to compare the performance of linear-classifier-based and neural-network-based final score combination. The results showed that the neural-network-based system eliminated approximately 15% of the errors made by the linear-classifier-based system, which suggests that a non-linear combination of multiple information sources is necessary for language identification.
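The contrast between the two combination schemes can be illustrated with a minimal sketch. It assumes each information source (acoustic, phonotactic, prosodic) yields one score per candidate language. The function names, the weights, and the single tanh hidden layer are hypothetical choices made for illustration only; the abstract does not specify the network architecture or how either combiner was trained.

```python
import math

def linear_fusion(scores, weights):
    """Weighted sum of per-language scores across sources (linear combiner).

    scores:  dict mapping source name -> list of per-language scores
    weights: dict mapping source name -> scalar weight (hypothetical values)
    """
    n_lang = len(next(iter(scores.values())))
    return [sum(weights[s] * scores[s][i] for s in scores) for i in range(n_lang)]

def mlp_fusion(scores, w_hidden, b_hidden, w_out, b_out):
    """Forward pass of a one-hidden-layer network over the concatenated scores.

    The tanh hidden layer is what makes this combination non-linear.
    All weights are assumed to be already trained; training is not shown.
    """
    # Concatenate scores in a fixed (alphabetical) source order.
    x = [v for s in sorted(scores) for v in scores[s]]
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w_hidden, b_hidden)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(w_out, b_out)]

def argmax(values):
    """Index of the largest value, i.e. the hypothesized language."""
    return max(range(len(values)), key=values.__getitem__)

# Toy scores for two candidate languages (made-up numbers).
toy_scores = {
    "acoustic": [0.2, 0.5],
    "phonotactic": [0.6, 0.1],
    "prosodic": [0.3, 0.3],
}
toy_weights = {"acoustic": 1.0, "phonotactic": 1.0, "prosodic": 1.0}
linear_decision = argmax(linear_fusion(toy_scores, toy_weights))
```

In both cases the decision is the language with the highest fused score. The linear combiner can only trade the sources off against each other with fixed weights, whereas the network can learn interactions between them.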
