NEW FEATURES FOR LANGUA GE IDENTIFICATION USING GMM

Automatic Language Identification (LID) is the process of identifying the language spoken within an utterance. The challenge that this task presents is that no prior information is available indicating the content of the utterance or the identity of the language. Most of the existing LID systems are based on MFCC feature vectors. This paper introduces the use of new feature extraction approach for LID task. For this, an approach is proposed to derive a new type of feature vectors from speech signal alone. The feature extraction method is based on the frequency of occurrence of phonemes is different for different languages. The variations in frequency of occurrence of phonemes are effectively captured using Gaussians. These variations are captures in the form of probability vectors using Gaussians. This approach outperformed the existing conventional MFCC feature vector based LID systems.

[1]  Marc A. Zissman,et al.  Comparison of : Four Approaches to Automatic Language Identification of Telephone Speech , 2004 .

[2]  E. Wong,et al.  Comparison of linear prediction cepstrum coefficients and mel-frequency cepstrum coefficients for language identification , 2001, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489).

[3]  Jirí Navrátil,et al.  Spoken language recognition-a step toward multilinguality in speech processing , 2001, IEEE Trans. Speech Audio Process..

[4]  Douglas A. Reynolds,et al.  Language identification using Gaussian mixture model tokenization , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  M. Sugiyama,et al.  Automatic language recognition using acoustic features , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Etienne Barnard,et al.  Analysis of phoneme-based features for language identification , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Qu Dan,et al.  Discriminative Training of GMM for Language Identification , 2003 .

[8]  Ji R Navrr Spoken Language Recognition -a Step towards Multilinguality in Speech Processing , 2001 .

[9]  Marc A. Zissman,et al.  Automatic language identification using Gaussian mixture and hidden Markov models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  William M. Campbell,et al.  Acoustic, phonetic, and discriminative approaches to automatic language identification , 2003, INTERSPEECH.

[11]  Jonathan Lareau Application of shifted delta cepstral features for GMM language identification , 2006 .