论文信息 - Hybrid Approach for Language Identification Oriented to Multilingual Speech Recognition in the Basque Context

Hybrid Approach for Language Identification Oriented to Multilingual Speech Recognition in the Basque Context

The development of Multilingual Large Vocabulary Continuous Speech Recognition systems involves issues as: Language Identification, Acoustic-Phonetic Decoding, Language Modelling or the development of appropriated Language Resources The interest on Multilingual Systems arouses because there are three official languages in the Basque Country (Basque, Spanish, and French), and there is much linguistic interaction among them, even if Basque has very different roots than the other two languages This paper describes the development of a Language Identification (LID) system oriented to robust Multilingual Speech Recognition for the Basque context The work presents hybrid strategies for LID, based on the selection of system elements by Support Vector Machines and Multilayer Perceptron classifiers and stochastic methods for speech recognition tasks (Hidden Markov Models and n-grams).

Karmele López de Ipiña | Nora Barroso | Aitzol Ezeiza | Odei Barroso | Unai Susperregi

[1] Piero Cosi. Hybrid HMM-NN architectures for connected digit recognition , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[2] Tanja Schultz,et al. Multilingual Speech Processing , 2006 .

[3] Pavel Matejka,et al. Phonotactic language identification using high quality phoneme recognition , 2005, INTERSPEECH.

[4] Bin Ma,et al. A Phonotactic Language Model for Spoken Language Identification , 2005, ACL.

[5] Hema A. Murthy,et al. Language identification using parallel syllable-like unit recognition , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6] Alberto Sanfeliu,et al. Progress in Pattern Recognition, Speech and Image Analysis , 2003, Lecture Notes in Computer Science.

[7] Mark J. F. Gales,et al. Speech Recognition using SVMs , 2001, NIPS.

[8] João Paulo da Silva Neto,et al. The COST278 Pan-European Broadcast News Database , 2004, LREC.

[9] Laurent Besacier,et al. Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[10] Manuel Graña,et al. Selection of Lexical Units for Continuous Speech Recognition of Basque , 2003, CIARP.

[11] Laurent Besacier,et al. Which units for acoustic and language modeling for Khmer automatic speech recognition? , 2008, SLTU.

[12] Dau-Cheng Lyu,et al. Language identification on code-switching utterances using multiple cues , 2008, INTERSPEECH.

[13] Manuel Graña,et al. Hierarchically structured systems , 1986 .

[14] Joseph Picone,et al. Hybrid SVM/HMM architectures for speech recognition , 2000, INTERSPEECH.

[15] Tanja Schultz,et al. Multilingual and Crosslingual Speech Recognition , 1998 .

[16] Kazuhiro Kondo,et al. An evaluation of cross-language adaptation for rapid HMM development in a new language , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[17] Bin Ma,et al. An acoustic segment modeling approach to automatic language identification , 2005, INTERSPEECH.

[18] Fernando Díaz-de-María,et al. Support Vector Machines for continuous speech recognition , 2006, 2006 14th European Signal Processing Conference.

[19] Eliathamby Ambikairajah,et al. Robust language identification based on fused phonotactic information with MLKSFM pre-classifier , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[20] Karmele López de Ipiña,et al. Development of multimodal resources for multilingual information retrieval in the basque context , 2007, INTERSPEECH.