A Study on Bilingual Speech Recognition Involving a Minority Language

Multilingual Automatic Speech Recognition (ASR) systems are of great interest in multilingual environments. We have studied the case of the Comunitat Valenciana because the two official languages are Spanish and Valencian. These two languages share most of their phonemes and their syntax and vocabulary are also quite similar since they have influenced each other for many years. In this work, we present the design of the language and the acoustic models for this bilingual situation. Acoustic models can be separate for each language or shared by both of them, and they can be obtained directly from a training corpus or by adapting a previous set of acoustic models. Language models can be separate for each language (monolingual recognition) or mixed for both languages (bilingual recognition). We performed experiments with a small corpus to determine which option was better for this case.

[1]  Hermann Ney,et al.  Some approaches to statistical and finite-state speech-to-speech translation , 2004, Comput. Speech Lang..

[2]  Chin-Hui Lee,et al.  MAP Estimation of Continuous Density HMM : Theory and Applications , 1992, HLT.

[3]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[4]  M. Wald Using Automatic Speech Recognition to Enhance Education for All Students: Turning a Vision into Reality , 2005, Proceedings Frontiers in Education 35th Annual Conference.

[5]  Ulla Uebler,et al.  Multilingual speech recognition in seven languages , 2001, Speech Commun..

[6]  Philip C. Woodland Speaker adaptation for continuous density HMMs: a review , 2001 .

[7]  Tanja Schultz,et al.  Language-independent and language-adaptive acoustic modeling for speech recognition , 2001, Speech Commun..

[8]  Vicente Alabau,et al.  Bilingual speech corpus in two phonetically similar languages , 2006, LREC.

[9]  Sander J. van Wijngaarden,et al.  Intelligibility of native and non-native Dutch speech , 2001, Speech Commun..

[10]  Robert Eklund,et al.  Xenophones: An investigation of phone set expansion in Swedish and implications for speech recognition and speech synthesis , 2001, Speech Commun..

[11]  Lluís Màrquez i Villodre,et al.  Generation of Language Resources for the Development of Speech Technologies in Catalan , 2006, LREC.

[12]  Steve Young,et al.  The HTK book , 1995 .