The COST 278 MASPER Initiative - Crosslingual Speech Recognition with Large Telephone Databases

This paper presents the work on crosslingual speech recognition carried out by the MASPER initiative that was formed as a part of the COST 278 Action. Two different approaches for transfering monolingual source acoustic models to a new language were compared. The first one was expert-driven, based on the IPA scheme. The second was data-driven, based on a crosslingual phoneme confusion matrix. German, Spanish, Hungarian and Slovak were used as sourcelanguages. Slovenian was selected to be the target language. All experiments were carried out on SpeechDat databases. The results’ analysis showed that the expert-driven method outperforms the data-driven one, and that similarities between source and target language have a significant influence on the performance.