An automated linguistic knowledge-based cross-language transfer method for building acoustic models for a language without native training data

In this paper we describe an automated, linguistic knowledgebased method for building acoustic models for a target language for which there is no native training data. The method assumes availability of well-trained acoustic models for a number of existing source languages. It employs statistically derived phonetic and phonological distance metrics, particularly a combined phonetic-phonological (CPP) metric, defined to characterize a variety of linguistic relationships between phonemes from the source languages and a target language. Using these metrics, candidate phonemes from the source languages are automatically selected for each phoneme of the target language and acoustic models are constructed. Our experiments show that this automated method can generate acoustic models with good quality, far above the general phoneme symbol-based crosslanguage transfer strategy, reaching the performance of models generated through acoustic-distance mapping.

[1]  John Nerbonne,et al.  Measuring Dialect Distance Phonetically , 1997, SIGMORPHON@EACL.

[2]  Elizabeth C. Botha,et al.  COMPARISON OF ACOUSTIC DIS AUTOMATIC CROSS-LANGUAG , 2002 .

[3]  Joachim Köhler,et al.  Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of sounds , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4]  Tanja Schultz,et al.  Fast bootstrapping of LVCSR systems with multilingual phoneme sets , 1997, EUROSPEECH.

[5]  Etienne Barnard,et al.  Phone clustering using the Bhattacharyya distance , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[6]  Harold L. Somers Similarity Metrics for Aligning Children's Articulation Data , 1998, COLING-ACL.

[7]  Philips,et al.  CROSS-LANGUAGE TRANSFER OF MULTILINGUAL PHONEME MODELS , 2003 .

[8]  J. Connolly,et al.  Quantifying target-realization differences. Part II: Sequences , 1997 .