论文信息 - Online discriminative training for grapheme-to-phoneme conversion

Online discriminative training for grapheme-to-phoneme conversion

We present an online discriminative training approach to grapheme-to-phoneme (g2p) conversion. We employ a manyto-many alignment between graphemes and phonemes, which overcomes the limitations of widely used one-to-one alignments. The discriminative structure-prediction model incorporates input segmentation, phoneme prediction, and sequence modeling in a unified dynamic programming framework. The learning model is able to capture both local context features in inputs, as well as non-local dependency features in sequence outputs. Experimental results show that our system surpasses the state-of-the-art on several data sets.

Grzegorz Kondrak | Sittichai Jiampojamarn

[1] Terrence J. Sejnowski,et al. Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[2] Walter Daelemans,et al. Language-Independent Data-Oriented Grapheme-to-Phoneme Conversion , 1996 .

[3] Hermann Ney,et al. Joint-sequence models for grapheme-to-phoneme conversion , 2008, Speech Commun..

[4] Walter Daelemans,et al. Do Not Forget: Full Memory in Memory-Based Learning of Word Pronunciation , 1998, CoNLL.

[5] Robert I. Damper,et al. A multistrategy approach to improving pronunciation by analogy , 2000, CL.

[6] Grzegorz Kondrak,et al. Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion , 2008, ACL.

[7] Hermann Ney,et al. Investigations on joint-multigram models for grapheme-to-phoneme conversion , 2002, INTERSPEECH.

[8] Grzegorz Kondrak,et al. Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion , 2007, NAACL.

[9] Grzegorz Kondrak,et al. Substring-Based Transliteration , 2007, ACL.

[10] Alan W. Black,et al. Issues in building general letter to sound rules , 1998, SSW.

[11] Andrzej Stachurski,et al. Parallel Optimization: Theory, Algorithms and Applications , 2000, Parallel Distributed Comput. Pract..