Letter-Phoneme Alignment: An Exploration

Letter-phoneme alignment is usually generated by a straightforward application of the EM algorithm. We explore several alternative alignment methods that employ phonetics, integer programming, and sets of constraints, and propose a novel approach of refining the EM alignment by aggregation of best alignments. We perform both intrinsic and extrinsic evaluation of the assortment of methods. We show that our proposed EM-Aggregation algorithm leads to the improvement of the state of the art in letter-to-phoneme conversion on several different data sets.

[1]  Hong-Goo Kang,et al.  A perspective on the next challenges for TTS research , 2002, Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002..

[2]  Haizhou Li,et al.  Transliteration Alignment , 2009, ACL.

[3]  Antal van den Bosch,et al.  Improved morpho-phonological sequence processing with constraint satisfaction inference , 2006, SIGMORPHON.

[4]  MarchandYannick,et al.  Can syllabification improve pronunciation by analogy of English , 2007 .

[5]  Paul Taylor,et al.  Hidden Markov models for grapheme to phoneme conversion , 2005, INTERSPEECH.

[6]  Robert I. Damper,et al.  Aligning Text and Phonemes for Speech Technology Applications Using an EM-Like Algorithm , 2005, Int. J. Speech Technol..

[7]  Walter Daelemans,et al.  Language-Independent Data-Oriented Grapheme-to-Phoneme Conversion , 1996 .

[8]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner , 2007 .

[9]  Robert I. Damper,et al.  A multistrategy approach to improving pronunciation by analogy , 2000, CL.

[10]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[11]  Vera Demberg,et al.  Phonological Constraints and Morphological Preprocessing for Grapheme-to-Phoneme Conversion , 2007, ACL.

[12]  Kristina Toutanova,et al.  Pronunciation Modeling for Improved Spelling Correction , 2002, ACL.

[13]  Susan Fitt,et al.  Robust LTS rules with the Combilex speech technology lexicon , 2009, INTERSPEECH.

[14]  Grzegorz Kondrak,et al.  Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion , 2008, ACL.

[15]  Robert I. Damper,et al.  Can syllabification improve pronunciation by analogy of English? , 2006, Natural Language Engineering.

[16]  Terrence J. Sejnowski,et al.  Parallel Networks that Learn to Pronounce English Text , 1987, Complex Syst..

[17]  Hermann Ney,et al.  Joint-sequence models for grapheme-to-phoneme conversion , 2008, Speech Commun..

[18]  Grzegorz Kondrak,et al.  A New Algorithm for the Alignment of Phonetic Sequences , 2000, ANLP.

[19]  Alan W. Black,et al.  Issues in building general letter to sound rules , 1998, SSW.

[20]  Hermann Ney,et al.  Improvements in Phrase-Based Statistical Machine Translation , 2004, NAACL.

[21]  Tanja Schultz,et al.  Rapid Development of an Afrikaans English Speech-to-Speech Translator , 2005, IWSLT.

[22]  Grzegorz Kondrak,et al.  Applying Many-to-Many Alignments and Hidden Markov Models to Letter-to-Phoneme Conversion , 2007, NAACL.