论文信息 - A Composition Algorithm of Compact Finite-State Super Transducers for Grapheme-to-Phoneme Conversion

A Composition Algorithm of Compact Finite-State Super Transducers for Grapheme-to-Phoneme Conversion

Minimal deterministic finite-state transducers (MDFSTs) are powerful models that can be used to represent pronunciation dictionaries in a compact form. Intuitively, we would assume that by increasing the size of the dictionary, the size of the MDFSTs would increase as well. However, as we show in the paper, this intuition does not hold for highly inflected languages. With such languages the size of the MDFSTs begins to decrease once the number of words in the represented dictionary reaches a certain threshold. Motivated by this observation, we have developed a new type of FST, called a finite-state super transducer (FSST), and show experimentally that the FSST is capable of representing pronunciation dictionaries with fewer states and transitions than MDFSTs. Furthermore, we show that (unlike MDFSTs) our FSSTs can also accept words that are not part of the represented dictionary. The phonetic transcriptions of these out-of-dictionary words may not always be correct, but the observed error rates are comparable to the error rates of the traditional methods for grapheme-to-phoneme conversion.

Simon Dobrisek | France Mihelic | Vitomir Struc | Jerneja Zganec-Gros | Ziga Golob

[1] Mehryar Mohri,et al. Finite-State Transducers in Language and Speech Processing , 1997, CL.

[2] Jerneja Zganec-Gros,et al. SI-PRON Pronunciation Lexicon: a New Language Resource for Slovenian , 2006, Informatica.

[3] Simon Dobrisek,et al. FST-Based Pronunciation Lexicon Compression for Speech Engines , 2012 .

[4] Hermann Ney,et al. Structure learning in hidden conditional random fields for grapheme-to-phoneme conversion , 2013, INTERSPEECH.

[5] Stefan Hahn,et al. Comparison of Grapheme-to-Phoneme Methods on Large Pronunciation Dictionaries and LVCSR Tasks , 2012, INTERSPEECH.

[6] Maja Skrjanc,et al. Automatic Lexical Stress Assignment of Unknown Words for Highly Inflected Slovenian Language , 2002, TSD.

[7] Hermann Ney,et al. Joint-sequence models for grapheme-to-phoneme conversion , 2008, Speech Commun..

[8] Johan Schalkwyk,et al. OpenFst: A General and Efficient Weighted Finite-State Transducer Library , 2007, CIAA.

[9] Grzegorz Kondrak,et al. Letter-Phoneme Alignment: An Exploration , 2010, ACL.

[10] Mehryar Mohri,et al. Minimization algorithms for sequential transducers , 2000, Theor. Comput. Sci..