Hidden Conditional Random Fields with M-to-N Alignments for Grapheme-to-Phoneme Conversion

Conditional Random Fields have been successfully applied to a number of NLP tasks such as concept tagging, named entity tagging, and grapheme-to-phoneme conversion. When no alignment between the source and target side is provided with the training data, it is challenging to build a CRF system with state-of-the-art performance. In this work, we present an approach that incorporates an M-to-N alignment as a hidden variable within a transducer-based implementation of CRFs. With integrated estimation of transition penalties, we were able to train a state-of-the-art hidden CRF system in reasonable time on an English grapheme-to-phoneme conversion task without using an external model to provide the alignment.
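
As a minimal sketch of the underlying idea (notation is ours, not taken verbatim from the paper), the hidden CRF marginalizes over the unobserved M-to-N alignment a between the grapheme sequence g and the phoneme sequence \varphi:

    p(\varphi \mid g) = \sum_{a} p(\varphi, a \mid g)
                      = \frac{1}{Z(g)} \sum_{a} \exp\Big( \sum_{k} \lambda_k \, f_k(g, \varphi, a) \Big),

where the f_k are feature functions defined on aligned grapheme-phoneme segments, the \lambda_k are the model parameters, and Z(g) normalizes over all phoneme sequences and alignments. Training maximizes the log-likelihood of this marginal, which is why no external model is needed to supply the alignment.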
