论文信息 - Domain Transfer for Deep Natural Language Generation from Abstract Meaning Representations

Domain Transfer for Deep Natural Language Generation from Abstract Meaning Representations

Stochastic natural language generation systems that are trained from labelled datasets are often domainspecific in their annotation and in their mapping from semantic input representations to lexical-syntactic outputs. As a result, learnt models fail to generalize across domains, heavily restricting their usability beyond single applications. In this article, we focus on the problem of domain adaptation for natural language generation. We show how linguistic knowledge from a source domain, for which labelled data is available, can be adapted to a target domain by reusing training data across domains. As a key to this, we propose to employ abstract meaning representations as a common semantic representation across domains. We model natural language generation as a long short-term memory recurrent neural network encoderdecoder, in which one recurrent neural network learns a latent representation of a semantic input, and a second recurrent neural network learns to decode it to a sequence of words. We show that the learnt representations can be transferred across domains and can be leveraged effectively to improve training on new unseen domains. Experiments in three different domains and with six datasets demonstrate that the lexical-syntactic constructions learnt in one domain can be transferred to new domains and achieve up to 75-100% of the performance of in-domain training. This is based on objective metrics such as BLEU and semantic error rate and a subjective human rating study. Training a policy from prior knowledge from a different domain is consistently better than pure in-domain training by up to 10%.

Nina Dethlefs | Nina Dethlefs

[1] Marcel Bollmann. Adapting SimpleNLG to German , 2011, ENLG.

[2] Raymond J. Mooney,et al. Learning to Interpret Natural Language Navigation Instructions from Observations , 2011, Proceedings of the AAAI Conference on Artificial Intelligence.

[3] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[4] Ehud Reiter,et al. Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[5] Luciana Benotti,et al. The GIVE-2 Nancy Generation Systems NA and NM , 2010 .

[6] Milica Gasic,et al. Phrase-Based Statistical Language Generation Using Graphical Models and Active Learning , 2010, ACL.

[7] Albert Gatt,et al. SimpleNLG: A Realisation Engine for Practical Applications , 2009, ENLG.

[8] Dan Klein,et al. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[9] Yoshua Bengio,et al. Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[10] Jordan L. Boyd-Graber,et al. Generating Sentences from Semantic Vector Space Representations , 2014 .

[11] John A. Bateman,et al. Enabling technology for multilingual natural language generation: the KPML development environment , 1997, Natural Language Engineering.