Domain Adaptation in Semantic Role Labeling Using a Neural Language Model and Linguistic Resources

We propose a method for adapting Semantic Role Labeling (SRL) systems from a source domain to a target domain by combining a neural language model and linguistic resources to generate additional training examples. We primarily aim to improve the results of Location, Time, Manner and Direction roles. In our methodology, main words of selected predicates and arguments in the source-domain training data are replaced with words from the target domain. The replacement words are generated by a language model and then filtered by several linguistic filters (including Part-Of-Speech (POS), WordNet and Predicate constraints). In experiments on the out-of-domain CoNLL 2009 data, with the Recurrent Neural Network Language Model (RNNLM) and a well-known semantic parser from Lund University, we show enhanced recall and F1 without penalizing precision on the four targeted roles. These results improve the results of the same SRL system without using the language model and the linguistic resources, and are better than the results of the same SRL system that is trained with examples that are enriched with word embeddings. We also demonstrate the importance of using a language model and the vocabulary of the target domain when generating new training examples.

[1]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[2]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[3]  Jason Weston,et al.  Deep learning via semi-supervised embedding , 2008, ICML '08.

[4]  Eugene Charniak,et al.  Effective Self-Training for Parsing , 2006, NAACL.

[5]  Cícero Nogueira dos Santos,et al.  Semantic Role Labeling , 2012 .

[6]  Zhen Zhang,et al.  Enhanced Word Classing for Recurrent Neural Network Language Model , 2013 .

[7]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[8]  Christopher D. Manning,et al.  Unsupervised Discovery of a Statistical Verb Lexicon , 2006, EMNLP.

[9]  Suzanne Stevenson,et al.  Unsupervised Semantic Role Labellin , 2004, EMNLP.

[10]  Dirk Noël Beth Levin. English Verb Classes and Alternations: A Preliminary Investigation , 1995 .

[11]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[12]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[13]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[14]  Martha Palmer,et al.  Verbnet: a broad-coverage, comprehensive verb lexicon , 2005 .

[15]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[16]  Yi Yang,et al.  Learning Representations for Weakly Supervised Natural Language Processing Tasks , 2014, CL.

[17]  Marie-Francine Moens,et al.  A survey on the application of recurrent neural networks to statistical language modeling , 2015, Comput. Speech Lang..

[18]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[19]  L MercerRobert,et al.  Class-based n-gram models of natural language , 1992 .

[20]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[21]  Mirella Lapata,et al.  Semi-Supervised Semantic Role Labeling via Structural Alignment , 2012, CL.

[22]  Marie-Francine Moens,et al.  Semi-supervised Semantic Role Labeling Using the Latent Words Language Model , 2009, EMNLP.

[23]  Vysoké Učení,et al.  Statistical Language Models Based on Neural Networks , 2012 .

[24]  Ari Rappoport,et al.  Unsupervised Argument Identification for Semantic Role Labeling , 2009, ACL.

[25]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[26]  Ralph Grishman,et al.  The NomBank Project: An Interim Report , 2004, FCP@NAACL-HLT.

[27]  Rasoul Samad Zadeh Kaljahi,et al.  Adapting Self-Training for Semantic Role Labeling , 2010, ACL.

[28]  Mark Dredze,et al.  Low-Resource Semantic Role Labeling , 2014, ACL.

[29]  Georgiana Dinu,et al.  Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors , 2014, ACL.

[30]  David A. Smith,et al.  Improving NLP through Marginalization of Hidden Syntactic Structure , 2012, EMNLP-CoNLL.

[31]  Pierre Nugues,et al.  Multilingual Semantic Role Labeling , 2009, CoNLL Shared Task.

[32]  Marie-Francine Moens,et al.  Model-Portability Experiments for Textual Temporal Analysis , 2011, ACL.

[33]  Alessandro Lenci,et al.  Distributional Memory: A General Framework for Corpus-Based Semantics , 2010, CL.

[34]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[35]  Percy Liang,et al.  Semi-Supervised Learning for Natural Language , 2005 .

[36]  Marie-Francine Moens,et al.  The latent words language model , 2012, Comput. Speech Lang..

[37]  Beatrice Santorini,et al.  The Penn Treebank: An Overview , 2003 .

[38]  Ronan Collobert,et al.  Deep Learning for Efficient Discriminative Parsing , 2011, AISTATS.

[39]  Jason Weston,et al.  Semantic Frame Identification with Distributed Word Representations , 2014, ACL.