Semi-Supervised Lexicon Learning for Wide-Coverage Semantic Parsing

Semantic parsers critically rely on accurate and high-coverage lexicons. However, traditional semantic parsers usually utilize annotated logical forms to learn the lexicon, which often suffer from the lexicon coverage problem. In this paper, we propose a graph-based semi-supervised learning framework that makes use of large text corpora and lexical resources. This framework first constructs a graph with a phrase similarity model learned by utilizing many text corpora and lexical resources. Next, graph propagation algorithm identifies the label distribution of unlabeled phrases from labeled ones. We evaluate our approach on two benchmarks: Webquestions and Free917. The results show that, in both datasets, our method achieves substantial improvement when comparing to the base system that does not utilize the learned lexicon, and gains competitive results when comparing to state-of-the-art systems.

[1]  Raymond J. Mooney,et al.  Learning Synchronous Grammars for Semantic Parsing with Lambda Calculus , 2007, ACL.

[2]  Oren Etzioni,et al.  Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.

[3]  Noah A. Smith,et al.  Semi-Supervised Frame-Semantic Parsing for Unknown Predicates , 2011, ACL.

[4]  Koby Crammer,et al.  New Regularized Algorithms for Transductive Learning , 2009, ECML/PKDD.

[5]  Andrew Y. Ng,et al.  Improving Word Representations via Global Context and Multiple Word Prototypes , 2012, ACL.

[6]  Mark Steedman,et al.  Inducing Probabilistic CCG Grammars from Logical Form with Higher-Order Unification , 2010, EMNLP.

[7]  Jayant Krishnamurthy,et al.  Probabilistic Models for Learning a Semantic Parser Lexicon , 2016, NAACL.

[8]  Raymond J. Mooney,et al.  Learning to Parse Database Queries Using Inductive Logic Programming , 1996, AAAI/IAAI, Vol. 2.

[9]  Ming-Wei Chang,et al.  Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base , 2015, ACL.

[10]  Xuchen Yao,et al.  Lean Question Answering over Freebase from Scratch , 2015, NAACL.

[11]  James F. Allen Learning a Lexicon for Broad-coverage Semantic Parsing , 2014, ACL 2014.

[12]  Yoav Artzi,et al.  Learning Compact Lexicons for CCG Semantic Parsing , 2014, EMNLP.

[13]  Xuchen Yao,et al.  Information Extraction over Structured Data: Question Answering with Freebase , 2014, ACL.

[14]  Dan Klein,et al.  Learning Dependency-Based Compositional Semantics , 2011, CL.

[15]  Bo An,et al.  Sentence Rewriting for Semantic Parsing , 2016, ACL.

[16]  Eunsol Choi,et al.  Scaling Semantic Parsers with On-the-Fly Ontology Matching , 2013, EMNLP.

[17]  Mark Steedman,et al.  Transforming Dependency Structures to Logical Forms for Semantic Parsing , 2016, TACL.

[18]  Kai Zhao,et al.  Learning Translation Models from Monolingual Continuous Representations , 2015, NAACL.

[19]  Jason Weston,et al.  Question Answering with Subgraph Embeddings , 2014, EMNLP.

[20]  Jonathan Berant,et al.  Semantic Parsing via Paraphrasing , 2014, ACL.

[21]  Hwee Tou Ng,et al.  A Generative Model for Parsing Natural Language to Meaning Representations , 2008, EMNLP.

[22]  Percy Liang,et al.  Data Recombination for Neural Semantic Parsing , 2016, ACL.

[23]  Alexander Yates,et al.  Semantic Parsing Freebase: Towards Open-domain Semantic Parsing , 2013, *SEMEVAL.

[24]  Manaal Faruqui,et al.  Morpho-syntactic Lexicon Generation Using Graph-based Semi-supervised Learning , 2015, TACL.

[25]  Tom M. Mitchell,et al.  Joint Syntactic and Semantic Parsing with Combinatory Categorial Grammar , 2014, ACL.

[26]  Claire Gardent,et al.  Sequence-based Structured Prediction for Semantic Parsing , 2016, ACL.

[27]  Hannah Bast,et al.  More Accurate Question Answering on Freebase , 2015, CIKM.

[28]  Chen Liang,et al.  Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision , 2016, ACL.

[29]  Jure Leskovec,et al.  Inducing Domain-Specific Sentiment Lexicons from Unlabeled Corpora , 2016, EMNLP.

[30]  Chris Callison-Burch,et al.  PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification , 2015, ACL.

[31]  Tom M. Mitchell,et al.  Weakly Supervised Training of Semantic Parsers , 2012, EMNLP.

[32]  Luke S. Zettlemoyer,et al.  Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.

[33]  Mark Steedman,et al.  Lexical Generalization in CCG Grammar Induction for Semantic Parsing , 2011, EMNLP.

[34]  Alexander Yates,et al.  Large-scale Semantic Parsing via Schema Matching and Lexicon Extension , 2013, ACL.

[35]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[36]  Tiejun Zhao,et al.  Knowledge-Based Question Answering as Machine Translation , 2014, ACL.

[37]  Guodong Zhou,et al.  Improving Semantic Parsing with Enriched Synchronous Context-Free Grammar , 2015, EMNLP 2015.

[38]  Anoop Sarkar,et al.  Improving Statistical Machine Translation with a Multilingual Paraphrase Database , 2015, EMNLP.

[39]  Luke S. Zettlemoyer,et al.  Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions , 2013, TACL.

[40]  Jonathan Berant,et al.  Imitation Learning of Agenda-based Semantic Parsers , 2015, TACL.

[41]  Kristina Toutanova,et al.  Graph-based Semi-Supervised Learning of Translation Models from Monolingual Data , 2014, ACL.

[42]  Hae-Chang Rim,et al.  Joint Relational Embeddings for Knowledge-based Question Answering , 2014, EMNLP.

[43]  Gholamreza Haffari,et al.  Graph Propagation for Paraphrasing Out-of-Vocabulary Words in Statistical Machine Translation , 2013, ACL.