Towards Open-Text Semantic Parsing via Multi-Task Learning of Structured Embeddings

Open-text (or open-domain) semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR). Unfortunately, large-scale systems cannot easily be machine-learned because directly supervised data is lacking. We propose a method that learns to assign MRs to a wide range of text (using a dictionary of more than 70,000 words, which are mapped to more than 40,000 entities) thanks to a training scheme that combines learning from WordNet and ConceptNet with learning from raw text. The model learns structured embeddings of words, entities, and MRs via a multi-task training process that operates on these diverse data sources and integrates all the learnt knowledge into a single system. This work thus combines methods for knowledge acquisition, semantic parsing, and word-sense disambiguation. Experiments on various tasks indicate that the approach is successful and can form a basis for future, more sophisticated systems.
