Learning a Theory of Marriage (and Other Relations) from a Web Corpus

This paper describes a method for learning which relations are highly associated with a given seed relation, such as marriage or working for a company. Relation instances taken from a large knowledge base are used as seeds for obtaining candidate sentences that express the associated relations. Relations of interest are identified by parsing the sentences and extracting dependency graph fragments, which are then ranked to determine which of them are most closely associated with the seed relation. We call each such set of associated relations a relation theory. The quality of the induced theories is evaluated using human judgements.
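
The ranking step can be illustrated with a minimal sketch. The abstract does not say which association measure is used, so the smoothed log-ratio of relative frequencies below is an assumption, as are the function name `rank_fragments`, the `min_count` threshold, and the toy fragment strings. Parsing and fragment extraction are assumed to have already happened, so each fragment is represented as a plain string.

```python
import math
from collections import Counter


def rank_fragments(seed_fragments, background_fragments, min_count=2):
    """Rank dependency fragments by association with a seed relation.

    seed_fragments: fragments taken from sentences that mention a seed pair.
    background_fragments: fragments taken from a general sample of sentences.
    The score is an assumed measure (smoothed log-ratio of relative
    frequencies), not necessarily the one used in the paper.
    """
    seed_counts = Counter(seed_fragments)
    background_counts = Counter(background_fragments)
    seed_total = sum(seed_counts.values())
    background_total = sum(background_counts.values())
    vocabulary = set(seed_counts) | set(background_counts)

    scores = {}
    for fragment, count in seed_counts.items():
        if count < min_count:
            continue  # skip fragments that are too rare to score reliably
        p_seed = count / seed_total
        # Add-one smoothing keeps the ratio finite for fragments that
        # never occur in the background sample.
        p_background = (background_counts[fragment] + 1) / (
            background_total + len(vocabulary)
        )
        scores[fragment] = math.log(p_seed / p_background)

    # Highest score first: most strongly associated with the seed relation.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data for the seed relation "marriage".
seed = (
    ["X divorce Y"] * 2
    + ["X have child with Y"] * 2
    + ["X meet Y"] * 2
)
background = ["X meet Y"] * 5 + ["X divorce Y"] + ["X visit Z"] * 4

ranked = rank_fragments(seed, background)
for fragment, score in ranked:
    print(f"{score:+.3f}  {fragment}")
```

In this toy data, fragments that are frequent near seed pairs but rare in general text rise to the top, while a generic fragment such as "X meet Y" is pushed down because it is common everywhere.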
