论文信息 - The Domain Restriction Hypothesis: Relating Term Similarity and Semantic Consistency

The Domain Restriction Hypothesis: Relating Term Similarity and Semantic Consistency

In this paper, we empirically demonstrate what we call the domain restriction hypothesis, claiming that semantically related terms extracted from a corpus tend to be semantically coherent. We apply this hypothesis to define a post-processing module for the output of Espresso, a state of the art relation extraction system, showing that irrelevant and erroneous relations can be filtered out by our module, increasing the precision of the final output. Results are confirmed by both quantitative and qualitative analyses, showing that very high precision can be reached.

[1] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[2] Carlo Strapparava,et al. The role of domain information in Word Sense Disambiguation , 2002, Natural Language Engineering.

[3] Dan I. Moldovan,et al. Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations , 2003, NAACL.

[4] T. Landauer,et al. Indexing by Latent Semantic Analysis , 1990 .

[5] Eduard H. Hovy,et al. Learning surface text patterns for a Question Answering System , 2002, ACL.

[6] Alfio Massimiliano Gliozzo. The GOD model , 2006, EACL.

[7] Umberto Eco,et al. Lector in fabula , 1989 .

[8] Carlo Strapparava,et al. Semantic Domains in Computational Linguistics , 2009 .

[9] Patrick Pantel,et al. Discovering word senses from text , 2002, KDD.

[10] Sanda M. Harabagiu,et al. The Informative Role of WordNet in Open-Domain Question Answering , 2004, HLT-NAACL 2004.

[11] Patrick Pantel,et al. Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[12] Daniel Jurafsky,et al. Semantic Taxonomy Induction from Heterogenous Evidence , 2006, ACL.

[13] Bernardo Magnini,et al. Integrating Subject Field Codes into WordNet , 2000, LREC.

[14] Philipp Cimiano,et al. Ontology Learning from Text: Methods, Evaluation and Applications , 2005 .

[15] Doug Downey,et al. Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..