Effect of Word Sets with Non-Taxonomical Relation for Retrieval Support

At least two kinds of relations exist among related words: the taxonomical relation and the thematic relation. However, although words with a taxonomical relation are easy to identify from linguistic resources such as dictionaries and thesauri, words with a thematic relation are difficult to identify because they are rarely maintained in linguistic resources. In this paper, we present a method of extracting thematically (non-taxonomically) related word sets among words for retrieval support by employing case-marking articles derived from syntactic analysis. For verifying the capability of such word sets, we compared the results retrieved with words related only taxonomically and those retrieved with words including a word related non-taxonomically to the other words. We found additional term which is thematically related to other terms is effective at retrieving informative pages.

[1]  Eugene Charniak,et al.  Finding Parts in Very Large Corpora , 1999, ACL.

[2]  Miriam Bassok,et al.  What Makes a Man Similar to a Tie? Stimulus Compatibility with Comparison and Integration , 1999, Cognitive Psychology.

[3]  Hitoshi Isahara,et al.  Extraction of Hierarchies Based on Inclusion of Co-occurring Words with Frequency Information , 2005, IJCAI.

[4]  Patrick Pantel,et al.  Ontologizing Semantic Relations , 2006, ACL.

[5]  Karin Friberg,et al.  Query Expansion Using Domain Information in Compounds , 2007, NAACL.

[6]  Ido Dagan,et al.  The Distributional Inclusion Hypotheses and Lexical Entailment , 2005, ACL.

[7]  Dan I. Moldovan,et al.  Automatic Discovery of Part-Whole Relations , 2006, CL.

[8]  Roxana Gîrju,et al.  Automatic Detection of Causal Relations for Question Answering , 2003, ACL 2003.

[9]  Ido Dagan,et al.  Scaling Web-based Acquisition of Entailment Relations , 2004, EMNLP.

[10]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[11]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[12]  Norihiro Hagita,et al.  Robust recognition of degraded machine-printed characters using complementary similarity measure and error-correction learning , 1995, Electronic Imaging.

[13]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[14]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.