Paper information - Acquiring Hyponymy Relations from Web Documents

Acquiring Hyponymy Relations from Web Documents

This paper describes an automatic method for acquiring hyponymy relations from HTML documents on the WWW. Hyponymy relations can play a crucial role in various natural language processing systems. Most existing acquisition methods for hyponymy relations rely on particular linguistic patterns, such as “NP such as NP”. Our method, however, does not use such linguistic patterns, and we expect that our procedure can be applied to a wide range of expressions for which existing methods cannot be used. Our acquisition algorithm uses clues such as itemization or listing in HTML documents and statistical measures such as document frequencies and verb-noun co-occurrences.
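
The itemization clue mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it only assumes that the items of one HTML list (`<ul>` or `<ol>`) form a set of candidate hyponyms that may share a hypernym. The names `ListItemExtractor` and `extract_candidate_sets` are hypothetical, and the statistical steps (document frequencies, verb-noun co-occurrences) are not covered.

```python
from html.parser import HTMLParser


class ListItemExtractor(HTMLParser):
    """Collect the text of <li> items, grouped by their enclosing <ul>/<ol>.

    Illustrative only: each group is treated as one set of hyponym candidates.
    """

    def __init__(self):
        super().__init__()
        self.groups = []   # finished lists, each a list of item strings
        self._stack = []   # one open item list per currently open <ul>/<ol>
        self._buf = None   # text pieces of the <li> being read, or None

    def _flush_item(self):
        # Store the current item's text (whitespace-normalized) in the innermost list.
        if self._buf is not None and self._stack:
            text = " ".join("".join(self._buf).split())
            if text:
                self._stack[-1].append(text)
        self._buf = None

    def handle_starttag(self, tag, attrs):
        if tag in ("ul", "ol"):
            self._flush_item()          # text before a nested list belongs to the outer item
            self._stack.append([])
        elif tag == "li" and self._stack:
            self._flush_item()          # tolerate a missing </li>
            self._buf = []

    def handle_endtag(self, tag):
        if tag == "li":
            self._flush_item()
        elif tag in ("ul", "ol") and self._stack:
            self._flush_item()
            items = self._stack.pop()
            if items:
                self.groups.append(items)

    def handle_data(self, data):
        if self._buf is not None:
            self._buf.append(data)


def extract_candidate_sets(html):
    """Return one list of item strings per HTML list found in `html`."""
    parser = ListItemExtractor()
    parser.feed(html)
    parser.close()
    return parser.groups


if __name__ == "__main__":
    page = "<h2>Car makers</h2><ul><li>Toyota</li><li>Honda</li><li>Nissan</li></ul>"
    print(extract_candidate_sets(page))
```

In a fuller pipeline, each extracted set would still need a hypernym candidate, which the abstract says is chosen with statistical measures rather than linguistic patterns.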

Kentaro Torisawa | Keiji Shinzato
