论文信息 - Concept-Instance Relation Extraction from Simple Noun Sequences Using a Full-Text Search Engine

Concept-Instance Relation Extraction from Simple Noun Sequences Using a Full-Text Search Engine

This paper describes a simple method for acquiring conceptinstance relations from simple noun sequences that frequently appear in Japanese Web documents. In Japanese, many noun sequences can consist of two NPs that have a concept-instance relation. This phenomenon is similar to apposition in English but differs in that many of these noun sequences do not provide any explicit clues, such as the proper noun capitalization or commas used in English apposition, that indicate the boundary between the concept name and the instance name. We developed a method to detect such implicit boundaries between concept names and instance names, and to filter out erroneous concept-instance relations by using a search engine.

[1] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[2] Takenobu Tokunaga,et al. Analysis of Japanese Compound Nouns using Collocational Information , 1994, COLING.

[3] Sharon A. Caraballo. Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[4] 石崎俊,et al. Automatic Extraction of Hyponyms from Newspaper Using Lexicosyntactic Patterns , 2003 .

[5] Eduard H. Hovy,et al. Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked , 2003, ACL.

[6] Kentaro Torisawa,et al. Acquiring Hyponymy Relations from Web Documents , 2004, NAACL.

[7] Eduard Hovy,et al. Towards terascale knowledge acquisition , 2004, COLING 2004.

[8] Doug Downey,et al. Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..