Paper Information - A Simple but Powerful Automatic Term Extraction Method

A Simple but Powerful Automatic Term Extraction Method

In this paper, we propose a new idea for the automatic recognition of domain-specific terms. Our idea is based on the statistics between a compound noun and its component single-nouns. More precisely, we focus on how many nouns adjoin the noun in question to form compound nouns. We propose several scoring methods based on this idea and experimentally evaluate them on the NTCIR1 TMREC test collection. The results are very promising, especially in the low recall area.
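
The adjacency idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the scoring formulas, so everything here is an assumption made for illustration: the function names `adjacency_counts` and `score`, the toy list of compound nouns, and the choice of a geometric mean over (left neighbors + 1) × (right neighbors + 1). Compound nouns are assumed to be already tokenized into tuples of single nouns.

```python
from collections import defaultdict
from math import prod


def adjacency_counts(compounds):
    """Collect the distinct nouns that directly precede/follow each noun
    inside the given compound nouns (hypothetical helper)."""
    left = defaultdict(set)
    right = defaultdict(set)
    for compound in compounds:
        for a, b in zip(compound, compound[1:]):
            right[a].add(b)  # b directly follows a
            left[b].add(a)   # a directly precedes b
    return left, right


def score(compound, left, right):
    """Assumed scoring: geometric mean of (left + 1) * (right + 1)
    over the component nouns. Not confirmed by the abstract."""
    factors = [
        (len(left.get(noun, ())) + 1) * (len(right.get(noun, ())) + 1)
        for noun in compound
    ]
    return prod(factors) ** (1.0 / (2 * len(compound)))


# Toy data, invented for illustration only.
compounds = [
    ("information", "retrieval"),
    ("information", "extraction"),
    ("information", "retrieval", "system"),
    ("term", "extraction"),
]
left, right = adjacency_counts(compounds)

for compound in compounds:
    print(" ".join(compound), round(score(compound, left, right), 3))
```

In this toy data, "information" is followed by two distinct nouns, so compounds containing it score higher than "term extraction", whose components combine with fewer distinct neighbors. The +1 terms keep a noun with no neighbors on one side from zeroing the product.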

Hiroshi Nakagawa | Tatsunori Mori

[1] Kathleen McKeown, et al. Automatically Extracting and Representing Collocations for Language Generation, 1990, ACL.

[2] Sophia Ananiadou, et al. Extracting Nested Collocations, 1996, COLING.

[3] Kyo Kageura, et al. Methods of Automatic Term Recognition: A Review, 1996.

[4] Makoto Iwayama, et al. Term Extraction Using A New Measure of Term Representativeness, 1999, NTCIR.

[5] Sophia Ananiadou, et al. The C-value/NC-value domain-independent method for multi-word term extraction, 1999.

[6] Jun'ichi Tsujii, et al. A Method of Measuring Term Representativeness - Baseline Method Using Co-occurrence Distribution, 2000, COLING.

[7] Kyo Kageura, et al. Automatic Thesaurus Generation through Multiple Filtering, 2000, COLING.