Paper Information - On SemEval-2010 Japanese WSD Task

On SemEval-2010 Japanese WSD Task

This paper presents an overview of the SemEval-2 Japanese WSD task. The task has two new characteristics: (1) it uses the first balanced Japanese sense-tagged corpus, and (2) it considers not only instances whose sense is in the given sense set but also instances whose sense cannot be found in that set. It is a lexical sample task, and word senses are defined according to a Japanese dictionary, the Iwanami Kokugo Jiten. This dictionary and a training corpus were distributed to participants. There were 50 target words: 22 nouns, 23 verbs, and 5 adjectives. Fifty instances of each target word were provided, for a total of 2,500 evaluation instances. Nine systems from four organizations participated in the task.

Kanako Komiya | Hikaru Yokono | Kiyoaki Shirai | Manabu Okumura
