Paper Information - Toward Text Understanding: Integrating Relevance-tagged Corpus and Automatically Constructed Case Frames

This paper proposes a wide-coverage anaphora resolution system as a step toward text understanding. The system resolves zero, direct, and indirect anaphors in Japanese text by integrating two kinds of linguistic resources: a hand-annotated corpus tagged with various relations, and automatically constructed case frames. The corpus carries relevance tags covering predicate-argument relations, relations between nouns, and coreference; it is used both to learn the system's parameters and to test the system. The case frames supply knowledge that is indispensable both for detecting zero and indirect anaphors and for estimating appropriate antecedents. Preliminary experiments showed promising results.

Daisuke Kawahara | Sadao Kurohashi | Ryohei Sasano

[1] Sadao Kurohashi, et al. Semantic Analysis of Japanese Noun Phrases - A New Approach to Dictionary-Based Understanding, 1999, ACL.

[2] Makoto Nagao, et al. A Method of Case Structure Analysis for Japanese Sentences Based on Examples in Case Frame Dictionary, 1994.

[3] Daisuke Kawahara, et al. Fertilization of Case Frame Dictionary for Robust Japanese Case Analysis, 2002, COLING.

[4] Massimo Poesio, et al. Acquiring Lexical Knowledge for Anaphora Resolution, 2002, LREC.

[5] Kôiti Hasida, et al. Construction of a Japanese Relevance-tagged Corpus, 2002, LREC.

[6] Daisuke Kawahara, et al. Zero Pronoun Resolution Based on Automatically Constructed Case Frames and Structural Preference of Antecedents, 2004, IJCNLP.

[7] Jian Su, et al. Coreference Resolution Using Competition Learning Approach, 2003, ACL.