Annotation and Analysis of Discourse Relations, Temporal Relations and Multi-Layered Situational Relations in Japanese Texts

This paper proposes a methodology for building a specialized Japanese data set for recognizing temporal relations and discourse relations. In addition to temporal and discourse relations, multi-layered situational relations that distinguish generic and specific states belonging to different layers in a discourse are annotated. Our methodology has been applied to 170 text fragments taken from Wikinews articles in Japanese. The validity of our methodology is evaluated and analyzed in terms of degree of annotator agreement and frequency of errors.

[1]  Alice ter Meulen,et al.  Genericity: An Introduction , 1995 .

[2]  Masayuki Asahara,et al.  BCCWJ-TimeBank: Temporal and Event Information Annotation on Japanese Text , 2014, Int. J. Comput. Linguistics Chin. Lang. Process..

[3]  Gregory Norman Carlson,et al.  Reference to kinds in English , 1977 .

[4]  Gerhard Jäger,et al.  Topic‐Comment Structure and the Contrast Between Stage Level and Individual Level Predicates , 2001, J. Semant..

[5]  Nicholas Asher,et al.  Annotation for and Robust Parsing of Discourse Structure on Unrestricted Texts , 2007 .

[6]  Angelika Kratzer,et al.  Stage-Level and Individual-Level Predicates , 1995 .

[7]  G. Milsark Existential sentences in English , 1979 .

[8]  Theodore B. Fernald,et al.  Predicates and Temporal Arguments , 2000 .

[9]  Eneko Agirre,et al.  SemEval-2015 Task 4: TimeLine: Cross-Document Event Ordering , 2015, *SEMEVAL.

[10]  Rashmi Prasad,et al.  Reflections on the Penn Discourse TreeBank, Comparable Corpora, and Complementary Annotation , 2014, CL.

[11]  Martin van den Berg,et al.  A Rule Based Approach to Discourse Parsing , 2004, SIGDIAL Workshop.

[12]  William C. Mann,et al.  RHETORICAL STRUCTURE THEORY: A THEORY OF TEXT ORGANIZATION , 1987 .

[13]  Yoshiki Ogawa,et al.  The Stage/Individual Distinction and (In)alienable Possession , 2001 .

[14]  Kikuo Maekawa,et al.  Balanced corpus of contemporary written Japanese , 2013, Language Resources and Evaluation.

[15]  James Pustejovsky,et al.  TempEval-3: Evaluating Events, Time Expressions, and Temporal Relations , 2012, ArXiv.

[16]  Daisuke Bekki,et al.  Toward a Discourse Theory for Annotating Causal Relations in Japanese , 2014, PACLIC.

[17]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[18]  Rashmi Prasad,et al.  The Penn Discourse TreeBank as a Resource for Natural Language Generation , 2005 .