Topic-Chain-Based Coherence Annotation Scheme for Chinese Text

There are few explicit discourse connectives in Chinese texts,which bring in new challenge for the traditional connective-grounded coherence annotation scheme.The paper proposes a new idea to deal with the problem.We introduce topic chain(TC)as a main coherence representation and design several topic-comment relations to describe the complex event relations among TC-linked sentences.Therefore,a new coherence annotation scheme based on TCs and connectives are built accordingly.The tentative confirmatory experiments on the Tsinghua Chinese Treebank(TCT)data set show that more than 76%and 50% Chinese complex sentences have TCs and connectives respectively.They can co-occur in most Chinese sentences.The phenomena verify the feasibility and availability of this scheme.

[1]  Helmut Prendinger,et al.  A Novel Discourse Parser Based on Support Vector Machine Classification , 2009, ACL.

[2]  Bonnie L. Webber,et al.  Discourse structure and language technology , 2011, Natural Language Engineering.

[3]  Josef Ruppenhofer,et al.  In Search of Missing Arguments: A Linguistic Approach , 2011, RANLP.

[4]  Jian Su,et al.  Predicting Discourse Connectives for Implicit Discourse Relation Recognition , 2010, COLING.

[5]  Charles N. Li,et al.  Subject and topic , 1979 .

[6]  曹 逢甫,et al.  A functional study of topic in Chinese : the first step towards discourse analysis , 1979 .

[7]  Roser Morante,et al.  SemEval-2010 Task 10: Linking Events and Their Participants in Discourse , 2009, SemEval@ACL.

[8]  Michael Halliday,et al.  Cohesion in English , 1976 .

[9]  Ralph Weischedel,et al.  Modeling Unrestricted Coreference in OntoNotes , 2011 .

[10]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[11]  Bonnie L. Webber,et al.  Discourse Structure and Computation: Past, Present and Future , 2012, Discoveries@ACL.

[12]  G. Meade Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory , 2001 .

[13]  Hwee Tou Ng,et al.  A PDTB-styled end-to-end discourse parser , 2012, Natural Language Engineering.

[14]  Livio Robaldo,et al.  The Penn Discourse Treebank 2.0 Annotation Manual , 2007 .

[15]  Yuping Zhou,et al.  PDTB-style Discourse Annotation of Chinese Text , 2012, ACL.