From Lexical Cohesion to Textual Coherence: A Data Driven Perspective

This paper presents research that connects the cohesion structure of a text to the derivation of its coherence structure. Two different algorithms that derive the cohesion structure in the form of lexical paths from large thesauri are illustrated. Their results are correlated with (1) cue phrases of discourse usage and (2) coherence constraints empirically derived. A novel model of coherence structure is devised, based on the data provided by lexical paths from real world texts.

[1]  Kathleen Dahlgren,et al.  Naive semantics for natural language understanding , 1988 .

[2]  Jeannett Martin,et al.  English Text: System and structure , 1992 .

[3]  Jerry R. Hobbs,et al.  Interpretation as Abduction , 1993, Artif. Intell..

[4]  Eduard Hovy,et al.  Organising discourse structure relations using metafunctions , 1993 .

[5]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[6]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[7]  Sanda M. Harabagiu,et al.  Wordnet-based inference of textual context, cohesion and coherence , 1997 .

[8]  Sanda M. Harabagiu,et al.  TextNet - A text-based intelligent system , 1997, Nat. Lang. Eng..

[9]  Michael Halliday,et al.  Cohesion in English , 1976 .

[10]  Eduard H. Hovy,et al.  Automated Discourse Generation Using Discourse Structure Relations , 1993, Artif. Intell..

[11]  A. Kehler Interpreting cohesive forms in the context of discourse inference , 1996 .

[12]  Sanda M. Harabagiu,et al.  Parallel System for Text Inference Using Marker Propagations , 1998, IEEE Trans. Parallel Distributed Syst..

[13]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[14]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[15]  Diane J. Litman,et al.  Classifying Cue Phrases in Text and Speech Using Machine Learning , 1994, AAAI.

[16]  Julia Hirschberg,et al.  Empirical Studies on the Disambiguation of Cue Phrases , 1993, Comput. Linguistics.

[17]  Peter Mark Roget,et al.  Roget's International Thesaurus , 1977 .

[18]  Kathleen McKeown,et al.  Emergent Linguistic Rules from inducing Decision Trees: Disambiguating Discourse Clue Words , 1994, AAAI.

[19]  Graeme Hirst,et al.  Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text , 1991, CL.

[20]  George A. Miller,et al.  Using a Semantic Concordance for Sense Identification , 1994, HLT.

[21]  Ellen M. Voorhees,et al.  Towards Building Contextual Representations of Word Senses Using Statistical Models , 1996 .

[22]  Sanda Harabagiu Testing Gricean Constraints on a WordNet-based Coherence Evaluation System , 1996 .

[23]  William W. Cohen Efficient Pruning Methods for Separate-and-Conquer Rule Learning Systems , 1993, IJCAI.