Performing aggregation and ellipsis using discourse structures

This article describes the generation of aggregated and elliptic sentences, using Dependency Trees connected by rhetorical relations as input. The system we have developed can generate both hypotactic and paratactic constructions with appropriate cue words, and various forms of ellipsis such as Gapping and Conjunction Reduction. We contend that Dependency Trees connected by rhetorical relations are excellent input for a generation system that has to generate ellipsis, and we propose a taxonomy of the most common Dutch cue words, grouped according to the kind of discourse relations they signal. Finally, we argue that syntactic aggregation should be performed in the Surface Realizer of a language generation system, because it requires access to language-specific syntactic information.

[1]  Lynne Cahill,et al.  Component tasks in applied NLG systems , 2007 .

[2]  Chris Mellish,et al.  Current research in natural language generation , 1990 .

[3]  James Shaw,et al.  Segregatory Coordination and Ellipsis in Text Generation , 1998, ACL.

[4]  Owen Rambow,et al.  A Framework for MT and Multilingual NLG Systems Based on Uniform Lexico-Structural Processing , 2000, ANLP.

[5]  James C. Lester,et al.  Evaluating the Effects of Natural Language Generation Techniques on Reader Satisfaction , 2001 .

[6]  Dirk Heylen,et al.  Generating expressive speech for storytelling applications , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  William C. Mann,et al.  RHETORICAL STRUCTURE THEORY: A THEORY OF TEXT ORGANIZATION , 1987 .

[8]  Robert Dale,et al.  Using Linguistic Phenomena to Motivate a Set of Rhetorical Relations , 2007 .

[9]  Ted Sanders,et al.  The Role of Coherence Relations and Their Linguistic Markers in Text Processing , 2000 .

[10]  Dirk Heylen,et al.  Emotional Characters for Automatic Plot Creation , 2004, TIDSE.

[11]  Karin Harbusch,et al.  ELLEIPO: a module that computes coordinative ellipsis for language generators that don't , 2006 .

[12]  Kleanthes K. Grohmann,et al.  Right Node Raising and Gapping: Interface Conditions on Prosodic Deletion (review) , 2003 .

[13]  Katharina Hartmann,et al.  Right node raising and gapping , 2000 .

[14]  Eduard H. Hovy,et al.  Automated Discourse Generation Using Discourse Structure Relations , 1993, Artif. Intell..

[15]  Gertjan van Noord,et al.  Alpino: Wide-coverage Computational Analysis of Dutch , 2000, CLIN.

[16]  Maite Taboada,et al.  Applications of Rhetorical Structure Theory , 2006 .

[17]  Michael Moortgat,et al.  Syntactic Analysis in the Spoken Dutch Corpus (CGN) , 2002, LREC.

[18]  Leo G. M. Noordman,et al.  Toward a taxonomy of coherence relations , 1992 .

[19]  Michael White,et al.  Efficient Realization of Coordinate Structures in Combinatory Categorial Grammar , 2006 .

[20]  Robert Dale,et al.  Choosing a Set of Coherence Relations for Text Generation: A Data-Driven Approach , 1993, EWNLG.

[21]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[22]  Linda Schwartz,et al.  The syntax of coordination , 1990 .

[23]  Michael Zock,et al.  Trends in Natural Language Generation An Artificial Intelligence Perspective , 1996, Lecture Notes in Computer Science.

[24]  Igor Mel’čuk,et al.  Dependency Syntax: Theory and Practice , 1987 .

[25]  Nanda Slabbers,et al.  Narration for virtual storytelling , 2006 .

[26]  Kathleen R. McKeown,et al.  Clause aggregation: an approach to generating concise text , 2002 .

[27]  T. Sanders,et al.  The classification of coherence relations and their linguistic markers: An exploration of two languages , 1998 .

[28]  A. Knott,et al.  Using Linguistic Phenomena to Motivate a Set of Coherence Relations. , 1994 .

[29]  Advaith Siddharthan,et al.  Syntactic Simplification and Text Cohesion , 2006 .

[30]  Hercules Dalianis,et al.  Aggregation in Natural Language Generation , 1999 .

[31]  F. Zwarts,et al.  Categoriale grammatica en algebraïsche semantiek. Een onderzoek naar negatie en polariteit in het Nederlands , 1986 .

[32]  Clarisse Sieckenius de Souza,et al.  Getting the message across in RST-based text generation , 1990 .

[33]  Petra Hendriks,et al.  Coherence Relations, Ellipsis and Contrastive Topics , 2004, J. Semant..

[34]  Anna Hildegard Neijt-kappen Gapping: A Contribution to Sentence Grammar , 1980 .

[35]  M. Reape,et al.  Just what is aggregation anyway ? , 2007 .

[36]  John M. Carroll On coordination reduction , 1978 .