WIP: The Coordinated Generation of Multimodal Presentations from a Common Representation

The task of the knowledge-based presentation system WIP is the generation of a variety of multimodal documents from an input consisting of a formal description of the communicative intent of a planned presentation. WIP generates illustrated texts that are customized for the intended audience and situation. We present the architecture of WIP and introduce as its major components the presentation planner, the layout manager, the text generator and the graphics generator. An extended notion of coherence for multimodal documents is introduced that can be used to constrain the presentation planning process. The paper focuses on the coordination of contents planning and layout that is necessary to produce a coherent illustrated text. In particular, we discuss layout revisions after contents planning and the influence of layout constraints on text generation. We show that in WIP the design of a multimodal document is viewed as a non-monotonic planning process that includes various revisions of preliminary results in order to achieve a coherent output with an optimal media mix.

[1]  Steven K. Feiner,et al.  A grid-based approach to automating display layout , 1998 .

[2]  Joseph E. Grimes,et al.  The Thread of Discourse , 1984 .

[3]  Steven K. Feiner,et al.  Interactive Multimedia Explanation for Equipment Maintenance and Repair , 1990, HLT.

[4]  Thomas Rist,et al.  Wissensbasierte Perspektivenwahl für die automatische Erzeugung von 3D-Objektdarstellungen , 1990, Graphik und KI.

[5]  Jerry R. Hobbs Why Is Discourse Coherent , 1978 .

[6]  Som Bandyopadhyay Towards an understanding of coherence in multimodal discourse , 1990 .

[7]  Steven F. Roth,et al.  Graphics and Natural Language as Components of Automatic Explanation , 1988, SGCH.

[8]  Johanna D. Moore,et al.  A Reactive Approach to Explanation , 1989, IJCAI.

[9]  Rachel Reichman,et al.  Getting computers to talk like you and me , 1985 .

[10]  Steven K. Feiner,et al.  Coordinating Text and Graphics in Explanation Generation , 1989, HLT.

[11]  Ehud Reiter,et al.  Avoiding Unwanted Conversational Implicatures in Text and Graphics , 1990, AAAI.

[12]  Søren Kjørup,et al.  Pictorial speech acts , 1978 .

[13]  Wolfgang Wahlster,et al.  User and discourse models for multimodal communication , 1991 .

[14]  Josef Müller-Brockmann Grid systems in graphic design : a visual communication manual for graphic designers, typographers and three dimensional designers = Raster Systeme für die visuelle Gestaltung : ein Handbuch für Grafiker, Typografen und Ausstellungsgestalter , 1981 .

[15]  John R. Searle,et al.  Speech Acts: An Essay in the Philosophy of Language , 1970 .

[16]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[17]  Bjørn N. Freeman-Benson,et al.  An incremental constraint solver , 1990, CACM.

[18]  Bernhard Nebel,et al.  Reasoning and Revision in Hybrid Representation Systems , 1990, Lecture Notes in Computer Science.

[19]  Alan Borning,et al.  Constraint hierarchies , 1992 .

[20]  Alan Borning,et al.  Constraint-Based Tools for Building User Interfaces , 1986, ACM Trans. Graph..

[21]  Jerry R. Hobbs Coherence and Coreference , 1979, Cogn. Sci..

[22]  Wolfgang Wahlster,et al.  Designing Illustrated Texts: How Language Production Is Influenced by Graphics Generation , 1991, EACL.

[23]  Richard John Beach Setting tables and illustrations with style , 1985 .

[24]  Thomas Rist,et al.  Towards a Plan-Based Synthesis of Illustrated Documents , 1990, ECAI.

[25]  Stuart C. Shapiro,et al.  Intelligent Multi-Media Interface Technology , 1988, SGCH.

[26]  Alfred Kobsa,et al.  User Models in Dialog Systems , 1989, Symbolic Computation.

[27]  Johanna D. Moore,et al.  Planning Text for Advisory Dialogues , 1989, ACL.

[28]  Thomas Rist,et al.  Synthesizing illustrated documents : a plan-based approach , 1991 .

[29]  Norbert Reithinger,et al.  XTRA: A Natural-Language Access System to Expert Systems , 1989, Int. J. Man Mach. Stud..

[30]  Wendy G. Lehnert,et al.  A Critical Perspective on KRL , 1979, Cogn. Sci..

[31]  J. Kruse Book review: User Models in Dialog Systems Edited by A. Kobsa and W. Wahlster (Springer-Verlag, 1989) , 1991, SGAR.

[32]  Oliviero Stock,et al.  Natural Language and Exploration of an Information Space: The ALFresco Interactive System , 1991, IJCAI.

[33]  Wolfgang Wahlster,et al.  User Modelling in Anaphora Generation: Ellipsis and Definite Description , 1982, ECAI.

[34]  T. A. V. Dijk,et al.  Textwissenschaft : eine interdisziplinäre Einführung , 1980 .

[35]  Karin Harbusch,et al.  Constraining Tree Adjoining Grammars by Unification , 1990, COLING.

[36]  Günter Neumann,et al.  POPEL-HOW: A Distributed Parallel Model for Incremental Natural Language Production with Feedback , 1989, IJCAI.