Automatically generating text to accompany information graphics

Automatically generating text to accompany information graphics Mary Ellen Foster Master of Science Graduate Department of Computer Science University of Toronto 1999 Generally, when quantitative information is to be presented, some form of graphical presentation is used, often with a textual caption to ensure that the audience notices particular aspects of the data. This thesis presents the principles that should be followed by a system aiming to produce such captions automatically. The process of caption generation is examined in the context of the standard tasks in text generation. Most previous systems in this area produce textual summaries intended to stand alone; the issues involved in producing a caption differ, as the text must be coordinated with the graphic it is to accompany. The thesis also presents C APUT, a prototype caption-generation system which follows these principles to generate single-sentence captions for information graphics of the type that might appear in a newspaper article. Finally, extensions to CAPUT that would bring it from a prototype to a full-fledged caption generation system are proposed.

[1]  Kathleen McKeown,et al.  Empirically Designing and Evaluating a New Revision-Based Model for Summary Generation , 1996, Artif. Intell..

[2]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[3]  William E. Hefley,et al.  Intelligent Multimedia Presentation Systems: Research and Principles , 1991, AAAI Workshop on Intelligent Multimedia Interfaces.

[4]  Richard I. Kittredge,et al.  Using natural-language processing to produce weather forecasts , 1994, IEEE Expert.

[5]  James Gosling,et al.  The Java Programming Language" The Java Series , 1996 .

[6]  Massimo Fasciano Génération intégrée de textes et de graphiques statistiques , 1996 .

[7]  Jock D. Mackinlay,et al.  Automating the design of graphical presentations of relational information , 1986, TOGS.

[8]  Karen Kukich,et al.  Design of a Knowledge-Based Report Generator , 1983, ACL.

[9]  Steven F. Roth,et al.  Graphics and Natural Language as Components of Automatic Explanation , 1988, SGCH.

[10]  Michael Elhadad,et al.  An Overview of SURGE: a Reusable Comprehensive Syntactic Realization Component , 1996, INLG.

[11]  Johanna D. Moore,et al.  Saying it in graphics: from intentions to visualizations , 1998, Proceedings IEEE Symposium on Information Visualization (Cat. No.98TB100258).

[12]  Ken Arnold,et al.  The Java Programming Language , 1996 .

[13]  Alain Polguère,et al.  Generation of Extended Bilingual Statistical Reports , 1992, COLING.

[14]  Gene Zelazny Say It With Charts: The Executive's Guide to Visual Communication , 2001 .

[15]  Johanna D. Moore,et al.  Integrating planning and task-based design for multimedia presentation , 1997, IUI '97.

[16]  Igor Mel’čuk,et al.  Dependency Syntax: Theory and Practice , 1987 .

[17]  Robert Dale,et al.  Building applied natural language generation systems , 1997, Natural Language Engineering.

[18]  Edward R. Tufte,et al.  Envisioning Information , 1990 .

[19]  Edward Rolf Tufte,et al.  The visual display of quantitative information , 1985 .

[20]  Johanna D. Moore,et al.  Describing Complex Charts in Natural Language: A Caption Generation System , 1998, CL.

[21]  Benoit Lavoie,et al.  A Fast and Portable Realizer for Text Generation Systems , 1997, ANLP.

[22]  Stephen M. Kosslyn,et al.  Elements of graph design , 1993 .

[23]  Lidija Iordanskaja,et al.  Content determination and text structuring; two interrelated processes , 1993 .

[24]  Johanna D. Moore,et al.  A Media-Independent Content Language for Integrated Text and Graphics Generation , 1998 .

[25]  Giuseppe Carenini,et al.  Generating Visual Arguments: a Media-independent Approach , 1998 .