论文信息 - An Architecture for Data-to-Text Systems

An Architecture for Data-to-Text Systems

I present an architecture for data-to-text systems, that is NLG systems which produce texts from non-linguistic input data; this essentially extends the architecture of Reiter and Dale (2000) to systems whose input is raw data instead of AI knowledge bases. This architecture is being used in the BabyTalk project, and is based on experiences in several projects at Aberdeen; it also seems to be compatible with many data-to-text systems developed elsewhere. It consists of four stages which are organised in a pipeline: Signal Analysis, Data Interpretation, Document Planning, and Microplanning and Realisation.

Ehud Reiter | Ehud Reiter

[1] Ehud Reiter,et al. Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[2] Karen Kukich,et al. Design of a Knowledge-Based Report Generator , 1983, ACL.

[3] Chris Mellish,et al. Choosing the content of textual summaries of large time-series data sets , 2006, Natural Language Engineering.

[4] Donia Scott,et al. Structural variation in generated health reports , 2005, IWP@IJCNLP.

[5] Jim Hunter,et al. Generating English summaries of time series data using the Gricean maxims , 2003, KDD '03.

[6] Anna S. Law,et al. A Comparison of Graphical and Textual Presentations of Time Series Data to Support Medical Decision Making in the Neonatal Intensive Care Unit , 2005, Journal of Clinical Monitoring and Computing.

[7] Jim Hunter,et al. Choosing words in computer-generated weather forecasts , 2005, Artif. Intell..

[8] Ehud Reiter,et al. Generating Spatio-Temporal Descriptions in Pollen Forecasts , 2006, EACL.

[9] Fabio Pianesi,et al. Multimodal support to group dynamics , 2007, Personal and Ubiquitous Computing.

[10] Paul Piwek,et al. What is NLG? , 2002, INLG.

[11] James Shaw,et al. Practical Issues in Automatic Documentation Generation , 1994, ANLP.