Text Generation in Clinical Medicine – a Review

OBJECTIVE: This article analyzes how documents such as findings or referral letters are produced in clinical medicine. Special emphasis is given to the question of whether the field of Natural Language Generation (NLG) can provide new approaches that improve the current situation.

METHODS: To assess the techniques currently used in text production, commercially available systems were analyzed and the literature was extensively reviewed. The sketch of current NLG approaches is likewise based on a literature review. To estimate the applicability of several techniques to clinical documents, a typology of documents in clinical medicine was developed, based on rhetorical structure theory, speech act theory, and certain recurrent linguistic phenomena found in these documents.

RESULTS: Current ways of producing text for documents in medicine are less than optimal in several respects. NLG draws on the idea of generating text from a conceptual representation that covers not only certain facts but also knowledge about how to express them in (written) language. Unfortunately, NLG does not yet offer "ready-to-run" solutions for the automatic production of most of the document types in the given typology. It seems highly plausible, however, that the demands of medical informatics for these kinds of systems will become satisfiable as NLG matures.

CONCLUSIONS: NLG offers a promising way of generating text for clinical documents, a problem of enormous economic importance. The medical informatics community should therefore commit itself to the idea of NLG in medicine.
