Case-Based Reasoning for Knowledge-Intensive Template Selection During Text Generation

The present paper describes a case-based reasoning solution for solving the task of selecting adequate templates for realizing messages describing actions in a given domain. This solution involves the construction of a case base from a corpus of example texts, using information from WordNet to group related verbs together. A case retrieval net is used as a memory model. A taxonomy of the concepts involved in the texts is used to compute similarity between concepts. The set of data to be converted into text acts as a query to the system. The process of solving a given query may involve several retrieval processes – to obtain a set of cases that together constitute a good solution for transcribing the data in the query as text messages – and a process of knowledge-intensive adaptation which resorts to a knowledge base to identify appropriate substitutions and completions for the concepts that appear in the cases, using the query as a source. We describe this case-based solution, and we present examples of how it solves the task of selecting an appropriate set of templates to render a given set of data as text.

[1]  Raquel Hervás,et al.  Una arquitectura software para el desarrollo de aplicaciones de generación de lenguaje natural , 2004, Proces. del Leng. Natural.

[2]  Walter Daelemans,et al.  Introduction to the special issue on memory-based language processing , 1999, J. Exp. Theor. Artif. Intell..

[3]  Dekang Lin,et al.  Dependency-Based Evaluation of Minipar , 2003 .

[4]  Manfred Stede,et al.  Lexical Options in Multilingual Generation from a Knowledge Base , 1993, EWNLG.

[5]  Claire Cardie,et al.  Integrating case-based learning and cognitive biases for machine learning of natural language , 1999, J. Exp. Theor. Artif. Intell..

[6]  Hwee Tou Ng,et al.  Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach , 1996, ACL.

[7]  Raquel Hervás,et al.  Case Retrieval Nets for Heuristic Lexicalization in Natural Language Generation , 2005, EPIA.

[8]  Marie Wenzel Meteer,et al.  The “GENERATION GAP”: the problem of expressibility in text planning , 1990 .

[9]  Nuno Seco,et al.  Selection and Reuse of Software Design Patterns Using CBR and WordNet , 2003, SEKE.

[10]  Robert Dale,et al.  Building Natural Language Generation Systems: Figures , 2000 .

[11]  Mario Lenz,et al.  Case Retrieval Nets: Basic Ideas and Extensions , 1996, KI.

[12]  Songsak Channarukul,et al.  A Natural Language Generation Component for Dialog Systems , 1999 .

[13]  Michael Zock,et al.  Trends in Natural Language Generation An Artificial Intelligence Perspective , 1996, Lecture Notes in Computer Science.

[14]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[15]  Raquel Hervás,et al.  Story plot generation based on CBR , 2004, Knowl. Based Syst..

[16]  Walter Daelemans,et al.  The Acquisition of Stress: A Data-Oriented Approach , 1994, Comput. Linguistics.

[17]  A. Campbell,et al.  Progress in Artificial Intelligence , 1995, Lecture Notes in Computer Science.

[18]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.