Using Bidirectional Semantic Rules for Generation

This paper describes the use of a system of semantic rules to generate noun compounds, vague or polysemous words, and cases of metonymy. The rules are bidirectional and are used by the understanding system to interpret the same constructions. I n t r o d u c t i o n In generation systems that are paired with understanding systems, bidirectionality is desirable for reasons that are both theoretical (a single model of linguistic behaviour) and practical (shorter development time, greater consistency, etc.) 1. Recently, [Shieber et al. 89] and [Calder at al. 89] have presented generation algor i thms tha t share both semantics and syntax with the understanding system. This paper presents an extension of these algori thms to deal with phenomena that have often been lumped together under 'pragmat ics ' , namely noun compounding, metonymy (the use of a word to refer to a related concept), and vague or polysemous words like "have." The difficulty with these constructions is tha t they are productive, and cannot be handled easily by simply listing meanings in a lexicon. Taking noun compounding as an example, we have "corn oil" and "olive oil" referring to oil made from corn or olives. We could add a lexical sense for "corn" meaning "made from corn," but then we face an explosion in the size of the lexicon, and an inability to understand or generate novel compounds: if we acquire "safflower" as the name of a plant, we would like the system to be able to handle "safflower oil" immediately, but this won' t be possible if we need a separate lexical sense to handle compounding. The system will be more robust (and the lexicon more compact) if we can derive the desired sense of "safflower" from the basic noun sense when we need it. We have therefore developed a system of bidirectional semantic rules to handle these phenomena at the appropriate level of generality. IF or more detailed arguments along these lines, see [Appelt 87], [Shieber 88], [Jacobs 88a]. We have implemented these rules in Common Lisp as par t of the KBNL system [Barnett et M. 90] at MCC, but nothing depends on the idiosyncracies of our formalisms or implementa t ion , so the technique is compatible with a wide variety of theories of the kinds of relations that are likely to occur in these constructions, as in, e.g., [Finin 80] for noun compounds and [Nunberg 78] for oblique reference. T h e F r a m e w o r k The algorithms for recognition and generation use an agenda-based blackboard for communication and control [Cohen et al. 89]. Our syntax component uses an extension of Categorial Unification Grammar [Wittenburg 86] as the phrase-structure component of an LFG-style functional representation (f-struCture), and the semantic component maps from this representat ion to sets of assertions in the interface language of the CYC knowledge base [Lenat et al. 90]. Semantic rules map partial semantic interpretations onto other partial interpretations. They consist of a left-hand side and a right-hand side, each consisting of one or more templates, plus a mechanism for mapping an instantiation of either set of templates onto an instantiat ion of the other set. The intuitive semantics of these rules is that any interpretation that matches the left-hand side licenses a second interpretation matching the right-hand side. For example, we can use the name of an author to refer to his works ("I read Shakespeare"), and the corresponding semantic rule states that the existence of an NP denoting an artist licences the use of the same NP to refer to his works. The generation system applies the rules in a backward-chaining direction, while the understanding system runs them forward. A later section contains a fuller discussion of the implementat ion of the rules, while the next sections discuss their use at runtime.

[1]  Kathleen R. McKeown Generating natural language text in response to questions about database structure , 1982 .

[2]  Eduard Hovy,et al.  Generating language with a phrasal lexicon , 1988 .

[3]  Paul S. Jacobs,et al.  PHRED: A Generator for Natural Language Interfaces , 1985, Comput. Linguistics.

[4]  Paul S. Jacobs Achieving bidirectionality , 1988, COLING.

[5]  Douglas E. Appelt TELEGRAM: A Grammar Formalism for Language Planning , 1983, IJCAI.

[6]  James Pustejovsky,et al.  Lexical Selection in the Process of Language Generation Sergei Nirenburg , 1987, ACL.

[7]  David D. McDonald,et al.  From Water To Wine: Generating Natural Language Text From Today's Applications Programs , 1988, ANLP.

[8]  斉藤 康己,et al.  Douglas B. Lenat and R. V. Guha : Building Large Knowledge-Based Systems, Representation and Inference in the Cyc Project, Addison-Wesley (1990). , 1990 .

[9]  Henk Zeevat,et al.  An Algorithm for Generation in Unification Categorial Grammar , 1989, EACL.

[10]  Kent Wittenburg,et al.  A Parser for Portable NL Interfaces Using Graph-Unification-Based Grammars , 1986, AAAI.

[11]  Douglas E. Appelt,et al.  Bidirectional Grammars and the Design of Natural Language Generation Systems , 1987, TINLAP.

[12]  Gertjan van Noord,et al.  A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms , 1989, ACL.

[13]  David D. McDonald,et al.  A Model of Revision in Natural Language Generation , 1986, ACL.

[14]  G. Nunberg The pragmatics of reference , 1978 .

[15]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[16]  Stuart M. Shieber,et al.  A Uniform Architecture for Parsing and Generation , 1988, COLING.

[17]  Deborah A. Dahl,et al.  Nominalizations in PUNDIT , 1987, ACL.

[18]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[19]  Douglas E. Appelt,et al.  TEAM: An Experiment in the Design of Transportable Natural-Language Interfaces , 1987, Artif. Intell..

[20]  Timothy W. Finin,et al.  The semantic interpretation of compound nominals , 1980 .