Complex events in an ontological-semantic natural language processing system

The goal of this dissertation is to elucidate principles for representing complex-event knowledge (or “scripts”) for use in an ontological-semantic natural language processing system, specifically the Mikrokosmos system. Complex events are, simply, events that comprise other events. A shorthand example is: the event of buying may comprise picking out merchandise, bringing it to the cash register, offering money to the cashier, receiving change, and leaving the store with the merchandise. As previous research has shown, texts make widespread use of such “world knowledge” by leaving many events implicit, relying on the hearer/reader to infer this information within the discourse context. The challenge for natural language processing programs is to construct a model of this world knowledge to fill in these gaps. Previous programs armed with such knowledge, e.g., SAM (Cullingford 1978) and Ms. Malaprop (Charniak 1977), have made some advances, but several design problems prevented real progress. It is argued in this dissertation that the ontological-semantic paradigm generates semantic descriptions of texts rich enough to make use of complex-event knowledge, thereby eliminating one barrier to implementation of such knowledge. Furthermore, this dissertation develops specific modifications to the ontological-semantic system that enable representations of complex-event knowledge to: (i) represent, in principle, any sequence of “real-world” events, (ii) achieve broad conceptual coverage, (iii) discern “fine-grained” differences in conceptual information, and (iv) significantly reduce redundancy in knowledge representation. Several constructed complex-event descriptions are adduced as evidence of feasibility of the formalism. Though this knowledge has not yet been implemented, the suggestions made in this dissertation should prove to be a significant step in representing the immense body of complex-event knowledge for natural language processing systems.

[1]  Marvin Minsky,et al.  A framework for representing knowledge" in the psychology of computer vision , 1975 .

[2]  R. Ratcliff,et al.  Making the connection: Generalized knowledge structures in story understanding , 1989 .

[3]  R. Schank,et al.  Inside Computer Understanding , 1981 .

[4]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[5]  J. Fodor,et al.  The structure of a semantic theory , 1963 .

[6]  James F. Allen Towards a General Theory of Action and Time , 1984, Artif. Intell..

[7]  Yehoshua Bar-Hillel,et al.  The Present Status of Automatic Translation of Languages , 1960, Adv. Comput..

[8]  Roger C. Schank,et al.  Conceptual dependency: A theory of natural language understanding , 1972 .

[9]  James F. Allen Natural language understanding , 1987, Bejnamin/Cummings series in computer science.

[10]  Makoto Nagao,et al.  Knowledge and Inference , 1990 .

[11]  Richard Edward Cullingford,et al.  Script application: computer understanding of newspaper stories. , 1977 .

[12]  Karen Spärck Jones,et al.  Readings in natural language processing , 1986 .

[13]  Terry Winograd,et al.  Understanding natural language , 1974 .

[14]  Evelyne Viegas An Overt Semantics with a Machine-guided Approach for Robust LKBs , 1999, SIGLEX Workshop On Standardizing Lexical Resources.

[15]  Roger C. Schank,et al.  Dynamic Memory Revisited , 1999 .

[16]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[17]  Dieter Metzing,et al.  Frame conceptions and text understanding , 1980 .