On the search for semantic primitives

Dailey (1986) has transformed the classic problem of the "vicious cycle or circle" into the neoclassical paradigm of "NP-completeness". As presented, there can be no disagreement with his conclusion that the problem of extracting a minimum set of semantic primitives from a monolingual dictionary is NP-complete. However, I believe (1) his use of NP-completeness is inappropriate to the problem of finding semantic primitives in a monolingual dictionary and (2) his notion of how a primitive must emerge from an examination of definitions is not sufficiently perceptive. Some suggestions for dealing with NP-complete problems include developing algorithms that are fast enough for small problems, that deal with special cases, or that exploit special features of a particular instance of the problem. With a large dictionary, such as Webster's Third New International Dictionary, it would seem that we are at a size where NP-completeness comes into play. Even here, it is possible to reduce the problem to a manageable level without invoking any special procedures, using only a correct formulation of the problem. The definienda and definitions of a monolingual dictionary can be mapped into a graph theoretic structure with nodes corresponding to definienda (as Dailey does), but with links going from a word occurring in a definition to the word being defined (opposite to Dailey's direction). The problem of finding semantic primitives in this model is then equivalent to finding what is called in digraph theory the point basis of a digraph, i.e., those nodes from which all others in the dictionary are reachable. Using straightforward algorithms to do this, the size of the problem can be reduced considerably. (In Litkowski 1978 and 1980, I show how this approach reduced an initial set of 20,000 verbs to 4,000 "more primitive" verbs. Amsler (1980) similarly used many heuristics in analyzing verb definitions in a monolingual pocket dictionary.) After applying such "gross" techniques for pruning, the problem is more tractable and it is possible to take advantage of special features of the problem at hand. In particular, it is possible to develop many heuristics which take into account many of the characteristics of a dictionary as well as many semantic considerations. These heuristics make it possible to reduce the problem even further, moving ever closer to semantic primitives. With all that has been said above, I do not want to leave the impression that the problem is in any sense easy. If that were the case, the literature would be replete with claims based on analysis of dictionaries that such and such constitute primitives. Such analysis is

[1]  D. Rumelhart,et al.  A model for analogical reasoning. , 1973 .

[2]  Sandra Carberry Understanding Pragmatically Ill-Formed Input , 1984, COLING.

[3]  E. Judith Weiner,et al.  A Nowledge Representation Approach to Understanding Metaphors , 1984, Comput. Linguistics.

[4]  Kenneth C. Litkowski Models of the Semantic Structure of Dictionaries , 1978, CL.

[5]  David P. Dailey The Extraction of a Minimum Set of Semantic Primitives from a Monolingual Dictionary is NP-Complete , 1986, CL.

[6]  Julia Hirschberg,et al.  User Participation in the Reasoning Processes of Expert Systems , 1982, AAAI.

[7]  Candace L. Sidner,et al.  Focusing in the comprehension of definite anaphora , 1986 .

[8]  Scott Weinstein,et al.  Providing a Unified Account of Definite Noun Phrases in Discourse , 1983, ACL.

[9]  Samuel Jerrold Kaplan,et al.  Cooperative responses from a portable natural language data base query system. , 1979 .

[10]  Allan Collins,et al.  Misconceptions in student's understanding , 1979 .

[11]  Austin Henderson,et al.  RABBIT: An Intelligent Database Assistant , 1982, AAAI.

[12]  Robert Alfred Amsler The Structure of the Merriam-Webster Pocket Dictionary , 1980 .

[13]  Kathleen R. McKeown Generating natural language text in response to questions about database structure , 1982 .

[14]  Albert L Stevens,et al.  Multiple Conceptual Models of a Complex System. , 1978 .

[15]  Allan M. Collins,et al.  Mixed-Initiative Systems for Training and Decision-Aid Applications. , 1970 .

[16]  Derek H. Sleeman Inferring (Mal) Rules from Pupil's Protocols , 1982, ECAI.

[17]  Barbara J. Grosz,et al.  Focusing and Description in Natural Language Dialogues , 1979 .

[18]  David D. McDonald,et al.  Human-computer discourse in the design of a PASCAL tutor , 1983, CHI '83.

[19]  David D. McDonald Natural language production as a process of decision-making under constraints , 1980 .

[20]  John Seely Brown,et al.  Diagnostic Models for Procedural Bugs in Basic Mathematical Skills , 1978, Cogn. Sci..

[21]  Kenneth C. Litkowski Requirements of Text - Processing-Lexicons , 1980, ACL.

[22]  Candace L. Sidner,et al.  Discourse Structure and the Proper Treatment of Interruptions , 1985, IJCAI.

[23]  A. Tversky Features of Similarity , 1977 .

[24]  Daniel G. Bobrow,et al.  On Overview of KRL, a Knowledge Representation Language , 1976, Cogn. Sci..

[25]  Roger C. Schank,et al.  Conceptual dependency: A theory of natural language understanding , 1972 .

[26]  B. Woolf Context Dependent Planning in a Machine Tutor , 1984 .