Designing a Portable Natural Language Database Query System

One barrier to the acceptance of natural language database query systems is the substantial installation effort required for each new database. Much of this effort involves the encoding of semantic knowledge for the domain of discourse, necessary to correctly interpret and respond to natural language questions. For such systems to be practical, techniques must be developed to increase their portability to new domains. This paper discusses several issues involving the portability of natural language interfaces to database systems, and presents the approach taken in CO-OP — a natural language database query system that provides cooperative responses to English questions and operates with a typical CODA-SYL database system. CO-OP derives its domain-specific knowledge from a lexicon (the list of words known to the system) and the information already present in the structure and content of the underlying database. Experience with the implementation suggests that strategies that are not directly derivative of cognitive or linguistic models may nonetheless play an important role in the development of practical natural language systems.

[1]  Bertram C. Bruce Case Systems for Natural Language , 1975, Artif. Intell..

[2]  J. Sadock,et al.  Ambiguity Tests and How to Fail Them , 1975 .

[3]  Terry Winograd,et al.  Understanding natural language , 1974 .

[4]  Gary G. Hendrix Human Engineering for Applied Natural Language Processing , 1977, IJCAI.

[5]  Samuel Jerrold Kaplan,et al.  Cooperative responses from a portable natural language data base query system. , 1979 .

[6]  Peter P. Chen The entity-relationship model: toward a unified view of data , 1975, VLDB '75.

[7]  Michael Brady,et al.  Cooperative Responses From a Portable Natural Language Database Query System , 1983 .

[8]  Ramez Elmasri,et al.  The Structural Model for Database Design , 1979, ER.

[9]  Larry R. Harris,et al.  The ROBOT System: Natural language processing applied to data base query , 1978, ACM Annual Conference.

[10]  Kathleen McKeown,et al.  Paraphrasing Using Given and New Information in a Question-Answer System , 1979, ACL.

[11]  S. Jerrold Kaplan On The Difference Between Natural Language And High Level Query Languages , 1978, ACM Annual Conference.

[12]  Gary G. Hendrix,et al.  Mechanical intelligence : research and applications , 1977 .

[13]  Gary G. Hendrix,et al.  Developing a natural language interface to complex data , 1977, TODS.

[14]  Laurian M. Chirica,et al.  The entity-relationship model: toward a unified view of data , 1975, SIGF.

[15]  Gary G. Herdrix Human engineering fcr applied natural language processing , 1977, IJCAI 1977.

[16]  Dennis McLeod,et al.  A Semantic Data Base Model and Its Associated Structured User Interface. , 1978 .

[17]  Gary G. Hendrix,et al.  An Approach to Acquiring and Applying Knowledge , 1980, AAAI.

[18]  William A. Martin Some Comments On EQS, A Near Term Natural Language Data Base Query System , 1978, ACM Annual Conference.

[19]  F. B. Thompson,et al.  REL: A Rapidly Extensible Language system , 1969, ACM '69.