DATR: A Language for Lexical Knowledge Representation

Much recent research on the design of natural language lexicons has made use of nonmonotonic inheritance networks as originally developed for general knowledge representation purposes in Artificial Intelligence. DATR is a simple, spartan language for defining nonmonotonic inheritance networks with path/value equations, one that has been designed specifically for lexical knowledge representation. In keeping with its intentionally minimalist character, it lacks many of the constructs embodied either in general-purpose knowledge representation languages or in contemporary grammar formalisms. The present paper shows that the language is nonetheless sufficiently expressive to represent concisely the structure of lexical information at a variety of levels of linguistic analysis. The paper provides an informal, example-based introduction to DATR and to techniques for its use, including finite-state transduction, the encoding of DAGs and lexical rules, and the representation of ambiguity and alternation. Sample analyses of phenomena such as inflectional syncretism and verbal subcategorization are given that show how the language can be used to squeeze out redundancy from lexical descriptions.
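To make the idea of a nonmonotonic inheritance network over path/value equations more concrete, the following is a minimal Python sketch, not DATR itself and not the authors' implementation. It assumes a heavily simplified model: each node maps paths to either an atomic value or a reference to another node, and a query is answered by the longest matching path prefix defined at the node. The node names (`VERB`, `AUX`, `Can`), the paths, and the helpers `Inherit` and `query` are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Tuple, Union

Path = Tuple[str, ...]


class Inherit:
    """Hypothetical marker: 'take the value for this path from another node'."""

    def __init__(self, node: str) -> None:
        self.node = node


Theory = Dict[str, Dict[Path, Union[str, Inherit]]]

# Illustrative toy lexicon (an assumption, not taken from the paper).
# The empty path () acts as the default case for a node.
THEORY: Theory = {
    "VERB": {
        ("syn", "cat"): "verb",
        ("syn", "type"): "main",
    },
    "AUX": {
        (): Inherit("VERB"),        # by default, behave like VERB
        ("syn", "type"): "aux",     # but override the type
    },
    "Can": {
        (): Inherit("AUX"),         # by default, behave like AUX
        ("mor", "root"): "can",
    },
}


def query(theory: Theory, node: str, path: Path) -> str:
    """Resolve node:path using the longest defined prefix of the path."""
    path = tuple(path)
    equations = theory[node]
    # Try the full path first, then ever shorter prefixes, down to ().
    for cut in range(len(path), -1, -1):
        prefix = path[:cut]
        if prefix in equations:
            rhs = equations[prefix]
            if isinstance(rhs, Inherit):
                # Pass the whole query path on to the parent node.
                return query(theory, rhs.node, path)
            return rhs
    raise KeyError(f"{node}:<{' '.join(path)}> is undefined")


if __name__ == "__main__":
    print(query(THEORY, "Can", ("syn", "cat")))
    print(query(THEORY, "Can", ("syn", "type")))
    print(query(THEORY, "Can", ("mor", "root")))
```

In this sketch, the more specific equation at `AUX` overrides the one inherited from `VERB`, which is the nonmonotonic aspect, while `Can` states only what is idiosyncratic to it. The sketch leaves out the rest of the language, such as global inheritance and value sequences.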
