In categorial systems with a fixed structural component, the learning problem comes down to finding the solution for a set of type-assignment equations. A hard-wired structural component is problematic if one wants to address issues of structural variation. Our starting point is a type-logical architecture with separate modules for the logical and the structural components of the computational system. The logical component expresses invariants of grammatical composition; the structural component captures variation in the realization of the correspondence between form and meaning. Learning in this setting involves finding the solution to both the type-assignment equations and the structural equations of the language at hand. We develop a view of these two subtasks which pictures learning as a process moving through a two-stage cycle. In the first phase of the cycle, type assignments are computed statically from structures. In the second phase, the lexicon is enhanced with facilities for structural reasoning. These make it possible to relate structures dynamically during on-line computation, or to establish off-line lexical generalizations. We report on the initial experiments in [15] to apply this method in the context of the Spoken Dutch Corpus. For the general type-logical background, we refer to [12]; §1 has a brief recap of some key features.

1 Constants and Variation

One can think of type-logical grammar as a functional programming language with some special-purpose features to customize it for natural language processing tasks. Basic constructs are demonstrations of the form Γ ⊢ A, stating that a structure Γ is a well-formed expression of type A. These statements are the outcome of a process of computation. Our programming language has a built-in vocabulary of logical constants to construct the type formulas over some set of atomic formulas, in terms of the indexed unary and binary operations of (1a).
Parallel to the formula language, we have the structure-building operations of (1b), with (· ◦i ·) and ⟨·⟩j as counterparts of •i and ♦j respectively. The indices i and j are taken from given, finite sets I, J which we refer to as composition modes.

(1)  a. Typ ::= Atom | ♦j Typ | □j Typ | Typ •i Typ | Typ /i Typ | Typ \i Typ
     b. Struc ::= Typ | ⟨Struc⟩j | Struc ◦i Struc

P. de Groote, G. Morrill, C. Retoré (Eds.): LACL 2001, LNAI 2099, pp. 1–16, 2001.
© Springer-Verlag Berlin Heidelberg 2001