Parsing Swedish
暂无分享,去创建一个
CG is a reductionistic constraint rule formalism whose input is lexically analysed ambiguous text and whose output is disambiguated text. Disambiguation is carried out by constraints on lemmas and tags that discard alternative analyses on the basis of contextual information, typically coded by a linguist. The ENGCG morphosyntactic tagger was introduced in 1992 (Voutilainen & al.) and compared with a state-of-the-art statistical tagger in 1997 (Samuelsson & Voutilainen). CG was successful in word-class tagging but not adequate for full-scale parsing. A considerable effort on finite-state parsing was made by Koskenniemi, Tapanainen and Voutilainen (see their articles in Roche & Schabes, eds., 1997). A more successful effort was made by Tapanainen and Järvinen, who extended CG into a functional dependency grammar formalism and interpreter/compiler capable of introducing explicit functional dependencies and of applying large grammars efficiently.
[1] Sofie Johansson Kokkinakis,et al. A Cascaded Finite-State Parser for Syntactic Analysis of Swedish , 1999, EACL.
[2] Atro Voutilainen,et al. Comparing a Linguistic and a Stochastic Tagger , 1997, ACL.
[3] Timo Järvinen,et al. A non-projective dependency parser , 1997, ANLP.
[4] Björn Gambäck. Processing Swedish sentences : a unification-based grammar and some applications , 1997 .