Minimally supervised lemmatization scheme induction through bilingual parallel corpora
暂无分享,去创建一个
We present a lemma induction scheme on a target language through minimally supervised alignment and transfer methods utilizing English-to-German parallel corpora. Compared to previous alignment and transfer approaches, the approach outlined here increases computational efficiency and significantly reduces the level of supervision necessary in inducing clusters of inflectional forms. Furthermore, we increase our search field to include not only verbs but also nouns and adjectives in the target language, and achieve comparable results to previous unsupervised monolingual methods.
[1] David Yarowsky,et al. Minimally Supervised Morphological Analysis by Multimodal Alignment , 2000, ACL.
[2] Mona T. Diab,et al. An Unsupervised Method for Multilingual Word Sense Tagging Using Parallel Corpora , 2000, ACL 2000.
[3] Key-Sun Choi,et al. Bilingual Knowledge Acquisition from Korean-English Parallel Corpus Using Alignment , 1996, COLING.
[4] Dekai Wu,et al. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.