BabyFST - Towards a Finite-State Based Computational Model of Ancient Babylonian

Akkadian is a fairly well-resourced extinct language that does not yet have a comprehensive morphological analyzer available. In this paper we describe a general finite-state based morphological model for Babylonian, a southern dialect of the Akkadian language, that achieves a coverage of up to 97.3% and a recall of up to 93.7% on token-level lemmatization and POS tagging of transcribed input. Akkadian word forms exhibit a high degree of morphological ambiguity: only 20.1% of running word tokens receive a single unambiguous analysis. We therefore attempt a first pass at weighting our finite-state transducer, using existing extensive Akkadian corpora that have been partially validated for their lemmas and parts of speech but not for their entire morphological analyses. The resulting weighted finite-state transducer yields a moderate improvement, so that for 57.4% of the word tokens the highest-ranked analysis is the correct one. We conclude with a short discussion of how morphological ambiguity in the analysis of Akkadian could be further reduced, both by improving the training data used to weight the finite-state transducer and by applying other, context-based techniques.
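The abstract does not spell out how the weights are derived, so the following is only a minimal sketch of one plausible scheme, not the authors' method. It assumes that each analysis reduces to a (lemma, POS) pair, that weights are smoothed negative log relative frequencies counted from the validated corpus, and that lower weights rank higher, as in a tropical semiring. The names `train_weights` and `rank_analyses` and the placeholder labels such as `lemma_a` are hypothetical.

```python
import math
from collections import Counter


def train_weights(validated_pairs, smoothing=1.0):
    """Build a weight function from validated (lemma, POS) pairs.

    Assumption: weight = -log of the add-one-smoothed relative frequency,
    so frequent pairs get low weights and unseen pairs get the highest.
    """
    counts = Counter(validated_pairs)
    total = sum(counts.values())
    # One extra slot in the denominator reserves mass for unseen pairs.
    denominator = total + smoothing * (len(counts) + 1)

    def weight(pair):
        return -math.log((counts[pair] + smoothing) / denominator)

    return weight


def rank_analyses(candidates, weight):
    """Order the candidate analyses of one token, lowest weight first."""
    return sorted(candidates, key=weight)


# Hypothetical toy corpus: placeholder labels, not real Akkadian data.
validated = [("lemma_a", "N")] * 3 + [("lemma_b", "V")]
weight = train_weights(validated)

# Three competing analyses that an unweighted transducer might return
# for a single ambiguous word form.
candidates = [("lemma_b", "V"), ("lemma_a", "N"), ("lemma_c", "AJ")]
ranked = rank_analyses(candidates, weight)
print(ranked[0])
```

Because the corpora are validated only for lemma and part of speech, a scheme like this cannot separate analyses that share both and differ only in their remaining morphological tags, which is one reason the top-ranked analysis can still be wrong.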
