A Multi-purpose Bayesian Model for Word-Based Morphology

This paper introduces a probabilistic model of morphology based on a word-based morphological theory. Morphology is understood here as a system of rules that describe systematic correspondences between full word forms, without decomposing words into any smaller units. The model is formulated in the Bayesian learning framework and can be trained in both supervised and unsupervised setting. Evaluation is performed on tasks of generating unseen words, lemmatization and inflected form production.

[1]  Daniel Jurafsky,et al.  Morphological features help POS tagging of unknown words across language varieties , 2005, IJCNLP.

[2]  Benilde Grana López,et al.  COMPOUND FORMATION IN GENERATIVE GRAMMAR , 1994 .

[3]  Christian Simon,et al.  Morphisto - An Open Source Morphological Analyzer for German , 2009, FSMNLP.

[4]  Mark Aronoff,et al.  Word Formation in Generative Grammar , 1979 .

[5]  Mikko Kurimo,et al.  Supervised Morphological Segmentation in a Low-Resource Learning Setting using Conditional Random Fields , 2013, CoNLL.

[6]  John DeNero,et al.  Supervised Learning of Complete Morphological Paradigms , 2013, NAACL.

[7]  David Yarowsky,et al.  Modeling and learning multilingual inflectional morphology in a minimally supervised framework , 2003 .

[8]  Maciej Janicki Unsupervised Learning of A-Morphous Inflection with Graph Clustering , 2013, RANLP.

[9]  R. Ewy,et al.  ABSTRACT , 1986 .

[10]  Robert E. Tarjan,et al.  Finding optimum branchings , 1977, Networks.

[11]  Andrei Mikheev,et al.  Automatic Rule Induction for Unknown-Word Guessing , 1997, CL.

[12]  Wolfgang Lezius,et al.  TIGER: Linguistic Interpretation of a German Corpus , 2004 .

[13]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[14]  Mikko Kurimo,et al.  Morpho Challenge 2005-2010: Evaluations and Results , 2010, SIGMORPHON.

[15]  Burcu Can,et al.  Statistical models for unsupervised learning of morphology and POS tagging , 2011 .

[16]  Nizar Habash,et al.  Unsupervised Morphology-Based Vocabulary Expansion , 2014, ACL.

[17]  Hoifung Poon,et al.  Unsupervised Morphological Segmentation with Log-Linear Models , 2009, NAACL.

[18]  Amit Kirschenbaum Unsupervised Segmentation for Different Types of Morphological Processes Using Multiple Sequence Alignment , 2013, SLSP.

[19]  Ming-Wei Chang,et al.  Unified Expectation Maximization , 2012, NAACL.

[20]  Sean A. Fulop,et al.  Unsupervised Learning of Morphology Without Morphemes , 2002, SIGMORPHON.

[21]  Lars Borin,et al.  Unsupervised Learning of Morphology , 2011, CL.

[22]  Erwin Chan,et al.  Learning Probabilistic Paradigms for Morphology in a Latent Class Model , 2006, SIGMORPHON.

[23]  Josef van Genabith,et al.  Learning Morphology with Morfette , 2008, LREC.

[24]  Mikko Kurimo,et al.  Morfessor 2.0: Python Implementation and Extensions for Morfessor Baseline , 2013 .

[25]  Gita Martohardjono,et al.  Pace Panini: Towards a Word-Based Theory of Morphology , 1997 .

[26]  David Yarowsky,et al.  Minimally Supervised Morphological Analysis by Multimodal Alignment , 2000, ACL.

[27]  Richard Wicentowski,et al.  Proceedings of the Eighth Meeting of the ACL Special Interest Group on Computational Phonology and Morphology at HLT-NAACL 2006 , 2006, HLT-NAACL 2006.