CUNI–Malta system at SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context: Operation-based word formation

This paper presents the submission by the Charles University-University of Malta team to the SIGMORPHON 2019 Shared Task on Morphological Analysis and Lemmatization in context. We present a lemmatization model based on previous work on neural transducers (Makarov and Clematide, 2018b; Aharoni and Goldberg, 2016). The key difference is that our model transforms the whole word form in every step, instead of consuming it character by character. We propose a merging strategy inspired by Byte-Pair-Encoding that reduces the space of valid operations by merging frequent adjacent operations. The resulting operations not only encode the actions to be performed but the relative position in the word token and how characters need to be transformed. Our morphological tagger is a vanilla biLSTM tagger that operates over operation representations, encoding operations and words in a hierarchical manner. Even though relative performance according to metrics is below the baseline, experiments show that our models capture important associations between interpretable operation labels and fine-grained morpho-syntax labels.

[1]  Ryan Cotterell,et al.  Marrying Universal Dependencies and Universal Morphology , 2018, UDW@EMNLP.

[2]  Hao Zhu,et al.  Neural Finite-State Transducers: Beyond Rational Relations , 2019, NAACL.

[3]  Katharina Kann,et al.  Single-Model Encoder-Decoder with Explicit Morphological Representation for Reinflection , 2016, ACL.

[4]  Rico Sennrich,et al.  Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.

[5]  Simon Clematide,et al.  Imitation Learning for Neural Morphological String Transduction , 2018, EMNLP.

[6]  Annette Rios,et al.  A Basic Language Technology Toolkit for Quechua , 2015, Proces. del Leng. Natural.

[7]  Ryan Cotterell,et al.  Weighting Finite-State Transductions With Neural Context , 2016, NAACL.

[8]  Josef van Genabith,et al.  An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages , 2017, EACL.

[9]  Mehryar Mohri Weighted Finite-State Transducer Algorithms. An Overview , 2004 .

[10]  Marco Antonio Sobrevilla Cabezudo,et al.  Ship-LemmaTagger: Building an NLP Toolkit for a Peruvian Native Language , 2017, TSD.

[11]  Jason Eisner,et al.  Parameter Estimation for Probabilistic Finite-State Transducers , 2002, ACL.

[12]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[13]  Ryan Cotterell,et al.  UniMorph 3.0: Universal Morphology , 2018, LREC.

[14]  Simon Clematide,et al.  UZH at CoNLL-SIGMORPHON 2018 Shared Task on Universal Morphological Reinflection , 2018, CoNLL Shared Task.

[15]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[16]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[17]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[19]  Simon Clematide,et al.  Neural Transition-based String Transduction for Limited-Resource Setting in Morphology , 2018, COLING.

[20]  Yoav Goldberg,et al.  Morphological Inflection Generation with Hard Monotonic Attention , 2016, ACL.

[21]  Ryan Cotterell,et al.  A Simple Joint Model for Improved Contextual Neural Lemmatization , 2019, NAACL.

[22]  Daniel Zeman,et al.  A Morphological Analyzer for Shipibo-Konibo , 2018 .

[23]  Philip Gage,et al.  A new algorithm for data compression , 1994 .