Multitiered nonlinear morphology using multitape finite automata: a case study on Syriac and Arabic

This paper presents a computational model for nonlinear morphology with illustrations from Syriac and Arabic. The model is a multitiered one in that it allows for multiple lexical representations corresponding to the multiple tiers of autosegmental phonology. The model consists of three main components: (i) a lexicon, which is made of sublexica, with each sublexicon representing lexical material from a specific tier, (ii) a rewrite rules component that maps multiple lexical representations into one surface form and vice versa, and (iii) a morphotactic component that employs regular grammars. The system is finite-state in that lexica and rules can be represented by multitape finite-state machines.

[1]  Richard Sproat,et al.  An Efficient Compiler for Weighted Rewrite Rules , 1996, ACL.

[2]  J. J. Mc Carthy A Prosodic Theory of Nonconcatenative Morphology , 1981 .

[3]  Lauri Karttunen,et al.  Finite-state lexicon compiler , 1993 .

[4]  Jorge E. Mezei,et al.  On Relations Defined by Generalized Finite Automata , 1965, IBM J. Res. Dev..

[5]  Zellig S. Harris,et al.  Linguistic Structure of Hebrew , 1941 .

[6]  Kenneth R. Beesley,et al.  Arabic Morphology Using Only Finite-State Operations , 1998, SEMITIC@COLING.

[7]  Lauri Karttunen,et al.  Two-level rule compiler , 1992 .

[8]  Richard Sproat,et al.  Morphology and computation , 1992 .

[9]  Dana S. Scott,et al.  Finite Automata and Their Decision Problems , 1959, IBM J. Res. Dev..

[10]  Kenneth R. Beesley Computer Analysis of Arabic Morphology: A two-level approach with detours , 1991 .

[11]  M. A. Hill An Eecient Compiler for Weighted Rewrite Rules , 1996 .

[12]  J. McCarthy OCP effects: Gemination and antigemination , 1986 .

[13]  BirdSteven,et al.  One-level phonology , 1994 .

[14]  Kenneth R. Beesley Constraining Separated Morphotactic Dependencies in Finite-State Grammars , 1998 .

[15]  John J. McCarthy,et al.  Template form in prosodic morphology , 1993 .

[16]  George Anton Kiraz Computational Nonlinear Morphology with Emphasis on Semitic Languages. Studies in Natural Language Processing. , 2001 .

[17]  Patrick C. Fischer,et al.  Multi-tape and infinite-state automata—a survey , 1965, Commun. ACM.

[18]  Lauri Karttunen,et al.  Two-Level Morphology with Composition , 1992, COLING.

[19]  George Anton Kiraz Linearization of Nonlinear Lexical Representations , 1997, SIGMORPHON@EACL.

[20]  M. McShane,et al.  Bootstrapping Morphological Analyzers by Combining Human Elicitation and Machine Learning , 2001, Computational Linguistics.

[21]  Kenneth R. Beesley Arabic Finite-State Morphological Analysis and Generation , 1996, COLING.

[22]  Stephen G. Pulman,et al.  A feature-based formalism for two-level phonology: a description and implementation , 1993, Comput. Speech Lang..

[23]  Gertjan van Noord FSA Utilities: A Toolbox to Manipulate Finite-State Automata , 1996, Workshop on Implementing Automata.

[24]  George Anton Kiraz,et al.  Lexical Tools to the Syriac New Testament , 1995 .

[25]  Bruce W. Watson,et al.  Incremental construction of minimal acyclic finite state automata , 2000, CL.

[26]  Steven Bird,et al.  One-Level Phonology: Autosegmental Representations and Rules as Finite Automata , 1994, Comput. Linguistics.

[27]  John Bear,et al.  Morphology with Two-Level Rules and Negative Rule Features , 1988, COLING.

[28]  George Anton Kiraz Multi-Tape Two-Level Morphology: A Case Study in Semitic Non-linear Morphology , 1994, COLING.

[29]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[30]  Alan S. Prince,et al.  Prosodic morphology and templatic morphology , 1990 .

[31]  Kenneth R. Beesley Arabic Stem Morphotactics via Finite-State Intersection , 1999 .

[32]  Bruce Wiebe,et al.  Modelling autosegmental phonology with multi-tape finite state transducers , 1992 .

[33]  George Anton Kiraz Compiling Regular Formalisms with Rule Features into Finite-State Automata , 1997, ACL.

[34]  Martin Kay,et al.  Regular Models of Phonological Rule Systems , 1994, CL.

[35]  Richard Sproat,et al.  Multilingual Text-to-Speech Synthesis: The Bell Labs Approach , 1998, CL.

[36]  Stephen G. Pulman,et al.  Compiling a Partition-Based Two-Level Formalism , 1996, COLING.

[37]  Kimmo Koskenniemi,et al.  Two-Level Morphology , 1983 .

[38]  Michael Hammond Templatic transfer in Arabic broken plurals , 1988 .

[39]  Lauri Karttunen,et al.  The Replace Operator , 1995, ACL.

[40]  Noam Chomsky,et al.  Morphophonemics of modern Hebrew , 1979 .

[41]  John J. McCarthy,et al.  Formal Problems in Semitic Phonology and Morphology , 2018 .

[42]  Richard Sproat Multilingual Text-to-Speech Synthesis , 1997 .

[43]  Alon Lavie,et al.  On the Applicability of Two Level Morphology to the Inflection of Hebrew Verbs , 1988 .

[44]  Douglas Pulleyblank Tone in Lexical Phonology , 1986 .

[45]  Markus Walther Computational nonlinear morphology with emphasis on semitic languages , 2002, Computational Linguistics.

[46]  E. Crook,et al.  Word Recognition , 2010 .

[47]  Alan S. Prince,et al.  Prosodic Morphology 1986 , 1996 .

[48]  Alan Prince,et al.  Foot and word in prosodic morphology: The Arabic broken plural , 1990 .

[49]  George Anton Kiraz,et al.  SEMHE: A Generalised Two-Level System , 1996, ACL.

[50]  Sabatino Moscati,et al.  An Introduction to the comparative grammar of the Semitic languages : phonology and morphology , 1964 .

[51]  George Anton Kiraz,et al.  Multi-tape Automata for Speech and Language Systems: A Prolog Implementation , 1997, Workshop on Implementing Automata.

[52]  Martin Kay,et al.  Nonconcatenative Finite-State Morphology , 1987, EACL.

[53]  Kimmo Koskenniemi,et al.  Finite-state description of Semitic morphology: a case study of ancient Accadian , 1988, COLING.