IMORPHĒ: An Inheritance and Equivalence Based Morphology Description Compiler

IMORPHĒ is a significantly extended version of MORPHĒ, a morphology description compiler. MORPHĒ’s morphology description language is based on two constructs: 1) a morphological form hierarchy, whose nodes relate and differentiate surface forms in terms of the common and distinguishing inflectional features of lexical items; and 2) transformational rules, attached to leaf nodes of the hierarchy, which generate the surface form of an item from the base form stored in the lexicon. While MORPHĒ’s approach to morphology description is intuitively appealing and was successfully used for generating the morphology of several European languages, its application to Modern Standard Arabic yielded morphological descriptions that were highly complex and redundant. Previous modifications and enhancements attempted to capture more elegantly and concisely different aspects of the complex morphology of Arabic, finding theoretical grounding in Lexeme-Based Morphology. Those extensions are being incorporated in a more flexible and less ad hoc fashion in IMORPHĒ, which retains the unique features of our previous work but embeds them in an inheritance-based framework in order to achieve even more concise and modular morphology descriptions and greater runtime efficiency, and lays the groundwork for IMORPHĒ to become an analyzer as well as a generator. 1 Work on EMORPHĒ and IMORPHĒ has been partially supported by grant OISE-0107369 from the National Science Foundation.

[1]  Ali Farghaly,et al.  Roots & patterns vs. stems plus grammar-lexis specifications: on what basis should a multilingual database centred on Arabic be built? , 2003, MTSUMMIT.

[2]  Martha Evens,et al.  Acquisition System for Arabic Noun Morphology , 2002, SEMITIC@ACL.

[3]  George Anton Kiraz,et al.  Multitiered nonlinear morphology using multitape finite automata: a case study on Syriac and Arabic , 2000, CL.

[4]  Nizar Habash,et al.  Large Scale Lexeme Based Arabic Morphological Generation , 2004 .

[5]  George Anton Kiraz,et al.  Arabic Computational Morphology in the West , 1998 .

[6]  Johannes Retti,et al.  5. Österreichische Artificial Intelligence-Tagung , 1989 .

[8]  Violetta Cavalli-Sforza,et al.  Arabic Computational Morphology: A Trade-off Between Multiple Operations and Multiple Stems , 2007 .

[9]  Raphael A. Finkel,et al.  Generating Hebrew Verb Morphology by Default Inheritance Hierarchies , 2002, SEMITIC@ACL.

[10]  A. Soudi,et al.  The arabic noun system generation , 2005 .

[11]  Kareem Darwish,et al.  Building a Shallow Arabic Morphological Analyser in One Day , 2002, SEMITIC@ACL.

[12]  Alon Itai,et al.  A corpus based morphological analyzer for unvocalized modern Hebrew , 2003, MTSUMMIT.

[13]  Kimmo Koskenniemi,et al.  A General Computational Model for Word-Form Recognition and Production , 1984 .

[14]  Teruko Mitamura,et al.  The KANT System: Fast, Accurate, High-Quality Translation in Practical Domains , 1992, COLING.

[15]  Hadj Ahmed Cherkaoui A Computational Lexeme-Based Treatment of Arabic Morphology , 2001 .

[16]  Teruko Mitamura,et al.  Arabic Morphology Generation Using a Concatenative Strategy , 2000, ANLP.

[17]  Wolfgang Finkler,et al.  MORPHIX A Fast Realization of a Classification-Based Approach to Morphology , 1988 .

[18]  Robert E. Beard Lexeme-morpheme base morphology : a general theory of inflection and word formation , 1996 .

[19]  Günter Neumann,et al.  Arabic Computational Morphology: Knowledge-based and Empirical Methods , 2007 .

[20]  M. Aronoff Morphology by Itself: Stems and Inflectional Classes , 1993 .

[21]  Antal van den Bosch,et al.  Memory-Based Morphological Analysis Generation and Part-of-Speech Tagging of Arabic , 2005, SEMITIC@ACL.

[22]  Kenneth R. Beesley Arabic Finite-State Morphological Analysis and Generation , 1996, COLING.

[23]  李幼升,et al.  Ph , 1989 .

[24]  Aaron Macks Parsing Akkadian Verbs with Prolog , 2002, SEMITIC@ACL.