Rapid Development of Morphological Descriptions for Full Language Processing Systems

I describe a compiler and development environment for feature-augmented two-level morphology rules integrated into a full NLP system. The compiler is optimized for a class of languages including many or most European ones, and for rapid development and debugging of descriptions of new languages. The key design decision is to compose morphophonological and morphosyntactic information, but not the lexicon, when compiling the description. This results in typical compilation times of about a minute, and has allowed a reasonably full, feature-based description of French inflectional morphology to be developed in about a month by a linguist new to the system.

[1]  Harald Trost The application of two-level morphology to non-concatenative German morphology , 1990, COLING.

[2]  Martin Kay,et al.  Regular Models of Phonological Rule Systems , 1994, CL.

[3]  Lauri Karttunen,et al.  Two-Level Morphology with Composition , 1992, COLING.

[4]  Harvey Abramson A Logic Programming View of Relational Morphology , 1992, COLING.

[5]  Harald Trost,et al.  Morphology with a Null-Interface , 1994, COLING.

[6]  Tanya Bowden Cooperative Error Handling and Shallow Processing , 1995, EACL.

[7]  Kemal Oflazer,et al.  Two-level Description of Turkish Morphology , 1993, EACL.

[8]  Kimmo Koskenniemi,et al.  A General Computational Model for Word-Form Recognition and Production , 1984 .

[9]  Lauri Karttunen,et al.  Incremental Construction of a Lexical Transducer for Korean , 1994, COLING.

[10]  Harald Trost X2MORF: A Morphological Component Based on Augmented Two-Level Morphology , 1991, IJCAI.

[11]  George Anton Kiraz Multi-Tape Two-Level Morphology: A Case Study in Semitic Non-linear Morphology , 1994, COLING.

[12]  Chris Mellish Implementing Systemic Classification by Unification , 1988, Comput. Linguistics.

[13]  David M. Carter Lattice-Based Word Identification in CLARE , 1992, ACL.

[14]  H. Alshawi,et al.  The Core Language Engine , 1994 .

[15]  Kemal Oflazer,et al.  Spelling Correction in Agglutinative Languages , 1994, ANLP.

[16]  Martin Kay,et al.  Nonconcatenative Finite-State Morphology , 1987, EACL.