Robust, applied morphological generation

In practical natural language generation systems it is often advantageous to have a separate component that deals purely with morphological processing. We present such a component: a fast and robust morphological generator for English based on finite-state techniques that generates a word form given a specification of the lemma, part-of-speech, and the type of inflection required. We describe how this morphological generator is used in a prototype system for automatic simplification of English newspaper text, and discuss practical morphological and orthographic issues we have encountered in generation of unrestricted text within this application.

[1]  Mehryar Mohri,et al.  On some applications of finite-state automata theory to natural language processing , 1996, Nat. Lang. Eng..

[2]  Christian Matthiessen Systemic Grammar In Computation: The Nigel Case , 1983, EACL.

[3]  Gregory Grefenstette,et al.  Regular expressions for language engineering , 1996, Natural Language Engineering.

[4]  Hamish Cunningham,et al.  GATE-a General Architecture for Text Engineering , 1996, COLING.

[5]  Geoffrey Sampson English for the computer , 1995 .

[6]  M ShieberStuart,et al.  Semantic-head-driven generation , 1990 .

[7]  Richard Power,et al.  What You See Is What You Meant: direct knowledge editing with natural language feedback , 1998, ECAI.

[8]  Kimmo Koskenniemi,et al.  Two-Level Model for Morphological Analysis , 1983, IJCAI.

[9]  Geoffrey K. Pullum,et al.  Licensing of prosodic features by syntactic rules: the key to auxiliary reduction , 1997 .

[10]  H. Alshawi,et al.  The Core Language Engine , 1994 .

[11]  Lauri Karttunen Constructing Lexical Transducers , 1994, COLING.

[12]  Paul Procter,et al.  Cambridge international dictionary of English , 2000 .

[13]  Lynne J. Cahill Morphonology in the Lexicon , 1993, EACL.

[14]  Michael Elhadad,et al.  An Overview of SURGE: a Reusable Comprehensive Syntactic Realization Component , 1996, INLG.

[15]  John R. Levine,et al.  Lex & yacc, 2nd edition , 1992 .

[16]  Alfred V. Aho,et al.  Compilers: Principles, Techniques, and Tools , 1986, Addison-Wesley series in computer science / World student series edition.

[17]  C. Chapelle The Computational Analysis of English—A Corpus‐Based Approach , 1988 .

[18]  J. Tiedemann Book Reviews: Linguistic Databases , 1999, CL.

[19]  Gregory P. Knowles,et al.  Manual of information to accompany the SEC corpus , 1988 .

[20]  Gertjan van Noord,et al.  Semantic-Head-Driven Generation , 1990, Comput. Linguistics.

[21]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[22]  Gerald Gazdar,et al.  DATR: A Language for Lexical Knowledge Representation , 1996, CL.

[23]  R. H. Baayen,et al.  The CELEX Lexical Database (CD-ROM) , 1996 .