Generating Arabic text : The Decoding Component in an Interlingual System for Man-Machine Communication in Natural Language

This paper describes the decoding part in an interlingual system for man-machine communication in natural language. It is based on the Universal Networking Language (UNL) framework. Given a semantic network that represents a relation between a number of concepts, this network can be decoded (or ‘DeConverted’ in UNL technical terms) back to any natural language. This depends on the existence of a dictionary and a grammar for the language to which this network is to be decoded. The role of the dictionary is to find the word in which a given concept is to be expressed. The role of the grammar is to arrange the nodes or the concepts of the network in a way that produces a syntactically well-formed sentence in the target language. The paper addresses the overall technical structure of both the grammar and the dictionary used in the DeConvesion process of the UNL network to generate Arabic text. In addition it will describe different challenges faced in making this mission feasible.

[1]  Iman Saleh,et al.  DAR: A Digital Assets Repository for Library Collections , 2005, ECDL.

[2]  Doug Arnold,et al.  Machine Translation: An Introductory Guide , 1994 .

[3]  Eldakar Youssef,et al.  The Million Book Project at Bibliotheca Alexandrina , 2005 .

[4]  Roland R. Hausser Foundations of Computational Linguistics: Man-Machine Communication in Natural Language , 1999 .

[5]  Sameh Al-Ansary Building a Computational Lexicon for Arabic , 2005 .

[6]  Chris Mellish,et al.  Current research in natural language generation , 1990 .

[7]  Karsten Konrad,et al.  Model Generation for Natural Language Interpretation and Analysis , 2004, Lecture Notes in Computer Science.

[8]  Paul Piwek,et al.  Natural Language Generation , 2004, Lecture Notes in Computer Science.

[9]  Bonnie J. Dorr,et al.  Machine Translation: A View from the Lexicon , 1994, CL.

[10]  Sergei Nirenburg,et al.  Readings in Machine Translation , 2003 .

[11]  Michael Zock,et al.  Trends in Natural Language Generation An Artificial Intelligence Perspective , 1996, Lecture Notes in Computer Science.

[12]  Noha Adly,et al.  A Framework for the Encoding of Multilayered Documents , 2007, 2006 1st International Conference on Digital Information Management.

[13]  J. V. Rauff,et al.  Finite State Morphology , 2007 .

[14]  Ehud Reiter,et al.  Book Reviews: Building Natural Language Generation Systems , 2000, CL.

[15]  Markus Walther Computational nonlinear morphology with emphasis on semitic languages , 2002, Computational Linguistics.

[16]  Michael Zock,et al.  Advances in natural language generation : an interdisciplinary perspective , 1988 .

[17]  MellishChris,et al.  A Reference Architecture for Natural Language Generation Systems , 2006 .

[18]  H. Uchida,et al.  The Universal Networking Language beyond Machine Translation , 2001 .

[19]  Kimmo Koskenniemi,et al.  A General Computational Model for Word-Form Recognition and Production , 1984 .