This paper presents a comprehensive NLP system recently developed by Melingo for Arabic. It is based on Morfix™, an operational and highly successful comprehensive Hebrew NLP system developed earlier.

The system includes modules for morphological analysis, context-sensitive lemmatization, vocalization, text-to-phoneme conversion, and a prosody (intonation) model based on syntactic analysis. It is employed in applications such as full-text search, information retrieval, text categorization, textual data mining, online contextual dictionaries, filtering, and text-to-speech in the fields of telephony and accessibility. It could also serve as a handy aid for non-fluent speakers of Arabic or Hebrew.

Modern Hebrew and Modern Standard Arabic share some unique Semitic linguistic characteristics. Yet up to now, the two languages have been handled separately in Natural Language Processing circles, at both the academic and the applied levels. This paper reviews the major similarities and the minor dissimilarities between Modern Hebrew and Modern Standard Arabic from the NLP standpoint, and emphasizes the benefit of developing and maintaining a unified system for both languages.