Arabic Language Resources and Tools for Speech and Natural Language

Although 5% of the world population speak Arabic as a native language, the research on Arabic is not up to its number of speakers. However, more research has been don on Arabic in the last decade due to many factors including the widespread of communication and information technology applications. Two of the eminent research centres in the Arab World which contributed markedly on research related to Arabic are King Abdulaziz City for Science and Technology (KACST) and University of Balamand (UOB). This paper is to highlight the efforts of these two institutions on enriching the Arabic language resources including that on natural language processing, speech processing and character recognition.

[1]  Chafic Mokbel,et al.  MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic , 2008, LREC.

[2]  R. Bayeh,et al.  Broadcast News Transcription Baseline System using the NEMLAR database , 2006 .

[3]  Chafic Mokbel,et al.  On the use of morphological constraints in n-gram statistical language model , 2005, INTERSPEECH.

[4]  Mohammad S. Khorsheed,et al.  Off-Line Arabic Character Recognition – A Review , 2002, Pattern Analysis & Applications.

[5]  Yousef Ajami Alotaibi,et al.  Speech Recognition System of Arabic Digits based on A Telephony Arabic Corpus , 2008, IPCV.

[6]  Chafic Mokbel,et al.  Towards multilingual speech recognition using data driven source/target acoustical units association , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Jordan Cohen,et al.  The GALE project: A description and an update , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[8]  M. Abirached,et al.  Communication and information technologies at the International Office of Water (IOW) , 2004, Proceedings. 2004 International Conference on Information and Communication Technologies: From Theory to Applications, 2004..

[9]  Chafic Mokbel,et al.  Online adaptation of HMMs to real-life conditions: a unified framework , 2001, IEEE Trans. Speech Audio Process..

[10]  Chafic Mokbel,et al.  BECARS: a free software for speaker verification , 2004, Odyssey.

[11]  Chafic Mokbel,et al.  Building Annotated Written and Spoken Arabic LRs in NEMLAR Project , 2006, LREC.