Towards an open platform based on HPSG formalism for the standard Arabic language

The aim of this paper is to present an open software platform for analysing texts in standard Arabic language. The originality of this platform is that it is an integrated software environment which offers all the necessary resources and tools for parsing Arabic texts. For formalising the several elements of the language, the HPSG formalism has been adopted because of its effectiveness and its ability to be adapted to any natural language. Currently, the platform is operational with an appreciable coverage of many Arabic syntactic structures. In the medium-term, our objective is to use the platform for developing applications for the Arabic language such as interfaces, learning, information retrieval…etc.

[1]  Mohammad S. Khorsheed,et al.  Comparative evaluation of text classification techniques using a large diverse Arabic dataset , 2013, Language Resources and Evaluation.

[2]  Ann Copestake,et al.  Implementing typed feature structure grammars , 2001, CSLI lecture notes series.

[3]  José Luis Martínez-Fernández,et al.  A real time Named Entity Recognition system for Arabic text mining , 2011, Language Resources and Evaluation.

[4]  Mofleh Al-Diabat,et al.  Arabic Text Categorization Using Classification Rule Mining , 2012 .

[5]  Philip Resnik,et al.  Soft syntactic constraints for Arabic–English hierarchical phrase-based translation , 2011, Machine Translation.

[6]  Muhammad Abdul-Mageed,et al.  SAMAR: A System for Subjectivity and Sentiment Analysis of Arabic Social Media , 2012, WASSA@ACL.

[7]  Laurent Romary,et al.  A prototype for projecting HPSG syntactic lexica towards LMF , 2012, J. Lang. Technol. Comput. Linguistics.

[8]  Khaled Shaalan,et al.  Arabic Natural Language Processing: Challenges and Solutions , 2009, TALIP.

[9]  Aqil M. Azmi,et al.  A text summarizer for Arabic , 2012, Comput. Speech Lang..

[10]  Said Ouatik El Alaoui,et al.  Exploring term proximity statistic for Arabic information retrieval , 2014, 2014 Third IEEE International Colloquium in Information Science and Technology (CIST).

[11]  Jun'ichi Tsujii,et al.  Extremely Lexicalized Models for Accurate and Fast HPSG Parsing , 2006, EMNLP.

[12]  Kareem Darwish,et al.  Named Entity Recognition using Cross-lingual Resources: Arabic as an Example , 2013, ACL.

[13]  Jun'ichi Tsujii,et al.  Probabilistic Disambiguation Models for Wide-Coverage HPSG Parsing , 2005, ACL.

[14]  Fadi A. Thabtah,et al.  Arabic Text Mining Using Rule Based Classification , 2012, J. Inf. Knowl. Manag..

[15]  Amar Balla,et al.  PHARAS : Une plate-forme d’analyse basée sur le formalisme HPSG pour l’Arabe standard : Développements récents et perspectives , 2014 .

[16]  Stefan Müller,et al.  The Grammix CD-ROM A Software Collection for Developing Typed Feature Structure Grammars , 2007 .

[17]  Ahmad T. Al-Taani,et al.  A top-down chart parser for analyzing arabic sentences , 2012, Int. Arab J. Inf. Technol..

[18]  Michael Hahn,et al.  Arabic relativization patterns: A unified HPSG analysis , 2012, Proceedings of the International Conference on Head-Driven Phrase Structure Grammar.

[19]  Izzat Alsmadi,et al.  Opinion Mining and Analysis for Arabic Language , 2014 .