A prototype for projecting HPSG syntactic lexica towards LMF

The comparative evaluation of Arabic HPSG grammar lexica requires a deep study of their linguistic coverage. The complexity of this task results mainly from the heterogeneity of the descriptive components within those lexica (underlying linguistic resources and different data categories, for example). It is therefore essential to define more homogeneous representations, which in turn will enable us to compare them and eventually merge them. In this context, we present a method for comparing HPSG lexica based on a rule system. This method is implemented within a prototype for the projection from Arabic HPSG to a normalised pivot language compliant with LMF (ISO 24613 - Lexical Markup Framework) and serialised using a TEI (Text Encoding Initiative) based representation. The design of this system is based on an initial study of the HPSG formalism looking at its adequacy for the representation of Arabic, and from this, we identify the appropriate feature structures corresponding to each Arabic lexical category and their possible LMF counterparts.

[1]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[2]  Marc Kemps-Snijders,et al.  Creating multimedia dictionaries of endangered languages using LEXUS , 2007, INTERSPEECH.

[3]  Andreas Witt,et al.  Representing human and machine dictionaries in Markup languages , 2009, ArXiv.

[4]  Bernd Kiefer,et al.  Compilation of HPSG to TAG , 1995, ACL.

[5]  Laurent Romary,et al.  An abstract model for the representation of multilingual terminological data: TMF - Terminological Markup Framework , 2001 .

[6]  Nancy Ide,et al.  Encoding dictionaries , 1995, Comput. Humanit..

[7]  Yusuke Miyao,et al.  Grammar conversion from LTAG to HPSG , 2002 .

[8]  Nicoletta Calzolari,et al.  Standardization in the Lexicon , 1999 .

[9]  Nancy Ide,et al.  A Registry of Standard Data Categories for Linguistic Annotation , 2004, LREC.

[10]  Nancy Ide,et al.  International Standard for a Linguistic Annotation Framework , 2003, Natural Language Engineering.

[11]  Walt Detmar Meurers,et al.  Head-driven phrase structure grammar: linguistic approach, formal foundations, and computational realization , 2006 .

[12]  Abdelmajid Ben Hamadou,et al.  Étude et analyse de la phrase nominale arabe en HPSG , 2006 .

[13]  Mathias Rossignol,et al.  A lexicon for Vietnamese language processing , 2007, Lang. Resour. Evaluation.

[14]  Christiane Fellbaum,et al.  From Resources to Applications. Designing the Multilingual ISLE Lexical Entry , 2002, LREC.

[15]  Gil Francopoulo,et al.  Standards going concrete : from LMF to Morphalou , 2004, COLING 2004.