HaG — A Computational Grammar ofHausa

In this paper, I shall give an overview of HaG (=Hausa Grammar), an emerging computational grammar of Hausa1, developed within the framework of Head-driven Phrase Structure (Pollard and Sag, 1987, 1994; Sag, 1997). Since HPSG is an integrated theory of syntax and semantics, meaning representations are built up in tandem with syntactic analysis. Semantics in HaG are represented using Minimal Recursion Semantics (MRS; Copestake, Flickinger, Pollard and Sag, 2005), essentially providing predicate-argument structures with an unspecified representation of (quantifier) scope. The grammar described in this paper runs on top of the Lingo Linguistic Knowledge Builder (LKB; Copestake, 2002), a platform for typed feature structure grammars originally developed at CSLI, Stanford. The LKB system not only provides a bottom-up chart parser (Oepen and Carroll, 2000), but also a chart generator (Carroll, Copestake, Flickinger and Poznanski, 1999; Carroll and Oepen, 2000). Consequently, HaG was designed from the start as a reversible grammar, i.e. a grammar that is suitable for both analysis and synthesis. In addition to the development platform LKB, HaG can also be run using the efficient C++ parser Pet (Callmeier, 2000), and since autumn 2011, on the reversible ace parser/generator developed in C by Woodley Packard (http://sweaglesw.org/linguistics/ace/). The grammar, as well as all the development and run-time systems, are available under free and open source licenses. As an alternative to a full-fledged install of the grammar and development systems, we provide a web demo (http://hag.delph-in.net) which provides a concise interface to the grammar, displaying full semantic representations, as well as the constituent structure. Development of the grammar started in 2009, based on the LinGO grammar matrix (Bender, Flickinger and Oepen, 2002), a core system of basic types extracted from the English Resource Grammar (ERG; Copestake and Flickinger, 2000) which ensures basic compatibility of semantic representations between LKB grammars using MRS. Implementation of a formal grammar of Hausa is motivated by two major goals: first, the availability of an implemented grammar will contribute a reusable linguistic resource for a computationally underresourced language. Second, the implementation of a competence grammar based on a linguistically motivated formalism such as HPSG will provide testable models of linguistic theories. Since HaG is the first implemented grammar of a tone language that systematically integrates suprasegmental phonology, we hope to also further our understanding of the computational treatment of (African) tone languages. Current development of HaG focuses on the implementation of syntactic constructions and the system of morpho-syntactic rules. This decision is deliberate, since we plan to expand the lexicon using grammar-based machine learning techniques (Zhang and Kordoni, 2006). As a result, the grammar already covers a substantial part of Hausa core grammar, despite the comparatively small lexicon. In this paper, I shall provide an overview of the main constructions of the language as implemented in HaG. Following a detailed overview of the central issues concerning the treatment of tone and length (section 1), I shall briefly discuss the implementation of inflectional morphology (section 2). Section 3 will be devoted to morpho-syntax, including direct object marking, mixed categories, and pronominal

[1]  Ivan A. Sag,et al.  Information-Based Syntax and Semantics: Volume 1, Fundamentals , 1987 .

[2]  Larry M. Hyman,et al.  How autosegmental is phonology? , 2013 .

[3]  Emily M. Bender,et al.  The Grammar Matrix: An Open-Source Starter-Kit for the Rapid Development of Cross-linguistically Consistent Broad-Coverage Precision Grammars , 2002, COLING 2002.

[4]  Hans-Ulrich Krieger,et al.  TDL-A Type Description Language for Constraint-Based Grammars , 1994, COLING.

[5]  Berthold Crysmann Resumption and Island-Hood in Hausa , 2010, FG.

[6]  B. Hayes Precompiled Phrasal Phonology , 2022 .

[7]  Stephan Oepen,et al.  High Efficiency Realization for a Wide-Coverage Unification Grammar , 2005, IJCNLP.

[8]  Stephan Oepen,et al.  Maximum Entropy Models for Realization Ranking , 2005 .

[9]  John Carroll,et al.  An Efficient Chart Generator for (Semi-)Lexicalist Grammars , 2001 .

[10]  Yi Zhang,et al.  Automated Deep Lexical Acquisition for Robust Open Texts Processing , 2006, LREC.

[11]  G. Pullum,et al.  CLITICIZATION VS. INFLECTION: ENGLISH N'T , 1983 .

[12]  Ulrich Callmeier,et al.  PET – a platform for experimentation with efficient HPSG processing techniques , 2000, Natural Language Engineering.

[13]  On the categorial status of Hausa genitive prepositions , 2012 .

[14]  Paul Newman,et al.  Hausa Language , 2000 .

[15]  Dan Flickinger,et al.  An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG , 2000, LREC.

[16]  Paul Newman,et al.  Modern Hausa-English Dictionary , 1981 .

[17]  Laurice Anne Tuller,et al.  Bijective relations in universal grammar and the syntax of Hausa , 1986 .

[18]  Stephan Oepen,et al.  Ambiguity Packing in Constraint-based Parsing Practical Results , 2000, ANLP.

[19]  Jan Tore Lønning,et al.  Som å kapp-ete med trollet? – Towards MRS-based Norwegian-English machine translation , 2004, TMI.

[20]  A. Zwicky CLITICS AND PARTICLES , 1985 .

[21]  Jonathan Ginzburg,et al.  Interrogative Investigations: The Form, Meaning, and Use of English Interrogatives , 2001 .

[22]  Mahamane Laoualy Abdoulaye,et al.  Aspects of Hausa morphosyntax in role and reference grammar , 1992 .

[23]  Siegmund Brauner,et al.  Lehrbuch der Hausa-Sprache , 1965 .

[24]  Philip J. Jaggar,et al.  Ex-situ and in-situ focus in Hausa , 2003 .

[25]  Stephan Oepen,et al.  Parse Disambiguation for a Rich HPSG Grammar , 2002 .

[26]  Bob Carpenter,et al.  The logic of typed feature structures , 1992 .

[27]  Edit Doron,et al.  On the syntax and semantics of resumptive pronouns , 2011 .

[28]  Berthold Crysmann,et al.  Hybrid Processing for Grammar and Style Checking , 2008, COLING.

[29]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[30]  Katharina Hartmann,et al.  In Place – Out of Place : Focus in Hausa , 2004 .

[31]  Berthold Crysmann,et al.  Some Fine Points of Hybrid Natural Language Parsing , 2008, LREC.

[32]  Gilles-Maurice de Schryver,et al.  Automatic Diacritic Restoration for Resource-Scarce Languages , 2007, TSD.

[33]  Berthold Crysmann,et al.  A Unified Account of Hausa Genitive Constructions , 2009, FG.

[34]  Aladarn Tsarin,et al.  HAUSA , 2005, Cheers!.

[35]  I. Sag English relative clause constructions , 1997, Journal of Linguistics.

[36]  Berthold Crysmann Autosegmental representations in an HPSG of Hausa , 2009 .

[37]  Ann Copestake,et al.  Implementing typed feature structure grammars , 2001, CSLI lecture notes series.

[38]  P. Newman The Hausa Language. An Encyclopedic Reference Grammar , 2002 .

[39]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[40]  Berthold Crysmann,et al.  An inflectional approach to Hausa Final Vowel Shortening , 2005 .