Government and Binding Theory for Hindi Language

ABSTRACT Government and Binding theory is used to analyze phase structure rules in Natural Language Processing. It is used to filter out grammatically incorrect sentence. Government and Biding (GB) theory is useful and well applicable in English language. This paper shows way to apply GB theory on Hindi Language. For Applying on Hindi, GB theory is modified little bit, but its flavor is intact. Using this, all components and attributes of Government and Binding can be easily described for Hindi language . General Terms Natural Language Processing, Government and Binding, Grammar and Parser. Keywords Lexicon, D-structure, F-structure, S-structure. Logical form, C-command, M-command.. 1. INTRODUCTION Government and binding is used in Natural Language Processing for syntax transformation. It has great expressive power and is easy to understand. GB theory helps to design computational model in natural language processing. It is developed principally by Noam Chomsky in the 1980s [1, 2, 3]. This theory is a subsequent modification and revision of Chomsky’s earlier theories. It was modified latter in 1950 by The Minimalist Program and several subsequent research papers. The latest revision was Three Factors in Language Design (2005) [4, 5, 6, 7, 8]. Government and Binding (GB) theory of Chomsky introduced in 1981 and improved in 1986. Government and Binding approach to linguistic theory. GB theory is based on Extended Standard Theory in transformational grammar. GB theory consist set of theories that interact Government and Binding being two of them. It gives emphasis on principles rather than rules. Each theory implies principles of rules and representations that are a subsystem of UG. GB theory operates always on syntactic structure, whatever level of language (d-structure, s-structure or LF) are they affecting. The interactions between the theories becomes complex. If we keep principle simple than interactions between simple principles may lead to complex properties. But we can understand why language is complex but we can easily learn. A sentence in the theory of Government and Binding is represented at following four levels: (1) D-structure, (2) S-structure, (3) Phonetic Structure and (4) Logical Form. We can represent it in the simplest form as follow: D-structure (DS) captures the argument structure of lexical categories: Verb, Noun, Pre (or Postposition and Adjective, present in the sentence. Logical Form (LF) comes nearest to the meaning representation of the sentence. in terms of quantifier scoping, anaphoric reference indexing etc. Quite often PS and SS are the same, but they could also be different as in “I wanna go" (PS) and “I want to go" (SS). Figure 1 shows D-structure and S-structure representation of lexicon. Facts about them are as follows:  D-structure combined all lexical items.  D -structure is mapped into S structure that syntactically represents surface order of sentence. S-structure is factorized into Logical Form and Phonological Form.  Phonological Form (PF) is directly concerned with phonology i.e. related to sounds of items .  Logical Form is the interface with the Semantics. Predication relationships and the scope of operators and quantifiers of various kinds are represented explicitly in the phrase structure at Logical Form.  Every word is of lexical category which is called head. Examples are noun, verb, adjective, preposition