Computational analysis of the syntax and interpretation of "free" word order in Turkish

In this dissertation, I examine a language with "free" word order, specifically Turkish, in order to develop a formalism that can capture the syntax and the context-dependent interpretation of "free" word order within a computational framework. In "free" word order languages, word order is used to convey distinctions in meaning that are not captured by traditional truth-conditional semantics. The word order indicates the "information structure", e.g. what is the "topic" and the "focus" of the sentence. The context-appropriate use of "free" word order is of considerable importance in developing practical applications in natural language interpretation, generation, and machine translation. I develop a formalism called Multiset-CCG, an extension of Combinatory Categorial Grammars, CCGs, (Ades/Steedman 1982, Steedman 1985), and demonstrate its advantages in an implementation of a data-base query system that interprets Turkish questions and generates answers with contextually appropriate word orders. Multiset-CCG is a context-sensitive and polynomially parsable grammar that captures the formal and descriptive properties of "free" word order and restrictions on word order in simple and complex sentences (with discontinuous constituents and long distance dependencies). Multiset-CCG captures the context-dependent meaning of word order in Turkish by compositionally deriving the predicate-argument structure and the information structure of a sentence in parallel. The advantages of using such a formalism are that it is computationally attractive and that it provides a compositional and flexible surface structure that allows syntactic constituents to correspond to information structure constituents. A formalism that integrates information structure and syntax such as Multiset-CCG is essential to the computational tasks of interpreting and generating sentences with contextually appropriate word orders in "free" word order languages.

[1]  K. Sakuma The structure of the Japanese language , 1951 .

[2]  Y. Bar-Hillel A Quasi-Arithmetical Notation for Syntactic Description , 1953 .

[3]  J. Lambek The Mathematics of Sentence Structure , 1958 .

[4]  Sheila A. Greibach,et al.  A New Normal-Form Theorem for Context-Free Phrase Structure Grammars , 1965, JACM.

[5]  Daniel H. Younger,et al.  Recognition and Parsing of Context-Free Languages in Time n^3 , 1967, Inf. Control..

[6]  John Robert Ross,et al.  Constraints on variables in syntax , 1967 .

[7]  D. Crystal,et al.  Intonation and Grammar in British English , 1967 .

[8]  R. J. Nelson,et al.  Introduction to Automata , 1968 .

[9]  Alfred V. Aho,et al.  Indexed Grammars—An Extension of Context-Free Grammars , 1967, SWAT.

[10]  Alfred V. Aho,et al.  Properties of Syntax Directed Translations , 1969, J. Comput. Syst. Sci..

[11]  Noam Chomsky,et al.  Deep structure, surface structure, and semantic interpretation , 1969 .

[12]  Alfred V. Aho,et al.  Syntax Directed Translations and the Pushdown Assembler , 1969, J. Comput. Syst. Sci..

[13]  P. Stanley Peters,et al.  On the generative power of transformational grammars , 1973, Inf. Sci..

[14]  P. Sgall,et al.  Topic, focus and generative semantics , 1973 .

[15]  Aravind K. Joshi,et al.  Tree Adjunct Grammars , 1975, J. Comput. Syst. Sci..

[16]  M. Rochemont A theory of stylistic rules in English , 1978 .

[17]  Judith Aissen The syntax of causative constructions , 1979 .

[18]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[19]  SHALOM LAPPIN,et al.  DOMINANCE AND THE FUNCTIONAL EXPLANATION OF ISLAND PHENOMENA , 1979 .

[20]  David R. Dowty,et al.  Introduction to Montague semantics , 1980 .

[21]  E. Prince Topicalization, Focus-Movement, and Yiddish-Movement: A Pragmatic Differentiation , 1981 .

[22]  Ellen F. Prince,et al.  Toward a taxonomy of given-new information , 1981 .

[23]  Noam Chomsky,et al.  Lectures on Government and Binding , 1981 .

[24]  T. Reinhart Pragmatics and Linguistics: an analysis of Sentence Topics , 1981, Philosophica.

[25]  Mark Steedman,et al.  On the order of words , 1982 .

[26]  D. Slobin,et al.  Children use canonical sentence schemas: A crosslinguistic study of word order and inflections , 1982, Cognition.

[27]  David R. Dowty Grammatical Relations and Montague Grammar , 1982 .

[28]  Scott Weinstein,et al.  Providing a Unified Account of Definite Noun Phrases in Discourse , 1983, ACL.

[29]  Feride Erkü Discourse pragmatics and word order in Turkish , 1983 .

[30]  C. Pollard,et al.  Center for the Study of Language and Information , 2022 .

[31]  Elisabeth Selkirk,et al.  Phonology and Syntax: The Relation between Sound and Structure , 1984 .

[32]  Eser Emine Erguvanlı The Function of Word Order in Turkish Grammar , 1984 .

[33]  J. Kornfilt Case marking, agreement, and empty categories in Turkish , 1984 .

[34]  Mats Rooth Association with focus , 1985 .

[35]  Julia Horvath,et al.  FOCUS in the Theory of Grammar and the Syntax of Hungarian , 1986 .

[36]  Stuart M. Shieber,et al.  Evidence against the context-freeness of natural language , 1985 .

[37]  C Snow,et al.  Child language data exchange system , 1984, Journal of Child Language.

[38]  Hans den Besten,et al.  The Ergative Hypothesis and Free Word Order in Dutch and German , 1985 .

[39]  Mark Steedman,et al.  Dependency and Coordination in the Grammar of Dutch and English , 1985 .

[40]  Jeanette K. Gundel ‘Shared knowledge’ and topicality , 1985 .

[41]  Geoffrey K. Pullum,et al.  Generalized Phrase Structure Grammar , 1985 .

[42]  Laura E. Knecht Subject and object in Turkish , 1985 .

[43]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[44]  Engin Sezer,et al.  The Unmarked Sentential Subject Constraint in Turkish , 1986 .

[45]  Kent Barrows Wittenburg,et al.  Natural language parsing with combinatory categorial grammar in a graph-unification-based formalism , 1986 .

[46]  Anthony S. Kroch,et al.  Analyzing extraposition in a Tree Adjoining Gram-mar , 1987 .

[47]  Müserref Dede Definiteness and Referentiality in Turkish Verbal Sentences , 1986 .

[48]  Stuart M. Shieber,et al.  An Introduction to Unification-Based Approaches to Grammar , 1986, CSLI Lecture Notes.

[49]  Anna Szabolcsi,et al.  Bound variables in syntax (Are there any , 1987 .

[50]  Mark Steedman,et al.  Combinatory grammars and parasitic gaps , 1987 .

[51]  Ivan A. Sag,et al.  Information-Based Syntax and Semantics: Volume 1, Fundamentals , 1987 .

[52]  Carl Pollard,et al.  A Centering Approach to Pronouns , 1987, ACL.

[53]  郡司 隆男 Japanese phrase structure grammar : a unification-based approach , 1987 .

[54]  Ivan A. Sag,et al.  Information-based syntax and semantics , 1987 .

[55]  Caroline Heycock,et al.  The structure of the Japanese causative , 1987 .

[56]  Hans Uszkoreit,et al.  Word Order and Constituent Structure in German , 1987, CSLI Lecture Notes.

[57]  Eser Erguvanlı-Taylan THE ROLE OF SEMANTIC FEATURES IN TURKISH WORD ORDER , 1987 .

[58]  Mark Johnson,et al.  Attribute-value logic and the theory of grammar , 1988 .

[59]  Johan van Benthem,et al.  The Lambek Calculus , 1988 .

[60]  Mark C. Baker,et al.  Incorporation: A Theory of Grammatical Function Changing , 1988 .

[61]  David R. Dowty Type Raising, Functional Composition, and Non-Constituent Conjunction , 1988 .

[62]  Jürgen Wedekind Generation as structure driven derivation , 1988, COLING.

[63]  David J. Weir,et al.  Combinatory Categorial Grammars: Generative Power and Relationship to Linear Context-Free Rewriting Systems , 1988, ACL.

[64]  David J. Weir,et al.  Characterizing mildly context-sensitive grammar formalisms , 1988 .

[65]  Emmon Bach,et al.  Categorial Grammars as Theories of Language , 1988 .

[66]  Gerald Gazdar,et al.  Applicability of Indexed Grammars to Natural Languages , 1988 .

[67]  M. Moortgat Categorial Investigations: Logical and Linguistic Aspects of the Lambek Calculus , 1988 .

[68]  Gertjan van Noord,et al.  A Semantic-Head-Driven Generation Algorithm for Unification-Based Formalisms , 1989, ACL.

[69]  Gert Webelhuth,et al.  Syntactic saturation phenomena and the modern Germanic languages , 1989 .

[70]  P. Culicover,et al.  English Focus Constructions and the Theory of Grammar , 1990 .

[71]  Anoop Mahajan,et al.  The A/A-bar distinction and movement theory , 1990 .

[72]  Pauline Jacobson,et al.  Raising as function composition , 1990 .

[73]  G. Grewendorf,et al.  Scrambling and barriers , 1990 .

[74]  Megumi Kameyama,et al.  Zero anaphora: The case of Japanese , 1990 .

[75]  Jeanette K. Gundel,et al.  Givenness, implicature and the form of referring expressions in discourse , 1990 .

[76]  Enric Vallduví,et al.  The Informational Component , 1990 .

[77]  Gertjan van Noord,et al.  Semantic-Head-Driven Generation , 1990, Comput. Linguistics.

[78]  Gertjan van Noord An overview of head-driven bottom-up generation , 1990 .

[79]  Noam Chomsky,et al.  Filters and Control , 1990 .

[80]  Aravind K. Joshi,et al.  Long-Distance Scrambling and Tree Adjoining Grammars , 1991, EACL.

[81]  Jacob Hoeksema,et al.  Complex predicates and liberation in Dutch and English , 1991 .

[82]  B. Partee Topic, Focus and Quantification , 1991 .

[83]  Manfred Krifka,et al.  A Compositional Semantics for Multiple Focus Constructions , 1991 .

[84]  J. Berstel,et al.  Context-free languages , 1993, SIGA.

[85]  Mark Steedman Structure and Intonation , 1991 .

[86]  E. Koktová The meaning of the sentence in its semantic and pragmatic aspects , 1991 .

[87]  David R. Dowty To ward a Minimalist Theory of Syntactic Structure , 1992 .

[88]  M. Kural Scrambling and Mixed Positions in Turkish , 1992 .

[89]  Robert Dale Generating referring expressions - constructing descriptions in a domain of objects and processes , 1992, ACL-MIT press series in natural language processing.

[90]  Noam Chomsky A minimalist program for linguistic theory , 1992 .

[91]  Tadao Maruta English Focus Constructions and the Theory of Grammar , by Michael S. Rochemont and Peter W. Culicover, Cambridge University Press, Cambridge, 1990, vii+210 pp , 1992 .

[92]  Beryl Hoffman A CCG Approach to Free Word Order Languages , 1992, ACL.

[93]  E. Prince The ZPG Letter: Subjects, Definiteness, and Information-status , 1992 .

[94]  Mark Steedman,et al.  Generating Contextually Appropriate Intonation , 1993, EACL.

[95]  Beryl Hoffman,et al.  The Formal Consequences of using Variables in CCG Categories , 1993, ACL.

[96]  David J. Weir,et al.  Parsing Some Constrained Grammar Formalisms , 1993, Comput. Linguistics.

[97]  Kemal Oflazer,et al.  Two-level Description of Turkish Morphology , 1993, EACL.

[98]  Gosse Bouma,et al.  Constraint-Based Categorial Grammar , 1994 .

[99]  Marilyn A. Walker,et al.  Japanese Discourse and the Process of Centering , 1994, Comput. Linguistics.

[100]  Gertjan van Noord,et al.  Adjuncts and the Processing of Lexical Rules , 1994, COLING.

[101]  Beatrice Santorini,et al.  Towards resolving Webelhuth’s paradox: evidence from German and Korean , 1994 .

[102]  Ralf Steinberger Treating 'Free Word Order' in Machine Translation , 1994, COLING.

[103]  S. Shieber RESTRICTING THE WEAK‐GENERATIVE CAPACITY OF SYNCHRONOUS TREE‐ADJOINING GRAMMARS , 1994, Comput. Intell..

[104]  Owen C. Ranbow,et al.  Formal and computational aspects of natural language syntax , 1994 .

[105]  Josef Bayer,et al.  Against scrambling as an instance of Move-alpha , 1994 .

[106]  Tracy Holloway King Configuring topic and focus in Russian , 1995 .

[107]  Beryl Hoffman,et al.  Integrating “Free” Word Order Syntax and Information Structure , 1995, EACL.

[108]  U. Turan Null vs. Overt Subjects in Turkish Discourse: A Centering Analysis , 1995 .

[109]  E. Prince Word Order, Information Structure, and Centering in Turkish , 1997 .