Formal Basis of a Language Universal

Abstract Steedman (2020) proposes as a formal universal of natural language grammar that grammatical permutations of the kind that have given rise to transformational rules are limited to a class known to mathematicians and computer scientists as the “separable” permutations. This class of permutations is exactly the class that can be expressed in combinatory categorial grammars (CCGs). The excluded non-separable permutations do in fact seem to be absent in a number of studies of crosslinguistic variation in word order in nominal and verbal constructions. The number of permutations that are separable grows in the number n of lexical elements in the construction as the Large Schröder Number Sn−1. Because that number grows much more slowly than the n! number of all permutations, this generalization is also of considerable practical interest for computational applications such as parsing and machine translation. The present article examines the mathematical and computational origins of this restriction, and the reason it is exactly captured in CCG without the imposition of any further constraints.

[1]  Noam Chomsky Derivation by phase , 1999 .

[2]  Khalil Sima'an,et al.  Reordering Grammar Induction , 2015, EMNLP.

[3]  Edward P. Stabler,et al.  Computational Perspectives on Minimalism , 2011 .

[4]  Gwendolyn Hyslop,et al.  The Noun Phrase , 2017 .

[5]  Peter Young,et al.  Non-local scrambling: the equivalence of TAG and CCG revisited , 2008, TAG.

[6]  Noam Chomsky,et al.  Studies on Semantics in Generative Grammar , 1972 .

[7]  Aravind K. Joshi Complexity of Scrambling: A New Twist to the Competence - Performance Distinction , 1994, ArXiv.

[8]  Paola Merlo,et al.  Movement and structure effects on Universal 20 word order frequencies: A quantitative study , 2018 .

[9]  Alexander Koller,et al.  Dependency Trees and the Strong Generative Capacity of CCG , 2009, EACL.

[10]  Daniel Gildea,et al.  Factorization of Synchronous Context-Free Grammars in Linear Time , 2007, SSST@HLT-NAACL.

[11]  David R. Dowty Type Raising, Functional Composition, and Non-Constituent Conjunction , 1988 .

[12]  Ulrike Mosel Teop – an Oceanic language with multifunctional verbs, nouns and adjectives , 2017 .

[13]  Fredrik Meyer,et al.  Representation theory , 2015 .

[14]  Noam Chomsky,et al.  Remarks on Nominalization , 2020, Nominalization.

[15]  Daniel Gildea,et al.  Stochastic Lexicalized Inversion Transduction Grammar for Alignment , 2005, ACL.

[16]  Julian West,et al.  Generating trees and forbidden subsequences , 1996, Discret. Math..

[17]  Anna Szabolcsi THE POSSESSOR THAT RAN AWAY FROM HOME , 1983 .

[18]  Luigi Rizzi,et al.  The Cartography of Syntactic Structures , 2009 .

[19]  K. N. Dollman,et al.  - 1 , 1743 .

[20]  Mark Steedman,et al.  Interfaces and the grammar , 2005 .

[21]  David S. Gunderson Handbook of Mathematical Induction: Theory and Applications , 2010 .

[22]  Dekai Wu,et al.  A Polynomial-Time Algorithm for Statistical Machine Translation , 1996, ACL.

[23]  Martin Kay,et al.  Syntactic Process , 1979, ACL.

[24]  Paola Merlo Predicting Word Order Universals , 2015, J. Lang. Model..

[25]  Giorgio Satta,et al.  Lexicalization and Generative Power in CCG , 2015, CL.

[26]  Mirella Lapata,et al.  Language to Logical Form with Neural Attention , 2016, ACL.

[27]  Edward P. Stabler,et al.  Derivational Minimalism , 1996, LACL.

[28]  Stanley Peters,et al.  Cross-Serial Dependencies in Dutch , 1982 .

[29]  K. Abels The fundamental left–right asymmetry in the Germanic verb cluster , 2016 .

[30]  Giuliana Giusti,et al.  Parallels in clausal and nominal periphery , 2006 .

[31]  P. Smolensky,et al.  Learning biases predict a word order universal , 2012, Cognition.

[32]  Guglielmo Cinque,et al.  The Fundamental Left-Right Asymmetry of Natural Languages , 2009 .

[33]  Artemis Alexiadou,et al.  The syntax of adjectives , 2014 .

[34]  Mark Steedman,et al.  The surface-compositional semantics of English intonation , 2014 .

[35]  Gunter Senft,et al.  Kilivila : The Language of the Trobriand Islanders , 1986 .

[36]  Mark Steedman,et al.  Taking Scope - The Natural Semantics of Quantifiers , 2011 .

[37]  Peter J. Eccles An Introduction to Mathematical Reasoning: Index , 1997 .

[38]  Peter J. Eccles An Introduction to Mathematical Reasoning: Numbers, Sets and Functions , 1998 .

[39]  David Adger,et al.  A Syntax of Substance , 2012 .

[40]  MARK STEEDMAN,et al.  A formal universal of natural language grammar , 2020 .

[41]  Susi Wurmbrand,et al.  West Germanic verb clusters: The empirical domain , 2004 .

[42]  Aravind K. Joshi,et al.  Natural language parsing: Tree adjoining grammars: How much context-sensitivity is required to provide reasonable structural descriptions? , 1985 .

[43]  Bruce Dwayne Cain,et al.  Dhivehi (Maldivian) : a synchronic and diachronic study , 2000 .

[44]  Mats Rooth,et al.  Generalized Conjunction and Type Ambiguity , 2008 .

[45]  Prosenjit Bose,et al.  Pattern Matching for Permutations , 1993, WADS.

[46]  John A. Hawkins,et al.  Word order universals , 1983 .

[47]  Noam Chomsky,et al.  Bare Phrase Structure , 1994 .

[48]  John Thayer Jensen,et al.  Yapese Reference Grammar , 1977 .

[49]  Louis W. Shapiro,et al.  Bootstrap Percolation, the Schröder Numbers, and the N-Kings Problem , 1991, SIAM J. Discret. Math..

[50]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[51]  J. Culbertson,et al.  Language learners privilege structured meaning over surface frequency , 2014, Proceedings of the National Academy of Sciences.

[52]  G. Cinque Deriving Greenberg's Universal 20 and Its Exceptions , 2005, Linguistic Inquiry.

[53]  M. Dryer On the order of demonstrative, numeral, adjective, and noun , 2018 .

[54]  Pete Whitelock,et al.  Shake-and-Bake Translation , 1992, COLING.

[55]  M. D. MacLaren The Art of Computer Programming—Volume 1: Fundamental Algorithms (Donald E. Knuth) , 1969 .

[56]  Susi Wurmbrand,et al.  Verb Clusters, Verb Raising, and Restructuring , 2007 .

[57]  Aravind K. Joshi,et al.  Flexible Composition, Multiple Adjoining and Word Order Variation , 2008, TAG.

[58]  Mark Steedman,et al.  Dependency and Coordination in the Grammar of Dutch and English , 1985 .

[59]  Noam Chomsky,et al.  Minimalist inquiries : the framework , 1998 .

[60]  Abdoulaye Laziz Nchare The Grammar of Shupamem , 2012 .

[61]  Jason Eisner Efficient Normal-Form Parsing for Combinatory Categorial Grammar , 1996, ACL.

[62]  Shalom Lappin,et al.  Unsupervised Learning and Grammar Induction , 2010 .

[63]  Jason Baldridge,et al.  Multi-Modal Combinatory Categorial Grammar , 2003, EACL.

[64]  D. P. Medeiros ULTRA: Universal Grammar as a Universal Parser , 2018, Front. Psychol..

[65]  A. Neeleman,et al.  Linear Asymmetries and the LCA , 2012 .

[66]  J. Zwart The Minimalist Program , 1998, Journal of Linguistics.

[67]  Yonatan Bisk,et al.  Normal-form parsing for Combinatory Categorial Grammars with generalized composition and type-raising , 2010, COLING.

[68]  Guglielmo Cinque,et al.  The Syntax of Adjectives: A Comparative Study , 2010 .

[69]  Martin W. Liebeck A Concise Introduction to Pure Mathematics , 2000 .