Productivity and Reuse in Language

A much-celebrated aspect of language is the way in which it allows us to make "infinite use of finite means" (von Humboldt, 1836). This property is made possible because language is fundamentally a productive computational system: Novel expressions can be composed from a large inventory of stored, reusable parts. For any given language, however, there are many more potential ways of forming novel expressions than can actually be used in practice. For example, English contains suffixes that are highly productive (e.g., -ness ; Lady-Gagaesqueness, pine-scentedness), but also contains suffixes which can only be reused in specific, existing words (e.g., -th; truth, width, warmth). How are such differences in productivity and reusability represented? How can the child acquire this system of knowledge? This thesis presents a formal model of productivity and reuse which treats the problem as a structure-by-structure inference in a Bayesian framework. The model—Fragment Grammars, a generalization of Adaptor Grammars (Johnson et al., 2007a)—is built around two proposals. The first is that anything that can be computed can be stored. The specific computational mechanism by which this is accomplished, stochastic memoization, is inherited from Adaptor Grammars (Goodman et al., 2008; Johnson et al., 2007a). The second proposal is that any stored item can include subparts which must be computed productively. This is made possible by the computational mechanism of stochastically lazy evaluation, introduced in the thesis. Throughout the thesis, Fragment Grammars are systematically compared to four other probabilistic models of productivity and reuse which formalize historical proposals from the linguistics and psycholinguistics literatures. The five models are evaluated on two very different sub-systems of English morphology: the English past tense, which is characterized by a sharp dichotomy in productivity between regular (i.e., +ed) and irregular (e.g., sing/sang) forms, and English derivational morphology, which is characterized by a graded cline from very productive (e.g., -ness) to very unproductive (e.g., -th). The thesis examines many aspects of these two domains including: performance on past-tense inflection, past-tense processing phenomena, developmental overregularization, the productivity of derivational suffixes, ordering restrictions on derivational suffixes, and base-driven selectional restrictions on suffix combinations.

[1]  Noam Chomsky,et al.  Lectures on Government and Binding , 1981 .

[2]  Stuart M. Shieber,et al.  Evidence against the context-freeness of natural language , 1985 .

[3]  Donald G. MacKay,et al.  Derivational rules and the internal lexicon , 1978 .

[4]  Ingo Plag,et al.  What Constrains Possible Suffix Combinations? On the Interaction of Grammatical and Processing Restrictions in Derivational Morphology , 2004 .

[5]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[6]  Alec Marantz,et al.  No escape from syntax: Don't try morphological analysis in the privacy of your own lexicon , 1997 .

[7]  J. Goldsmith,et al.  The handbook of phonological theory , 2011 .

[8]  M T Ullman,et al.  The Declarative/Procedural Model of Lexicon and Grammar , 2001, Journal of psycholinguistic research.

[9]  R. Baayen,et al.  Regular morphologically complex neologisms leave detectable traces in the mental lexicon , 2007 .

[10]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Juan Segui,et al.  Morphological processing in word recognition:a review with particular reference to spanish data , 2000 .

[12]  Ulrike Hahn,et al.  Where Defaults Don't Help: the Case of the German Plural System , 1996, ArXiv.

[13]  R. Baayen,et al.  Shifting paradigms: gradient structure in morphology , 2005, Trends in Cognitive Sciences.

[14]  Alexander Clark,et al.  Efficient, Correct, Unsupervised Learning for Context-Sensitive Languages , 2010, CoNLL.

[15]  Charles D. Yang,et al.  Knowledge and learning in natural language , 2000 .

[16]  S. Kuczaj The acquisition of regular and irregular past tense forms , 1977 .

[17]  W. Marslen-Wilson,et al.  Morphology and meaning in the English mental lexicon. , 1994 .

[18]  Mark Johnson,et al.  PCFG Models of Linguistic Tree Representations , 1998, CL.

[19]  Philip H. Miller,et al.  Strong generative capacity - the semantics of linguistic formalism , 2000, CSLI lecture notes series.

[20]  Partha Niyogi,et al.  Optimizing the mutual intelligibility of linguistic agents in a shared world , 2004, Artif. Intell..

[21]  R. Higginson Fixing : assimilation in language acquisition , 1985 .

[22]  H. Clahsen,et al.  Lexical entries and rules of language: A multidisciplinary study of German inflection , 1999, Behavioral and Brain Sciences.

[23]  Charles Yang,et al.  Three factors in language variation , 2010 .

[24]  R. F. Stanners,et al.  Memory representation for morphologically related words. , 1979 .

[25]  R. Harald Baayen,et al.  Word Frequency Distributions , 2001 .

[26]  P. Niyogi,et al.  Computational and evolutionary aspects of language , 2002, Nature.

[27]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[28]  Rens Bod An efficient implementation of a new DOP model , 2003, EACL.

[29]  Emil L. Post Formal Reductions of the General Combinatorial Decision Problem , 1943 .

[30]  Robert Schreuder,et al.  Constraining psycholinguistic models of morphological processing and representation: The role of productivity , 1992 .

[31]  Sanjay Jain,et al.  Open Problems in "Systems That Learn" , 1994, J. Comput. Syst. Sci..

[32]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[33]  Yee Whye Teh,et al.  A Hierarchical Bayesian Language Model Based On Pitman-Yor Processes , 2006, ACL.

[34]  Joan L. Bybee,et al.  Regular morphology and the lexicon. , 1995 .

[35]  C. Ling,et al.  Answering the connectionist challenge: a symbolic model of learning the past tenses of English verbs , 1993, Cognition.

[36]  Richard Sproat,et al.  Morphology and computation , 1992 .

[37]  P. Niyogi,et al.  Learning from triggers , 1996 .

[38]  A. Orlitsky,et al.  Always Good Turing: Asymptotically Optimal Probability Estimation , 2003, Science.

[39]  R. Baayen,et al.  Morphological influences on the recognition of monosyllabic monomorphemic words , 2006 .

[40]  Yee Whye Teh,et al.  Beam sampling for the infinite hidden Markov model , 2008, ICML '08.

[41]  Mark Johnson,et al.  Squibs and Discussions: Memoization in Top-Down Parsing , 1995, CL.

[42]  S. Pinker,et al.  On language and connectionism: Analysis of a parallel distributed processing model of language acquisition , 1988, Cognition.

[43]  Noam Chomsky,et al.  Remarks on Nominalization , 2020, Nominalization.

[44]  Cristina Burani,et al.  Morphological Structure and Lexical Access. , 1984 .

[45]  Mark Johnson,et al.  Improving nonparameteric Bayesian inference: experiments on unsupervised word segmentation with adaptor grammars , 2009, NAACL.

[46]  Thomas L. Griffiths,et al.  Language Evolution by Iterated Learning With Bayesian Agents , 2007, Cogn. Sci..

[47]  William Croft,et al.  Radical Construction Grammar: Syntactic Theory in Typological Perspective , 2001 .

[48]  J. Lambek The Mathematics of Sentence Structure , 1958 .

[49]  Emil L. Post Recursively enumerable sets of positive integers and their decision problems , 1944 .

[50]  Shalom Lappin,et al.  Linguistic Nativism and the Poverty of the Stimulus , 2011 .

[51]  Gert Westermann,et al.  Emergent modularity and U-shaped learning in a constructivist neural network learning the English past tense , 1998 .

[52]  Alexander Clark,et al.  Learning Auxiliary Fronting with Grammatical Inference , 2006, CoNLL.

[53]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[54]  Mark Aronoff,et al.  Morphological Productivity and Phonological Transparency , 1981 .

[55]  M. Tomasello First Steps toward a Usage-Based Theory of Language Acquisition , 2001 .

[56]  Davide Ricca,et al.  Productivity in Italian word formation: a variable-corpus approach , 2006 .

[57]  P. H. Lindsay Human Information Processing , 1977 .

[58]  J. Pitman,et al.  The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator , 1997 .

[59]  Joshua B. Tenenbaum,et al.  Church: a language for generative models , 2008, UAI.

[60]  John Case,et al.  Memory-Limited U-Shaped Learning , 2006, COLT.

[61]  Robert C. Berwick,et al.  The proper treatment of language acquisition and change in a population setting , 2009, Proceedings of the National Academy of Sciences.

[62]  John R. Anderson,et al.  Reflections of the Environment in Memory Form of the Memory Functions , 2022 .

[63]  Luigi Burzio Missing players: Phonology and the past-tense debate , 2002 .

[64]  Willem H. Zuidema Parsimonious Data-Oriented Parsing , 2007, EMNLP-CoNLL.

[65]  John R. Anderson The Adaptive Character of Thought , 1990 .

[66]  Mehryar Mohri,et al.  Finite-State Transducers in Language and Speech Processing , 1997, CL.

[67]  William A. Gale,et al.  Good-Turing Frequency Estimation Without Tears , 1995, J. Quant. Linguistics.

[68]  Partha Niyogi,et al.  A Note on Zipf's Law, Natural Languages, and Noncoding DNA regions , 1995, ArXiv.

[69]  Victor Kuperman Lexical processing of morphologically complex words : an information-theoretical perspective , 2008 .

[70]  B. Ambridge,et al.  Children's judgments of regular and irregular novel past-tense forms: new data on the English past-tense debate. , 2010, Developmental psychology.

[71]  Joan L. Bybee,et al.  Rules and schemas in the development and use of the English past tense , 1982 .

[72]  R. Brown,et al.  A First Language , 1973 .

[73]  Anne Cutler Productivity in word formation , 1980 .

[74]  Kenichi Kurihara,et al.  Variational Bayesian Grammar Induction for Natural Language , 2006, ICGI.

[75]  N. Stanietsky,et al.  The interaction of TIGIT with PVR and PVRL2 inhibits human NK cell cytotoxicity , 2009, Proceedings of the National Academy of Sciences.

[76]  R. Levy Expectation-based syntactic comprehension , 2008, Cognition.

[77]  John McCarthy,et al.  Recursive functions of symbolic expressions and their computation by machine, Part I , 1960, Commun. ACM.

[78]  Thomas L. Griffiths,et al.  Revealing Priors on Category Structures Through Iterated Learning , 2006 .

[79]  P. Kiparsky Explanation In Phonology , 1982 .

[80]  Philippe de Groote,et al.  Towards Abstract Categorial Grammars , 2001, ACL.

[81]  Steven Pinker,et al.  Words and rules , 1998 .

[82]  J. Hay From Speech Perception to Morphology: Affix Ordering Revisited , 2002 .

[83]  J. Pitman Combinatorial Stochastic Processes , 2006 .

[84]  Michael Mitzenmacher,et al.  A Brief History of Generative Models for Power Law and Lognormal Distributions , 2004, Internet Math..

[85]  H. Simon,et al.  ON A CLASS OF SKEW DISTRIBUTION FUNCTIONS , 1955 .

[86]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[87]  Raymond J. Mooney,et al.  Induction of First-Order Decision Lists: Results on Learning the Past Tense of English Verbs , 1995, J. Artif. Intell. Res..

[88]  Janet B. Pierrehumbert,et al.  Exemplar dynamics: Word frequency, lenition and contrast , 2000 .

[89]  J. Berko The Child's Learning of English Morphology , 1958 .

[90]  Y. Bar-Hillel A Quasi-Arithmetical Notation for Syntactic Description , 1953 .

[91]  R. Harald Baayen,et al.  Probabilistic approaches to morphology , 2003 .

[92]  Sophia Ananiadou,et al.  On the definition of word , 2004, Machine Translation.

[93]  Gerald J. Sussman,et al.  Structure and interpretation of computer programs , 1985, Proceedings of the IEEE.

[94]  Ray J. Solomonoff,et al.  A new method for discovering the grammars of phrase structure languages , 1959, IFIP Congress.

[95]  Roger W. Schvaneveldt,et al.  TESTING MORPHOLOGICAL PRODUCTIVITY , 1978 .

[96]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[97]  Richard M. Weist,et al.  Finiteness systems and lexical aspect in child Polish and English , 2009 .

[98]  Mark Johnson The DOP Estimation Method Is Biased and Inconsistent , 2002, Computational Linguistics.

[99]  John H.Gillespie Population Genetics: A Concise Guide , 1997 .

[100]  Z. Harris,et al.  Foundations of language , 1941 .

[101]  H. Rubenstein,et al.  Homographic entries in the internal lexicon , 1970 .

[102]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.

[103]  Ingo Plag,et al.  Word-Formation in English , 2018 .

[104]  Colin Bannard,et al.  Stored Word Sequences in Language Learning , 2008, Psychological science.

[105]  Khalil Sima'an,et al.  Data-Oriented Parsing , 2003 .

[106]  M. Taft A morphological-decomposition model of lexical representation , 1988 .

[107]  Avi Pfeffer,et al.  IBAL: A Probabilistic Rational Programming Language , 2001, IJCAI.

[108]  K. Köpcke Schemas in German plural formation , 1988 .

[109]  Matt Post,et al.  Bayesian Learning of a Tree Substitution Grammar , 2009, ACL.

[110]  Peter Norvig,et al.  Techniques for Automatic Memoization with Applications to Context-Free Parsing , 1991, CL.

[111]  Steven Pinker,et al.  Generalisation of regular and irregular morphological patterns , 1993 .

[112]  Donald G. MacKay,et al.  On the Retrieval and Lexical Structure of Verbs. , 1976 .

[113]  A. Church A Set of Postulates for the Foundation of Logic , 1932 .

[114]  Joseph E. Emonds,et al.  Root and structure-preserving transformations , 1970 .

[115]  A. Goldberg,et al.  Incidental verbatim memory for language , 2010, Language and Cognition.

[116]  Phil Blunsom,et al.  A Note on the Implementation of Hierarchical Dirichlet Processes , 2009, ACL/IJCNLP.

[117]  L. Egghe Power Laws in the Information Production Process: Lotkaian Informetrics , 2005 .

[118]  P. Suppes The Semantics of Children's Language , 1974 .

[119]  Daniel M. Roy,et al.  Computable Exchangeable Sequences Have Computable de Finetti Measures , 2009, CiE.

[120]  David Eddington,et al.  Analogy and the dual-route model of morphology , 2000 .

[121]  Ton van der Wouden,et al.  Extended Lexical Units in Dutch , 2005, CLIN.

[122]  Paul Kiparsky,et al.  Word-formation and the lexicon , 1982 .

[123]  C. Fillmore,et al.  Grammatical constructions and linguistic generalizations: The What's X doing Y? construction , 1999 .

[124]  Joan L. Bybee,et al.  From Usage to Grammar: The Mind's Response to Repetition , 2007 .

[125]  R. Baayen,et al.  Affixal Homonymy triggers full-form storage, even with inflected words, even in a morphologically rich language , 2000, Cognition.

[126]  Rochelle Lieber,et al.  Word frequency distributions and lexical semantics , 1996, Comput. Humanit..

[127]  M. Aronoff,et al.  Producing morphologically complex words , 1988 .

[128]  Michael Collins,et al.  Convolution Kernels for Natural Language , 2001, NIPS.

[129]  M A Nowak,et al.  Language evolution and information theory. , 2000, Journal of theoretical biology.

[130]  Thomas L. Griffiths,et al.  Interpolating between types and tokens by estimating power-law generators , 2005, NIPS.

[131]  Alfonso Caramazza,et al.  Representation and processing of derived words , 1987 .

[132]  John Skilling,et al.  Data analysis : a Bayesian tutorial , 1996 .

[133]  Phil Blunsom,et al.  Inducing Tree-Substitution Grammars , 2010, J. Mach. Learn. Res..

[134]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[135]  Chung-chieh Shan,et al.  Embedded Probabilistic Programming , 2009, DSL.

[136]  Jean Berko Gleason,et al.  Parent–child interaction and the acquisition of lexical information during play. , 1980 .

[137]  Ingo Plag,et al.  The role of selectional restrictions, phonotactics and parsing in constraining suffix ordering in English , 2002 .

[138]  R. Harald Baayen,et al.  Quantitative aspects of morphological productivity , 1992 .

[139]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[140]  Noam Chomsky,et al.  Three models for the description of language , 1956, IRE Trans. Inf. Theory.

[141]  J. Roger Hindley,et al.  Lambda-Calculus and Combinators: An Introduction , 2008 .

[142]  A. Turing On Computable Numbers, with an Application to the Entscheidungsproblem. , 1937 .

[143]  Noam Chomsky,et al.  The Sound Pattern of English , 1968 .

[144]  Leslie G. Valiant,et al.  A Quantitative Theory of Neural Computation , 2006, Biological Cybernetics.

[145]  Maarten M. Fokkinga,et al.  Functional Programming with Bananas, Lenses, Envelopes and Barbed Wire , 1991, FPCA.

[146]  J. Saffran Statistical Language Learning , 2003 .

[147]  Whitney Tabor,et al.  A dynamical systems perspective on the relationship between symbolic and non-symbolic computation , 2009, Cognitive Neurodynamics.

[148]  Gerald J. Sussman,et al.  Sparse Representations for Fast, One-Shot Learning , 1997, AAAI/IAAI.

[149]  S. Pinker,et al.  The Nature of Human Concepts/Evidence from an Unusual Source , 1996 .

[150]  Daniel M. Roy Computability, inference and modeling in probabilistic programming , 2011 .

[151]  S. J. Keyser Linguistic inquiry monographs , 1976 .

[152]  J. Segui,et al.  Words and Morphemes as Units for Lexical Access , 1997 .

[153]  Thomas K. Landauer,et al.  How Much do People Remember? Some Estimates of the Quantity of Learned Information in Long-Term Memory , 1986, Cogn. Sci..

[154]  T. Griffiths,et al.  ITERATED LEARNING OF MULTIPLE LANGUAGES FROM MULTIPLE TEACHERS , 2010 .

[155]  L. Bloom Language Development: Form and Function in Emerging Grammars , 1970 .

[156]  Thomas L. Griffiths,et al.  The evolution of frequency distributions: Relating regularization to inductive biases through iterated learning , 2009, Cognition.

[157]  R. Harald Baayen,et al.  41. Corpus linguistics in morphology: Morphological productivity , 2009 .

[158]  R. Baayen,et al.  Putting the bits together: an information theoretical perspective on morphological processing , 2004, Cognition.

[159]  James Rogers,et al.  wMSO theories as grammar formalisms , 2003, Theor. Comput. Sci..

[160]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[161]  Cristina Burani,et al.  Effects of semantic markedness in the processing of regular nominal singulars and plurals in Italian , 1997 .

[162]  J. Hay Causes and Consequences of Word Structure , 2003 .

[163]  Robert F. Hadley Cognition and the computational power of connectionist networks , 2000, Connect. Sci..

[164]  Dominiek Sandra,et al.  On the Representation and Processing of Compound Words: Automatic Access to Constituent Morphemes Does Not Occur , 1990 .

[165]  Lois Bloom,et al.  One Word at a Time: The Use of Single Word Utterances Before Syntax , 1976 .

[166]  Donald A. Schumsky,et al.  The Morpheme Boundaries of Some English Derivational Suffixes. , 1980 .

[167]  Noam Chomsky,et al.  On Certain Formal Properties of Grammars , 1959, Inf. Control..

[168]  Kenny Smith,et al.  Iterated learning in populations of Bayesian agents , 2009 .

[169]  C. Fillmore,et al.  Regularity and Idiomaticity in Grammatical Constructions: The Case of Let Alone , 1988 .

[170]  G. Marcus The Algebraic Mind: Integrating Connectionism and Cognitive Science , 2001 .

[171]  Royal Skousen,et al.  Productivity and the English Past Tense: Testing Skousen's Analogy Model , 1994 .

[172]  L. Feldman,et al.  Morphological Priming: The Role of Prime Duration, Semantic Transparency, and Affix Position , 1999, Brain and Language.

[173]  Paul Kay,et al.  An Informal Sketch of a Formal Architecture for Construction Grammar , 2002, Grammars.

[174]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[175]  Lifang Du,et al.  A Survey of the Measurements of Morphological Productivity. , 2010 .

[176]  Mark Aronoff,et al.  Word Formation in Generative Grammar , 1979 .

[177]  J. Morton 7 – A Functional Model for Memory1 , 1970 .

[178]  C. Fowler,et al.  Relations among regular and irregular morphologically related words in the lexicon as revealed by repetition priming , 1985, Memory & cognition.

[179]  Simon Kirby,et al.  Innateness and culture in the evolution of language , 2006, Proceedings of the National Academy of Sciences.

[180]  Ivan A. Sag,et al.  Syntactic Theory: A Formal Introduction , 1999, Computational Linguistics.

[181]  Rens Bod,et al.  Beyond Grammar: An Experience-Based Theory of Language , 1998 .

[182]  Mark Steedman,et al.  The syntactic process , 2004, Language, speech, and communication.

[183]  Robert Schreuder,et al.  Towards a psycholinguistic computational model for morphological parsing , 2000, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[184]  Dan Klein,et al.  Simple, Accurate Parsing with an All-Fragments Grammar , 2010, ACL.

[185]  Dominiek Sandra,et al.  The morphology of the mental lexicon: Internal word structure viewed from a psycholinguistic perspective , 1994 .

[186]  C. Lebiere,et al.  The Atomic Components of Thought , 1998 .

[187]  Heinz Giegerich,et al.  Lexical Strata in English: Morphological Causes, Phonological Effects , 2005 .

[188]  Rens Bod,et al.  An All-Subtrees Approach to Unsupervised Parsing , 2006, ACL.

[189]  J. Bresnan Lexical-Functional Syntax , 2000 .

[190]  Lauri Karttunen,et al.  Finite State Morphology , 2003, CSLI Studies in Computational Linguistics.

[191]  Royal Skousen Analogy and Structure , 1992, Springer Netherlands.

[192]  G. Murphy,et al.  The Big Book of Concepts , 2002 .

[193]  Matti Laine,et al.  The Interplay of Word Formation Type, Affixal Homonymy, and Productivity in Lexical Processing: Evidence from a Morphologically Rich Language , 1999 .

[194]  Steven Pinker,et al.  Why No Mere Mortal Has Ever Flown Out to Center Field , 1991, Cogn. Sci..

[195]  Joan L. Bybee Morphology: A study of the relation between meaning and form , 1985 .

[196]  R. Jackendoff Morphological and semantic regularities in the lexicon , 1975 .

[197]  Nivja H. de Jong,et al.  Changing places: A cross-language perspective on frequency and family size in Dutch and Hebrew , 2005 .

[198]  Willem J. M. Levelt,et al.  Lexical access during the production of idiomatic phrases , 2006 .

[199]  Mark Johnson,et al.  Unsupervised Word Segmentation for Sesotho Using Adaptor Grammars , 2008, SIGMORPHON.

[200]  D. Howes,et al.  Zipf's Law and Miller's Random-Monkey Model , 1968 .

[201]  R. Baayen,et al.  Chronicling the Times: Productive Lexical Innovations in an English Newspaper , 1996 .

[202]  Stefan Evert,et al.  Measuring morphological productivity: Is automatic preprocessing sufficient? , 2001 .

[203]  Adele E. Goldberg,et al.  Constructions at Work , 2005 .

[204]  John J. L. Morton,et al.  Interaction of information in word recognition. , 1969 .

[205]  E. Jaynes Probability theory : the logic of science , 2003 .

[206]  Vanessa Ferdinand,et al.  Thomas' theorem meets Bayes' rule: A model of the iterated learning of language , 2009 .

[207]  Christian Biemann,et al.  A Random Text Model for the Generation of Statistical Language Invariants , 2007, NAACL.

[208]  K. Nelson Narratives from the crib , 1990 .

[209]  John Case,et al.  When unlearning helps , 2008, Inf. Comput..

[210]  S Pinker,et al.  Weird past tense forms , 1995, Journal of Child Language.

[211]  Rochelle Lieber,et al.  On the organization of the lexicon , 1981 .

[212]  H. Jeffreys,et al.  Theory of probability , 1896 .

[213]  Philipp Koehn,et al.  Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) , 2007 .

[214]  Joshua Goodman,et al.  Parsing Inside-Out , 1998, ArXiv.

[215]  Lorenzo Carlucci,et al.  Non-U-shaped vacillatory and team learning , 2008, J. Comput. Syst. Sci..

[216]  Walter Daelemans,et al.  Memory-Based Language Processing , 2009, Studies in natural language processing.

[217]  V. Marchman,et al.  From rote learning to system building: acquiring verb morphology in children and connectionist nets , 1993, Cognition.

[218]  Stefan Evert,et al.  A Simple LNRE Model for Random Character Sequences , 2004 .

[219]  M. Halle,et al.  Segmental phonology of Modern English , 1985 .

[220]  Linda Wilbanks,et al.  IT Productivity = ?? , 2009, IT Prof..

[221]  William D. Marslen-Wilson,et al.  Dissociating types of mental computation , 1997, Nature.

[222]  E. Clark Awareness of Language: Some Evidence from what Children Say and Do , 1978 .

[223]  K. P. Mohanan,et al.  The Theory of Lexical Phonology , 1982 .

[224]  Marcus Taft,et al.  The representation of bound morphemes in the lexicon: A Chinese study. , 1995 .

[225]  G. Zipf The Psycho-Biology Of Language: AN INTRODUCTION TO DYNAMIC PHILOLOGY , 1999 .

[226]  James Hoeffner,et al.  Are Rules a Thing of the Past?: The Acquisition of Verbal Morphology by an Attractor Network. , 1992 .

[227]  David A. Wood,et al.  Perspectives on formulaic language : acquisition and communication , 2010 .

[228]  Mark S. Seidenberg,et al.  Evaluating behavioral and neuroimaging data on past tense processing , 1998, Language.

[229]  Geoffrey K. Pullum,et al.  Natural languages and context-free languages , 1982 .

[230]  M. Maratsos More overregularizations after all: new data and discussion on Marcus, Pinker, Ullman, Hollander, Rosen & Xu , 2000, Journal of Child Language.

[231]  Joshua B. Tenenbaum,et al.  Natively probabilistic computation , 2009 .

[232]  S. Kirby,et al.  Cultural evolution: implications for understanding the human language faculty and its evolution , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[233]  R. Baayen,et al.  The balance of storage and computation in morphological processing: the role of word formation type, affixal homonymy, and productivity. , 2000, Journal of experimental psychology. Learning, memory, and cognition.

[234]  Dan Klein,et al.  Learning and Inference for Hierarchically Split PCFGs , 2007, AAAI.

[235]  Erwin Chan,et al.  Structures and distributions in morphology learning , 2008 .

[236]  Alec Marantz,et al.  Architecture and Blocking , 2008, Linguistic Inquiry.

[237]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[238]  D. Bradley,et al.  Computational distinctions of vocabulary type , 1978 .

[239]  Kim Plunkett,et al.  A Connectionist Model of the Arabic Plural System , 1997 .

[240]  Marcus Tomalin Linguistics and the Formal Sciences: The Origins of Generative Grammar , 2006 .

[241]  Kenneth Daniel Shenkman Structure sensitivity and language processing in adult learners of English , 1994 .

[242]  Thomas L. Griffiths,et al.  Adaptor Grammars: A Framework for Specifying Compositional Nonparametric Bayesian Models , 2006, NIPS.

[243]  R. Harald Baayen,et al.  Productivity in language production , 1994 .

[244]  R. T. Cox Probability, frequency and reasonable expectation , 1990 .

[245]  James L. McClelland,et al.  On learning the past-tenses of English verbs: implicit rules or parallel distributed processing , 1986 .

[246]  Thomas L. Griffiths,et al.  Producing Power-Law Distributions and Damping Word Frequencies with Two-Stage Language Models , 2011, J. Mach. Learn. Res..

[247]  Joan L. Bybee,et al.  Frequency and the emergence of linguistic structure , 2001 .

[248]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[249]  Leon G. Higley,et al.  Forensic Entomology: An Introduction , 2009 .

[250]  Edward P. Stabler,et al.  Computational models of language universals: Expressiveness, learnability and consequences , 2011 .

[251]  G. Marcus Kluge: The Haphazard Construction of the Human Mind , 2008 .

[252]  Elisabeth Selkirk,et al.  The syntax of words , 1982 .

[253]  Erez Lieberman,et al.  Quantifying the evolutionary dynamics of language , 2007, Nature.

[254]  S Pinker,et al.  Rules of language. , 1991, Science.

[255]  Thomas L. Griffiths,et al.  Bayesian Inference for PCFGs via Markov Chain Monte Carlo , 2007, NAACL.

[256]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[257]  Robert Schreuder,et al.  Modeling morphological processing. , 1995 .

[258]  Phil Blunsom,et al.  Inducing Compact but Accurate Tree-Substitution Grammars , 2009, NAACL.

[259]  Julie E. Boland,et al.  Lexical Morphology and Lexical Access , 1999, Brain and Language.

[260]  Daniel Jurafsky,et al.  The role of the lemma in form variation , 2002 .

[261]  Brian Roark,et al.  Computational Approaches to Morphology and Syntax , 2007 .

[262]  G. Westermann,et al.  Connectionist approaches to language learning , 2009 .

[263]  Marcus Taft,et al.  Interactive-activation as a framework for understanding morphological processing , 1994 .

[264]  E. Dąbrowska,et al.  Learning a morphological system without a default: the Polish genitive , 2001, Journal of Child Language.

[265]  S Pinker,et al.  On the acquisition of grammatical morphemes , 1981, Journal of Child Language.

[266]  Laurie Bauer Scalar productivity and -lily adverbs , 1992 .

[267]  D. Siegel Topics in English morphology , 1979 .

[268]  Harald Baayen,et al.  Suffix Ordering and Morphological Processing , 2009 .

[269]  J. Saffran Constraints on Statistical Language Learning , 2002 .

[270]  Andrew Radford,et al.  Minimalist Syntax: Exploring the Structure of English , 2004 .

[271]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[272]  V. Marchman,et al.  Learning from a connectionist model of the acquisition of the English past tense , 1996, Cognition.

[273]  C. F. Hockett Two Models of Grammatical Description , 1954 .

[274]  Brian MacWhinney,et al.  Enriching CHILDES for Morphosyntactic Analysis , 2008 .

[275]  Ewa Dąbrowska,et al.  Rules or schemas? Evidence from Polish , 2004 .

[276]  John R. Anderson,et al.  Why do children learn to say “Broke”? A model of learning the past tense without feedback , 2002, Cognition.

[277]  R. Baayen,et al.  Singulars and plurals in Dutch: Evidence for a parallel dual-route model , 1997 .

[278]  M A Nowak,et al.  Evolution of universal grammar. , 2001, Science.

[279]  Geoffrey K. Pullum,et al.  On the Mathematical Foundations of Syntactic Structures , 2011, J. Log. Lang. Inf..

[280]  David J. Weir,et al.  Characterizing mildly context-sensitive grammar formalisms , 1988 .

[281]  John R. Anderson,et al.  The place of cognitive architectures in a rational analysis , 1988 .

[282]  Noam Chomsky,et al.  The Logical Structure of Linguistic Theory , 1975 .

[283]  Robert Schreuder,et al.  How Complex Simplex Words can be , 1997 .

[284]  Carol Lynn Moder,et al.  Morphological Classes as Natural Categories , 1983 .

[285]  R. Harald Baayen,et al.  Parsing and productivity , 2002 .

[286]  Frank Wijnen,et al.  Storage and Computation in the Language Faculty , 2002 .

[287]  E. Williams,et al.  On the definition of word , 1987 .

[288]  Joshua B. Tenenbaum,et al.  Fragment Grammars: Exploring Computation and Reuse in Language , 2009 .

[289]  Pieter A. M. Seuren Concerning the roots of transformational generative grammar , 2009 .