Corpus linguistics and naive discriminative learning

Three classifiers from machine learning (the generalized linear mixed model, memory based learning, and support vector machines) are compared with a naive discriminative learning classifier, derived from basic principles of error-driven learning characterizing animal and human learning. Tested on the dative alternation in English, using the Switchboard data from (BRESNAN; CUENI; NIKITINA; BAAYEN, 2007), naive discriminative learning emerges with stateof-the-art predictive accuracy. Naive discriminative learning offers a united framework for understanding the learning of probabilistic distributional patterns, for classification, and for a cognitive grounding of distinctive collexeme analysis.

[1]  James L. McClelland,et al.  A distributed, developmental model of word recognition and naming. , 1989, Psychological review.

[2]  Walter Daelemans,et al.  Memory-Based Language Processing , 2009, Studies in natural language processing.

[3]  Royal Skousen,et al.  Analogical Modeling Of Language , 1989 .

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  D. Danks Equilibria of the Rescorla--Wagner model , 2003 .

[6]  Melody Dye,et al.  The Effects of Feature-Label-Order and Their Implications for Symbolic Learning , 2010, Cogn. Sci..

[7]  R. Harald Baayen,et al.  Models, forests, and trees of York English: Was/were variation as a case study for statistical practice , 2012, Language Variation and Change.

[8]  Dennis Norris,et al.  The Bayesian reader: explaining word recognition as an optimal Bayesian decision process. , 2006, Psychological review.

[9]  R. Harald Baayen,et al.  Predicting the dative alternation , 2007 .

[10]  Brian MacWhinney,et al.  A Unified Model of Language Acquisition , 2004 .

[11]  Peter Dayan,et al.  Explaining Away in Weight Space , 2000, NIPS.

[12]  Mark S. Seidenberg,et al.  Computing the meanings of words in reading: cooperative division of labor between visual and phonological processes. , 2004, Psychological review.

[13]  R. Harald Baayen,et al.  Sidestepping the combinatorial explosion: Towards a processing model based on discriminative learning , 2010 .

[14]  Michael Ramscar,et al.  Linguistic Self-Correction in the Absence of Feedback: A New Approach to the Logical Problem of Language Acquisition , 2007, Cogn. Sci..

[15]  Ralph R. Miller,et al.  Assessment of the Rescorla-Wagner model. , 1995 .

[16]  J. Tenenbaum,et al.  Special issue on “Probabilistic models of cognition , 2022 .

[17]  W S Murray,et al.  Serial mechanisms in lexical access: the rank hypothesis. , 2004, Psychological review.

[18]  J. Bresnan,et al.  Predicting syntax: Processing dative constructions in American and Australian varieties of English , 2010 .

[19]  J. Kroll,et al.  Handbook of bilingualism : psycholinguistic approaches , 2005 .

[20]  L. Allan,et al.  The widespread influence of the Rescorla-Wagner model , 1996, Psychonomic bulletin & review.

[21]  P. Dayan,et al.  tHe Cognitive neuroSCienCe of Motivation and learning , 2008 .

[22]  L. Allan A note on measurement of contingency between two binary variables in judgment tasks , 1980 .

[23]  R. Rescorla,et al.  A theory of Pavlovian conditioning : Variations in the effectiveness of reinforcement and nonreinforcement , 1972 .

[24]  N. Ellis Language Acquisition as Rational Contingency Learning , 2006 .

[25]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[26]  Achim Zeileis,et al.  BMC Bioinformatics BioMed Central Methodology article Conditional variable importance for random forests , 2008 .

[27]  G. Bower,et al.  From conditioning to category learning: an adaptive network model. , 1988 .

[28]  John R. Anderson Learning and memory: An integrated approach, 2nd ed. , 2000 .

[29]  D. Norris,et al.  Shortlist B: a Bayesian model of continuous speech recognition. , 2008, Psychological review.

[30]  M Coltheart,et al.  DRC: a dual route cascaded model of visual word recognition and reading aloud. , 2001, Psychological review.

[31]  Willem J. M. Levelt,et al.  A theory of lexical access in speech production , 1999, Behavioral and Brain Sciences.

[32]  Stefan Th. Gries,et al.  Frequency tables: tests, effect sizes, and explorations , 2011 .

[33]  James L. McClelland,et al.  An interactive activation model of context effects in letter perception: I. An account of basic findings. , 1981 .

[34]  G. Tutz,et al.  An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. , 2009, Psychological methods.

[35]  N. Snider,et al.  More than words: Frequency effects for multi-word phrases , 2010 .

[36]  H. Baayen,et al.  Holistic Processing of Regular Four-word Sequences 1 HOLISTIC PROCESSING OF REGULAR FOUR-WORD SEQUENCES Holistic Processing of Regular Four-word Sequences : A Behavioral and ERP study of the effects of structure , frequency , and probability on immediate free recall , 2009 .

[37]  B. Hayes,et al.  Rules vs. analogy in English past tenses: a computational/experimental study , 2003, Cognition.

[38]  J. Bresnan,et al.  Syntactic probabilities affect pronunciation variation in spontaneous speech , 2009, Language and Cognition.

[39]  Rens Bod,et al.  Exemplar-based syntax: How to get productivity from examples , 2006 .

[40]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[41]  Nick Chater,et al.  The probabilistic analysis of language acquisition: Theoretical, computational, and experimental analysis , 2010, Cognition.

[42]  Melody Dye,et al.  The Enigma of Number: Why Children Find the Meanings of Even Small Number Words Hard to Learn and How We Can Help Them Do Better , 2011, PloS one.

[43]  E. Steyerberg,et al.  [Regression modeling strategies]. , 2011, Revista espanola de cardiologia.

[44]  Dušica Filipović Đurđević,et al.  An amorphous model for morphological processing in visual comprehension based on naive discriminative learning. , 2011, Psychological review.

[45]  Henrietta J. Cedergren,et al.  Variable Rules: Performance as a Statistical Reflection of Competence , 1974 .

[46]  Frank E. Harrell,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2001 .

[47]  S. Gries,et al.  Extending collostructional analysis: A corpus-based perspective on `alternations' , 2004 .

[48]  W. Schultz Getting Formal with Dopamine and Reward , 2002, Neuron.

[49]  P. Boersma,et al.  Empirical Tests of the Gradual Learning Algorithm , 2001, Linguistic Inquiry.

[50]  J. Grainger,et al.  Orthographic neighborhood effects in bilingual word recognition , 1998 .