Analyzing linguistic data: a practical introduction to statistics using R, 1st Edition

1. An introduction to 'R'
2. Graphic data exploration
3. Probability distributions
4. Basic statistical methods
5. Clustering and classification
6. Regression modeling
7. Mixed models
Appendix A. Solutions to the exercises
Appendix B. Overview of 'R' functions
