Capturing correlational structure in Russian paradigms: A case study in logistic mixed-effects modeling

Abstract. This study addresses the statistical analysis of a suffix shift in Russian verbal paradigms, a change that is spreading through the paradigm and making it more regular. The data, collected from the Russian National Corpus, pose a problem: the counts documenting the shift are based on repeated observations of the same verbs and, moreover, on counts for different parts of the paradigms of those verbs. Unsurprisingly, individual verbs behave consistently, although not uniformly, with respect to the suffix shift. This non-independence of the elementary observations has to be taken into account when the patterns in the data are evaluated statistically. We show how mixed-effects modeling can do this in a principled way, and that doing so is necessary to avoid anti-conservative evaluation of significance.
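
The risk of anti-conservative inference can be illustrated with a small simulation. The sketch below is not the mixed-effects analysis of the study, and it does not use the corpus data. It is a simplified stand-in under stated assumptions: all function names and parameter values are hypothetical, verbs receive a random intercept on the logit scale, and the two groups of verbs do not differ in truth. A naive test that pools all tokens as if they were independent is compared with a test on per-verb proportions, which respects the grouping of tokens under verbs in a cruder way than a mixed model would.

```python
import math
import random
import statistics

CRIT_Z = 1.96  # two-sided 5% critical value, normal distribution
CRIT_T = 2.02  # approximate two-sided 5% critical value for about 38 df


def simulate_verb_counts(rng, n_verbs, n_tokens, sd_verb):
    """Return (shifted, total) token counts for each simulated verb.

    Each verb gets its own probability of showing the shifted suffix,
    drawn via a normal random intercept on the logit scale (an assumption).
    """
    counts = []
    for _ in range(n_verbs):
        logit = rng.gauss(0.0, sd_verb)
        p = 1.0 / (1.0 + math.exp(-logit))
        shifted = sum(rng.random() < p for _ in range(n_tokens))
        counts.append((shifted, n_tokens))
    return counts


def naive_z(group_a, group_b):
    """Two-proportion z statistic that treats every token as independent."""
    xa, na = sum(s for s, _ in group_a), sum(n for _, n in group_a)
    xb, nb = sum(s for s, _ in group_b), sum(n for _, n in group_b)
    pooled = (xa + xb) / (na + nb)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / na + 1.0 / nb))
    return 0.0 if se == 0.0 else (xa / na - xb / nb) / se


def verb_level_t(group_a, group_b):
    """Welch-type statistic computed on per-verb proportions."""
    pa = [s / n for s, n in group_a]
    pb = [s / n for s, n in group_b]
    se = math.sqrt(statistics.variance(pa) / len(pa)
                   + statistics.variance(pb) / len(pb))
    return 0.0 if se == 0.0 else (statistics.mean(pa) - statistics.mean(pb)) / se


def false_positive_rates(n_sims=200, n_verbs=20, n_tokens=50,
                         sd_verb=1.5, seed=1):
    """Simulate under the null (no true group difference) and return the
    share of runs each test declares significant: (naive, verb_level)."""
    rng = random.Random(seed)
    naive_hits = verb_hits = 0
    for _ in range(n_sims):
        a = simulate_verb_counts(rng, n_verbs, n_tokens, sd_verb)
        b = simulate_verb_counts(rng, n_verbs, n_tokens, sd_verb)
        naive_hits += abs(naive_z(a, b)) > CRIT_Z
        verb_hits += abs(verb_level_t(a, b)) > CRIT_T
    return naive_hits / n_sims, verb_hits / n_sims


if __name__ == "__main__":
    naive_rate, verb_rate = false_positive_rates()
    print(f"naive pooled test:  {naive_rate:.2f}")
    print(f"per-verb test:      {verb_rate:.2f}")
```

Because both groups are drawn from the same distribution, a well-calibrated test should reject in roughly 5% of runs. Under these assumptions the pooled test rejects far more often, since it mistakes between-verb variability for evidence of a group difference. A logistic mixed-effects model addresses the same problem while keeping the token-level data and modeling the verb-level variability explicitly.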
