Variation of verbal constructions in Estonian dialects

Traditional Estonian dialect classifications are based on phonology, morphology, and lexis; very few studies of syntax are available. The present article is the first quantitative syntactic study of Estonian dialects. We concentrate on constructions consisting of finite and non-finite verbs, and we apply contemporary statistical methods to explore the syntactic variation. Our results show that even bare token frequencies can identify syntactic patterns quite well, and that analyses exploiting collostructional methods make the variational patterns even clearer. We use correspondence analysis and clustering to detect geographic influence on variation. The results suggest that a syntax-based classification of dialects differs from the traditional classifications based mainly on phonology and lexis. Our data reveal systematic differences between eastern and western dialects at the syntactic level, whereas analyses based on phonology and lexis distinguish mainly between northern and southern dialects. The western dialects make more use of analytic constructions consisting of a finite and a non-finite verb form.
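To make the step from bare token frequencies to collostructional analysis concrete, the sketch below computes one association measure that is widely used in collostructional work: the one-sided Fisher exact test on a 2x2 table of verb-by-construction counts, reported as a negative log10 p-value. The abstract does not say which measure the article uses, so the choice of measure, the function names, and the counts in the usage line are all assumptions made for illustration only.

```python
from math import comb, log10


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher exact p-value for attraction in a 2x2 table.

    a: tokens of the verb inside the construction
    b: tokens of the verb outside the construction
    c: tokens of other verbs inside the construction
    d: tokens of other verbs outside the construction
    (Hypothetical cell layout, chosen for this sketch.)
    """
    row1 = a + b          # all tokens of the verb
    col1 = a + c          # all tokens of the construction
    n = a + b + c + d     # corpus size for this table
    upper = min(row1, col1)
    # Sum the hypergeometric probabilities of every table that is
    # at least as extreme as the observed one (a or more co-occurrences).
    tail = sum(
        comb(row1, k) * comb(n - row1, col1 - k)
        for k in range(a, upper + 1)
    )
    return tail / comb(n, col1)


def collostruction_strength(a, b, c, d):
    """Negative log10 of the p-value; larger means stronger attraction."""
    return -log10(fisher_exact_greater(a, b, c, d))


# Invented toy counts, not data from the article.
p_value = fisher_exact_greater(3, 1, 1, 3)
strength = collostruction_strength(3, 1, 1, 3)
```

Computed per dialect, such scores could replace raw frequencies as input to correspondence analysis or clustering. For very large counts the p-value can underflow to zero, so a production version would need to work in log space.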
