Discovery of association rules between syntactic variables

This research applies an association rule mining technique to purely syntactic dialect data. The paper answers the research question of how relevant associations between syntactic variables can be discovered. The method calculates the proportional overlap between geographical distributions of syntactic microvariables and incorporates rule quality factors such as accuracy, coverage and completeness to measure the interestingness of the variable associations.The exploratory review of the results discusses several highly ranked association rules and also examines an implicational chain of syntactic variables.

[1]  William Frawley,et al.  Knowledge Discovery in Databases , 1991 .

[2]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[3]  H. S. Horn,et al.  Measurement of "Overlap" in Comparative Ecological Studies , 1966, The American Naturalist.

[4]  F. Newmeyer Possible and probable languages: A generative perspective on linguistic typology , 2008 .

[5]  Usama M. Fayyad,et al.  Knowledge Discovery in Databases: An Overview , 1997, ILP.

[6]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[7]  Alex Alves Freitas,et al.  On rule interestingness measures , 1999, Knowl. Based Syst..

[8]  Kenneth McGarry,et al.  A survey of interestingness measures for knowledge discovery , 2005, The Knowledge Engineering Review.

[9]  Martin Haspelmath,et al.  Parametric versus functional explanations of syntactic universals , 2008 .

[10]  Gregory Piatetsky-Shapiro,et al.  Discovery, Analysis, and Presentation of Strong Rules , 1991, Knowledge Discovery in Databases.

[11]  L. Rizzi Null objects in Italian and the theory of 'pro' , 1986 .

[12]  Cristina Guardiano,et al.  Three fundamental issues in parametric linguistics , 2008 .

[13]  Leonie Cornips,et al.  Elicitation techniques in a Dutch syntactic dialect atlas project , 2001 .

[14]  Patrick Meyer,et al.  On selecting interestingness measures for association rules: User oriented description and multiple criteria decision aid , 2008, Eur. J. Oper. Res..

[15]  Marco R. Spruit,et al.  Measuring Syntactic Variation in Dutch Dialects , 2006, Lit. Linguistic Comput..