Model selection and phonological argumentation

Statistical and empirical methods are in widespread use in present-day phonological research. In particular, researchers are often interested in the problem of model selection, or determining whether or not a particular term in a model is statistically significant, in order to make a judgement about whether or not that term is theoretically significant. If a term is not significant, it is often tempting to conclude that it is not relevant. However, such inferences require an assessment of statistical power, a dimension independent from significance. Assessing power is more difficult than assessing significance because it depends on factors including the true (or expected) effect size, sample size, and degree of noise. In this paper, we provide a non-technical introduction to the issue of power, illustrated with simulations based on experimental investigations of incomplete neutralization, to illustrate how not all null results are equally informative. In particular, depending on the statistical power, a non-significant result can either be uninformative or reasonably interpreted as providing evidence for the null hypothesis.

[1]  M. Oostendorp The puzzle : Incomplete final devoicing 3 2 The puzzle : Incomplete final devoicing , 2007 .

[2]  R. Port,et al.  Neutralization of syllable-final voicing in German , 1985 .

[3]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[4]  John A. Goldsmith,et al.  Unsupervised Learning of the Morphology of a Natural Language , 2001, CL.

[5]  Aaron Braver,et al.  Degrees of incompleteness in neutralization: Paradigm uniformity in a phonetics with weighted constraints , 2013 .

[6]  Heiko Haller,et al.  Misinterpretations of significance: A problem students share with their teachers? , 2002 .

[7]  John Goldsmith The evaluation metric in generative grammar , 2011 .

[8]  D. Barr,et al.  Random effects structure for confirmatory hypothesis testing: Keep it maximal. , 2013, Journal of memory and language.

[9]  A. M. Ramer A letter from an incompletely neutral phonologist , 1996 .

[10]  David R. Anderson,et al.  Model Based Inference in the Life Sciences: A Primer on Evidence , 1998 .

[11]  Bodo Winter,et al.  The Nature of Incomplete Neutralization in German : Implications for Laboratory Phonology , 2012 .

[12]  Viktor Kharlamov,et al.  Incomplete neutralization of the voicing contrast in word-final obstruents in Russian: Phonological, lexical, and methodological influences , 2014, J. Phonetics.

[13]  Bodo Winter,et al.  The other N: The role of repetitions and items in the design of phonetic experiments , 2015, ICPhS.

[14]  R. Baierlein Probability Theory: The Logic of Science , 2004 .

[15]  W. Jassem,et al.  Neutralization of voicing in Polish obstruents , 1989 .

[16]  R. Harald Baayen,et al.  Analyzing linguistic data: a practical introduction to statistics using R, 1st Edition , 2008 .

[17]  M. Fourakis,et al.  On the ‘Incomplete Neutralization’ of German Final Obstruents , 1984 .

[18]  Shravan Vasishth,et al.  Statistical Methods for Linguistic Research: Foundational Ideas - Part I , 2016, Lang. Linguistics Compass.

[19]  S. Lazic,et al.  Model based Inference in the Life Sciences: a Primer on Evidence , 2011 .

[20]  Allard Jongman,et al.  Incomplete neutralization and other sub-phonemic durational differences in production and perception: evidence from Dutch , 2004, J. Phonetics.

[21]  D. Bates,et al.  Balancing Type I Error and Power in Linear Mixed Models , 2015, 1511.01864.

[22]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[23]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[24]  Allard Jongman,et al.  Orthographic vs. morphological incomplete neutralization effects , 2006, J. Phonetics.

[25]  Hans Georg Piroth,et al.  Speaker-dependent differences in voicing and devoicing of German obstruents , 2004, J. Phonetics.

[26]  Sven Grawunder,et al.  Assessing incomplete neutralization of final devoicing in German , 2014, J. Phonetics.