Modeling Item Position Effects Using Generalized Linear Mixed Models

Item position effects can seriously bias analyses in educational measurement, especially when multiple matrix sampling designs are deployed. In such designs, item position effects may easily occur if not explicitly controlled for. Still, in practice it usually turns out to be rather difficult—or even impossible—to completely control for effects due to the position of items. The objectives of this article are to show how item position effects can be modeled using the linear logistic test model with additional error term (LLTM +ε) in the framework of generalized linear mixed models (GLMMs), to explore in a simulation study how well the LLTM +ε holds the nominal Type I risk threshold, to conduct power analysis for this model, and to examine the sensitivity of the LLTM +ε to designs that are not completely balanced concerning item position. Overall, the LLTM +ε proved suitable for modeling item position effects when a balanced design is used. With decreasing balance, the model tends to be more conservative in the sense that true item position effects are more unlikely to be detected. Implications for linking and equating procedures which use common items are discussed.

[1]  Frederic M. Lord ITEM SAMPLING IN TEST THEORY AND IN RESEARCH DESIGN , 1965 .

[2]  Melvin R. Novick,et al.  Some latent train models and their use in inferring an examinee's ability , 1966 .

[3]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[4]  G. H. Fischer,et al.  The linear logistic test model as an instrument in educational research , 1973 .

[5]  Wendy M. Yen,et al.  The Extent, Causes and Importance of Context Effects on Item Parameters for Two Latent Trait Models. , 1980 .

[6]  L. Tierney,et al.  Accurate Approximations for Posterior Moments and Marginal Densities , 1986 .

[7]  Linda L. Cook,et al.  Problems Related to the Use of Conventional and Item Response Theory Equating Methods in Less Than Optimal Circumstances , 1987 .

[8]  L. Wise Item Position Effects for Test of Word Knowledge and Arithmetic Reasoning. , 1989 .

[9]  Rebecca Zwick Effects of Item Order and Context on Estimation of NAEP Reading Proficiency , 1991 .

[10]  Chapter 7: Statistical and Psychometric Issues in the Measurement of Educational Achievement Trends: Examples From the National Assessment of Educational Progress , 1992 .

[11]  Robert L. Brennan The Context of Context Effects , 1992 .

[12]  Nancy L. Allen,et al.  The NAEP 1998 Technical Report. , 2001 .

[13]  Mark Wilson,et al.  A framework for item response models , 2004 .

[14]  Geert Molenberghs,et al.  Estimation and software , 2004 .

[15]  Geert Molenberghs,et al.  An Introduction to (Generalized (Non)Linear Mixed Models , 2004 .

[16]  R. Brennan,et al.  Test Equating, Scaling, and Linking , 2004 .

[17]  Paul De Boeck,et al.  Descriptive and explanatory item response models , 2004 .

[18]  P. Fayers Item Response Theory for Psychologists , 2004, Quality of Life Research.

[19]  M. Meulders,et al.  Person-by-item predictors , 2004 .

[20]  J. Schepers,et al.  Models with item and item group predictors , 2004 .

[21]  Lale Khorramdel,et al.  Examining item-position effects in large-scale assessment using the Linear Logistic Test Model , 2008 .

[22]  Paul De Boeck,et al.  Random Item IRT Models , 2008 .

[23]  Walter D. Way,et al.  Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design , 2008 .

[24]  André A. Rupp,et al.  An NCME Instructional Module on Booklet Designs in Large‐Scale Assessments of Student Achievement: Theory and Practice , 2009 .

[25]  Eugenio Gonzalez,et al.  principles of multiple matrix booklet designs and parameter recovery in large-scale assessments , 2010 .

[26]  Abe D. Hofman,et al.  The estimation of item response models with the lmer function from the lme4 package in R , 2011 .

[27]  Rianne Janssen,et al.  Modeling Item-Position Effects Within an IRT Framework , 2012 .

[28]  Anthony D. Albano,et al.  Rasch Scale Stability in the Presence of Item Parameter and Trait Drift , 2012 .