A Comparison of Item Calibration Procedures in the Presence of Test Speededness.

In the presence of test speededness, the parameter estimates of item response theory models can be poorly estimated due to conditional dependencies among items, particularly for end-of-test items (i.e., speeded items). This article conducted a systematic comparison of five-item calibration procedures—a two-parameter logistic (2PL) model, a one-dimensional mixture model, a two-step strategy (a combination of the one-dimensional mixture and the 2PL), a two-dimensional mixture model, and a hybrid model-–by examining how sample size, percentage of speeded examinees, percentage of missing responses, and way of scoring missing responses (incorrect vs. omitted) affect the item parameter estimation in speeded tests. For nonspeeded items, all five procedures showed similar results in recovering item parameters. For speeded items, the one-dimensional mixture model, the two-step strategy, and the two-dimensional mixture model provided largely similar results and performed better than the 2PL model and the hybrid model in calibrating slope parameters. However, those three procedures performed similarly to the hybrid model in estimating intercept parameters. As expected, the 2PL model did not appear to be as accurate as the other models in recovering item parameters, especially when there were large numbers of examinees showing speededness and a high percentage of missing responses with incorrect scoring. Real data analysis further described the similarities and differences between the five procedures.

[1]  J. P. Meyer,et al.  A Mixture Rasch Model With Item Response Time Components , 2010 .

[2]  Allan S. Cohen,et al.  A Speeded Item Response Model with Gradual Process Change , 2008 .

[3]  Richard R. Reilly,et al.  A STUDY OF SPEEDEDNESS AS A SOURCE OF TEST BIAS1 , 1972 .

[4]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[5]  Wendy M. Yen,et al.  Scaling Performance Assessments: Strategies for Managing Local Item Dependence , 1993 .

[6]  Allan S. Cohen,et al.  A Method for Maintaining Scale Stability in the Presence of Test Speededness , 2003 .

[7]  T. C. Oshima The Effect of Speededness on Parameter Estimation in Item Response Theory , 1994 .

[8]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[9]  Wim J. van der Linden,et al.  IRT Parameter Estimation With Response Times as Collateral Information , 2010 .

[10]  Jürgen Rost,et al.  Rasch Models in Latent Classes: An Integration of Two Approaches to Item Analysis , 1990 .

[11]  Sun-Joo Cho,et al.  Explanatory Secondary Dimension Modeling of Latent Differential Item Functioning , 2011 .

[12]  Stephen G. Sireci,et al.  Validity Issues in Test Speededness , 2007 .

[13]  Brenda H. Loyd,et al.  VERTICAL EQUATING USING THE RASCH MODEL , 1980 .

[14]  Furong Gao,et al.  Investigating Local Dependence With Conditional Covariance Functions , 1998 .

[15]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[16]  Allan S. Cohen,et al.  Item Parameter Estimation Under Conditions of Test Speededness: Application of a Mixture Rasch Model With Ordinal Constraints , 2002 .