Derivation and Applicability of Asymptotic Results for Multiple Subtests Person-Fit Statistics

In high-stakes testing, it is important to check the validity of individual test scores. Although a test may, in general, result in valid test scores for most test takers, for some test takers, test scores may not provide a good description of a test taker’s proficiency level. Person-fit statistics have been proposed to check the validity of individual test scores. In this study, the theoretical asymptotic sampling distribution of two person-fit statistics that can be used for tests that consist of multiple subtests is first discussed. Second, simulation study was conducted to investigate the applicability of this asymptotic theory for tests of finite length, in which the correlation between subtests and number of items in the subtests was varied. The authors showed that these distributions provide reasonable approximations, even for tests consisting of subtests of only 10 items each. These results have practical value because researchers do not have to rely on extensive simulation studies to simulate sampling distributions.

[1]  Rob R. Meijer,et al.  The Null Distribution of Person-Fit Statistics for Conventional and Adaptive Tests , 1999 .

[2]  André A. Rupp,et al.  A Systematic Review of the Methodology for Person Fit Research in Item Response Theory: Lessons about Generalizability of Inferences from the Design of Simulation Studies , 2013 .

[3]  N. Smirnov Table for Estimating the Goodness of Fit of Empirical Distributions , 1948 .

[4]  Jorge N. Tendeiro,et al.  The Use of the lz and lz* Person-Fit Statistics and Problems Derived From Model Misspecification , 2012 .

[5]  Fritz Drasgow,et al.  Appropriateness Measurement for Some Multidimensional Test Batteries , 1991 .

[6]  Klaas Sijtsma,et al.  Methodology Review: Evaluating Person Fit , 2001 .

[7]  Ronald D. Armstrong,et al.  On the Performance of the l Z Person-Fit Statistic , 2007 .

[8]  Jorge N. Tendeiro,et al.  Detection of Invalid Test Scores on Admission Tests: A Simulation Study Using Person-Fit Statistics , 2015 .

[9]  P. Fayers Item Response Theory for Psychologists , 2004, Quality of Life Research.

[10]  Klaas Sijtsma,et al.  Statistic lz-Based Person-Fit Methods for Noncognitive Multiscale Measures , 2014 .

[11]  Jorge N. Tendeiro,et al.  The Use of Person-Fit Scores in High-Stakes Educational Testing: How to Use Them and What They Tell Us , 2014 .

[12]  David Magis,et al.  A Didactic Presentation of Snijders’s lz* Index of Person Fit With Emphasis on Response Model Selection and Ability Estimation , 2012 .

[13]  Tom A. B. Snijders,et al.  Asymptotic null distribution of person fit statistics with estimated person parameter , 2001 .

[14]  Fritz Drasgow,et al.  Appropriateness measurement with polychotomous item response models and standardized indices , 1984 .