Using Credible Intervals to Detect Differential Item Functioning in IRT Models

Differential item functioning (DIF) occurs when individuals from different groups who have the same ability level have different probabilities of answering an item correctly. In this paper, we develop a Bayesian approach that detects DIF using credible intervals within the framework of item response theory (IRT) models. The method performs well under both uniform and non-uniform DIF conditions in the two-parameter logistic model. We demonstrate its efficacy through simulation studies and a real data application.
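To make the idea concrete, the following is a minimal sketch of how a credible-interval check for DIF might look in the two-parameter logistic model. It is not the procedure developed in the paper. The function names, the 95% level, the equal-tailed interval, and the simulated normal draws (which stand in for real MCMC output) are all assumptions made for illustration. The sketch treats a between-group difference in difficulty as uniform DIF and a difference in discrimination as non-uniform DIF.

```python
import numpy as np

def p_correct(theta, a, b):
    """2PL item response function with discrimination a and difficulty b."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def credible_interval(draws, level=0.95):
    """Equal-tailed credible interval computed from posterior draws."""
    tail = (1.0 - level) / 2.0
    lo, hi = np.quantile(draws, [tail, 1.0 - tail])
    return float(lo), float(hi)

def flags_dif(ref_draws, focal_draws, level=0.95):
    """Flag an item when the interval for (focal - reference) excludes zero."""
    diff = np.asarray(focal_draws) - np.asarray(ref_draws)
    lo, hi = credible_interval(diff, level)
    return not (lo <= 0.0 <= hi)

# Hypothetical posterior draws for one item (assumption: simulated here,
# where a real analysis would use draws from a fitted Bayesian IRT model).
rng = np.random.default_rng(0)
b_ref = rng.normal(0.0, 0.1, 4000)    # difficulty, reference group
b_focal = rng.normal(0.5, 0.1, 4000)  # difficulty, focal group (shifted)
a_ref = rng.normal(1.2, 0.1, 4000)    # discrimination, reference group
a_focal = rng.normal(1.2, 0.1, 4000)  # discrimination, focal group (same)

uniform_dif = flags_dif(b_ref, b_focal)     # difference in difficulty
nonuniform_dif = flags_dif(a_ref, a_focal)  # difference in discrimination
print(uniform_dif, nonuniform_dif)
```

In this sketch the draws for the two groups are differenced element by element. That is reasonable when the draws come from the same MCMC iterations of a joint model or when the group posteriors are independent. Other interval types, such as highest posterior density intervals, could be substituted in `credible_interval`.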
