Please Scroll down for Article Multivariate Behavioral Research Evaluation of Mimic-model Methods for Dif Testing with Comparison to Two- Group Analysis

Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for 1 group of people versus another, irrespective of mean differences on the construct. This study focuses on the use of multiple-indicator multiple-cause (MIMIC) structural equation models for DIF testing, parameterized as item response models. The accuracy of these methods, and the sample size requirements, are not well established. This study examines the accuracy of MIMIC methods for DIF testing when the focal group is small and compares results with those obtained using 2-group item response theory (IRT). Results support the utility of the MIMIC approach. With small focal-group samples, tests of uniform DIF with binary or 5-category ordinal responses were more accurate with MIMIC models than 2-group IRT. Recommendations are offered for the application of MIMIC methods for DIF testing.

[1]  Carol M. Woods Empirical Selection of Anchors for Tests of Differential Item Functioning , 2009 .

[2]  K. G. J8reskoC,et al.  Simultaneous Factor Analysis in Several Populations , 2007 .

[3]  J. Schroeder,et al.  Ethnic differences among adolescents seeking smoking cessation treatment: a structural analysis of responses on the Fagerström Test for Nicotine Dependence. , 2007, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[4]  Fritz Drasgow,et al.  Detecting differential item functioning with confirmatory factor analysis and item response theory: toward a unified strategy. , 2006, The Journal of applied psychology.

[5]  J. Teresi,et al.  Identification of Differential Item Functioning Using Item Response Theory and the Likelihood-Based Model Comparison Approach: Application to the Mini-Mental State Examination , 2006, Medical care.

[6]  Carol M Woods,et al.  Ramsay-curve item response theory (RC-IRT) to detect and correct for nonnormal latent variables. , 2006, Psychological methods.

[7]  S. Oishi The concept of life satisfaction across cultures: An IRT analysis , 2006 .

[8]  Li Cai,et al.  A Cautionary Note on Using G2(dif) to Assess Relative Model Fit in Categorical Data Analysis , 2006, Multivariate behavioral research.

[9]  Holmes Finch,et al.  The MIMIC Model as a Method for Detecting DIF: Comparison With Mantel-Haenszel, SIBTEST, and the IRT Likelihood Ratio , 2005 .

[10]  M. N. Gelin Type I error rates of the DIF MIMIC approach using Joreskog’s covariance matrix with ML and WLS estimation , 2005 .

[11]  Adam W. Meade,et al.  A Comparison of Item Response Theory and Confirmatory Factor Analytic Methodologies for Establishing Measurement Equivalence/Invariance , 2004 .

[12]  Roger E. Millsap,et al.  Assessing Factorial Invariance in Ordered-Categorical Measures , 2004 .

[13]  R. Rapee,et al.  More information from fewer questions: the factor structure and item properties of the original and brief fear of negative evaluation scale. , 2004, Psychological assessment.

[14]  D. Bolt,et al.  A multigroup item response theory analysis of the psychopathy checklist--revised. , 2004, Psychological assessment.

[15]  Wen-Chung Wang,et al.  Effects of Anchor Item Methods on the Detection of Differential Item Functioning Within the Family of Rasch Models , 2004 .

[16]  K. Hagtvet,et al.  Measuring anxiety by ordered categorical items in data with subgroup structure: the case of the Hungarian version of the trait anxiety scale of the state-trait anxiety inventory for children (staic-h) , 2004 .

[17]  J. Abramowitz,et al.  The Anxiety Sensitivity Index - Revised: psychometric properties and factor structure in two nonclinical samples. , 2003, Behaviour research and therapy.

[18]  Wen-Chung Wang,et al.  Effects of Anchor Item Methods on Differential Item Functioning Detection with the Likelihood Ratio Test , 2003 .

[19]  R. MacIntosh,et al.  Variance Estimation for Converting MIMIC Model Parameters to IRT Parameters in DIF Analysis , 2003 .

[20]  J. Anthony,et al.  Possible age-associated bias in reporting of clinical features of drug dependence: epidemiological evidence on adolescent-onset marijuana use. , 2003, Addiction.

[21]  W. Spector,et al.  Impact of differential item functioning on age and gender differences in functional disability. , 2002, The journals of gerontology. Series B, Psychological sciences and social sciences.

[22]  Barbara M Byrne,et al.  Measurement equivalence: a comparison of methods based on confirmatory factor analysis and item response theory. , 2002, The Journal of applied psychology.

[23]  David Thissen,et al.  Quick and Easy Implementation of the Benjamini-Hochberg Procedure for Controlling the False Positive Rate in Multiple Comparisons , 2002 .

[24]  N. Waller The Schedule for Nonadaptive and Adaptive Personality , 2001 .

[25]  D A Grayson,et al.  Item bias in the Center for Epidemiologic Studies Depression Scale: effects of physical disorders and disability in an elderly community sample. , 2000, The journals of gerontology. Series B, Psychological sciences and social sciences.

[26]  B. Mast,et al.  Assessment of functional abilities among geriatric patients: A MIMIC model of the functional independence measure , 2000 .

[27]  Robert D. Ankenmann,et al.  An Investigation of the Power of the Likelihood Ratio Goodness-of-Fit Statistic in Detecting Differential Item Functioning. , 1999 .

[28]  A. Mackinnon,et al.  Age differences in depression and anxiety symptoms: a structural equation modelling analysis of data from a general population sample , 1999, Psychological Medicine.

[29]  John W. Tukey,et al.  Controlling Error in Multiple Comparisons, with Examples from State-to-State Differences in Educational Achievement , 1999 .

[30]  Allan S. Cohen,et al.  Detection of Differential Item Functioning Under the Graded Response Model With the Likelihood Ratio Test , 1998 .

[31]  Frans J. Oort,et al.  Simulation study of item bias detection with restricted factor analysis , 1998 .

[32]  K. Ryan Methods for identifying biased test items , 1997 .

[33]  R. J. Mokken,et al.  Handbook of modern item response theory , 1997 .

[34]  F. Samejima Graded Response Model , 1997 .

[35]  Allan S. Cohen,et al.  An Investigation of the Likelihood Ratio Test For Detection of Differential Item Functioning , 1996 .

[36]  Seock-Ho Kim,et al.  A Comparison of Lord's Chi-Square, Raju's Area Measures, and the Likelihood Ratio Test on Detection of Differential Item Functioning , 1995 .

[37]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[38]  L. Shepard,et al.  Methods for Identifying Biased Test Items , 1994 .

[39]  H. Wainer,et al.  Differential item functioning , 1995 .

[40]  Howard T. Everson,et al.  Methodology Review: Statistical Approaches for Assessing Measurement Bias , 1993 .

[41]  Keith F Widaman,et al.  Confirmatory factor analysis and item response theory: two approaches for exploring measurement invariance. , 1993, Psychological bulletin.

[42]  Howard Wainer,et al.  Detection of differential item functioning using the parameters of item response models. , 1993 .

[43]  F. Oort Using restricted factor analysis to detect item bias , 1992 .

[44]  Leigh Burstein,et al.  Instructionally Sensitive Psychometrics: Application of a New IRT‐Based Detection Technique to Mathematics Achievement Test Items , 1991 .

[45]  B. Muthén Latent variable modeling in heterogeneous populations , 1989 .

[46]  Gideon J. Mellenbergh,et al.  Item bias and item response theory , 1989 .

[47]  Howard Wainer,et al.  Use of item response theory in the study of group differences in trace lines. , 1988 .

[48]  Jan de Leeuw,et al.  On the relationship between item response theory and factor analysis of discretized variables , 1987 .

[49]  Bengt Muthen,et al.  Some uses of structural equation modeling in validity studies: Extending IRT to external variables , 1986 .

[50]  David Thissen,et al.  Beyond group-mean differences: The concept of item bias. , 1986 .

[51]  Bengt Muthén,et al.  A Method for Studying the Homogeneity of Test Items with Respect to Other Relevant Variables , 1985 .

[52]  D. Chambless,et al.  Assessment of fear of fear in agoraphobics: the body sensations questionnaire and the agoraphobic cognitions questionnaire. , 1984, Journal of consulting and clinical psychology.

[53]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters , 1982 .

[54]  Bengt Muthén,et al.  Simultaneous factor analysis of dichotomous variables in several groups , 1981 .

[55]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[56]  A. Goldberger,et al.  Estimation of a Model with Multiple Indicators and Multiple Causes of a Single Latent Variable , 1975 .

[57]  D. Sörbom A GENERAL METHOD FOR STUDYING DIFFERENCES IN FACTOR MEANS AND FACTOR STRUCTURE BETWEEN GROUPS , 1974 .

[58]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .