lordif: An R Package for Detecting Differential Item Functioning Using Iterative Hybrid Ordinal Logistic Regression/Item Response Theory and Monte Carlo Simulations.

Logistic regression provides a flexible framework for detecting various types of differential item functioning (DIF). Previous efforts extended the framework by using item response theory (IRT) based trait scores, and by employing an iterative process using group-specific item parameters to account for DIF in the trait scores, analogous to purification approaches used in other DIF detection frameworks. The current investigation advances the technique by developing a computational platform integrating both statistical and IRT procedures into a single program. Furthermore, a Monte Carlo simulation approach was incorporated to derive empirical criteria for various DIF statistics and effect size measures. For purposes of illustration, the procedure was applied to data from a questionnaire of anxiety symptoms for detecting DIF associated with age from the Patient-Reported Outcomes Measurement Information System.

[1]  Anderson Rb On the comparability of meaningful stimuli in cross-cultural research. , 1967 .

[2]  Keith F Widaman,et al.  Confirmatory factor analysis and item response theory: two approaches for exploring measurement invariance. , 1993, Psychological bulletin.

[3]  B. Zumbo A Handbook on the Theory and Methods of Differential Item Functioning (DIF) LOGISTIC REGRESSION MODELING AS A UNITARY FRAMEWORK FOR BINARY AND LIKERT-TYPE (ORDINAL) ITEM SCORES , 1999 .

[4]  Dan M Mungas,et al.  Composite scores for executive function items: Demographic heterogeneity and relationships with quantitative magnetic resonance imaging , 2008, Journal of the International Neuropsychological Society.

[5]  H. Swaminathan,et al.  Detecting Differential Item Functioning Using Logistic Regression Procedures , 1990 .

[6]  Allan S. Cohen,et al.  DIF Detection and Effect Size Measures for Polytomously Scored Items , 2007 .

[7]  Gerald van Belle,et al.  Test bias in a cognitive test: differential item functioning in the CASI , 2004, Statistics in medicine.

[8]  Randall D. Penfield DIFAS: Differential Item Functioning Analysis System , 2005 .

[9]  A. Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[10]  Richard N. Jones Identification of Measurement Differences Between English and Spanish Language Versions of the Mini-Mental State Examination: Detecting Differential Item Functioning Using MIMIC Modeling , 2006, Medical care.

[11]  M Schemper,et al.  Explained variation for logistic regression. , 1996, Statistics in medicine.

[12]  P. Holland,et al.  DIF DETECTION AND DESCRIPTION: MANTEL‐HAENSZEL AND STANDARDIZATION1,2 , 1992 .

[13]  David Cella,et al.  A comparison of three sets of criteria for determining the presence of differential item functioning using ordinal logistic regression , 2007, Quality of Life Research.

[14]  Susan J. Maller,et al.  Iterative Purification and Effect Size Use With Logistic Regression for Differential Item Functioning Detection , 2007 .

[15]  Dimitrios Rizopoulos ltm: An R Package for Latent Variable Modeling and Item Response Theory Analyses , 2006 .

[16]  R. Zwick When Do Item Response Function and Mantel-Haenszel Definitions of Differential Item Functioning Coincide? , 1990 .

[17]  T. C. Oshima,et al.  The Item Parameter Replication Method for Detecting Differential Functioning in the Polytomous DFIT Framework , 2009 .

[18]  David R. Cox The analysis of binary data , 1970 .

[19]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[20]  S. Haneuse,et al.  Item response theory facilitated cocalibrating cognitive tests and reduced bias in estimated rates of decline. , 2008, Journal of clinical epidemiology.

[21]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[22]  Randall D. Penfield,et al.  An NCME Instructional Module on Using Differential Step Functioning to Refine the Analysis of DIF in Polytomous Items , 2009 .

[23]  P. Mair,et al.  Extended Rasch Modeling: The eRm Package for the Application of IRT Models in R , 2007 .

[24]  F. Samejima Estimation of latent ability using a response pattern of graded scores , 1968 .

[25]  S. Gregorich Do Self-Report Instruments Allow Meaningful Comparisons Across Diverse Population Groups?: Testing Measurement Invariance Using the Confirmatory Factor Analysis Framework , 2006, Medical care.

[26]  Maria Orlando,et al.  Further Investigation of the Performance of S - X2: An Item Fit Index for Use With Dichotomous Item Response Theory Models , 2003 .

[27]  Gerald van Belle,et al.  Differential Item Functioning Analysis With Ordinal Logistic Regression Techniques: DIFdetect and difwithpar , 2006, Medical care.

[28]  Allan S. Cohen,et al.  Detection of Differential Item Functioning Under the Graded Response Model With the Likelihood Ratio Test , 1998 .

[29]  S. Menard Coefficients of Determination for Multiple Logistic Regression Analysis , 2000 .

[30]  J. Spray,et al.  Logistic Discriminant Function Analysis for DIF Identification of Polytomously Scored Items , 1993 .

[31]  C. McHorney,et al.  Assessment of Differential Item Functioning for Demographic Comparisons in the MOS SF-36 Health Survey , 2006, Quality of Life Research.

[32]  S. Reise,et al.  Item Banks for Measuring Emotional Distress From the Patient-Reported Outcomes Measurement Information System (PROMIS®): Depression, Anxiety, and Anger , 2011, Assessment.

[33]  Taehoon Kang,et al.  An Investigation of the Performance of the Generalized S-X2 Item-Fit Index for Polytomous IRT Models , 2007 .

[34]  W. Stout,et al.  A new procedure for detection of crossing DIF , 1996 .

[35]  P. Crane,et al.  Japanese–English language equivalence of the Cognitive Abilities Screening Instrument among Japanese-Americans , 2009, International Psychogeriatrics.

[36]  L. E. Gibbons,et al.  Measuring depression levels in HIV-infected patients as part of routine clinical care using the nine-item Patient Health Questionnaire (PHQ-9) , 2010, AIDS care.

[37]  Randall D. Penfield Assessing Differential Step Functioning in Polytomous Items Using a Common Odds Ratio Estimator. , 2007 .

[38]  N. Nagelkerke,et al.  A note on a general definition of the coefficient of determination , 1991 .

[39]  M Schemper,et al.  Computing measures of explained variation for logistic regression models. , 1999, Computer methods and programs in biomedicine.

[40]  N. Waller EZDIF: Detection of Uniform and Nonuniform Differential Item Functioning With the Mantel-Haenszel and Logistic Regression Procedures , 1998 .

[41]  L. Shepard,et al.  Methods for Identifying Biased Test Items , 1994 .

[42]  Timothy R. Miller,et al.  Logistic Regression and Its Use in Detecting Differential Item Functioning in Polytomous Items , 1996 .

[43]  P. Boeck,et al.  Explanatory item response models : a generalized linear and nonlinear approach , 2004 .

[44]  Martha L. Stocking,et al.  Developing a Common Metric in Item Response Theory , 1983 .

[45]  D. Cella,et al.  Rapid detection of differential item functioning in assessments of health-related quality of life: The Functional Assessment of Cancer Therapy , 2007, Quality of Life Research.

[46]  Mark J. Gierl,et al.  Evaluating Type I Error and Power Rates Using an Effect Size Measure With the Logistic Regression Procedure for DIF Detection , 2001 .

[47]  Howard T. Everson,et al.  Methodology Review: Statistical Approaches for Assessing Measurement Bias , 1993 .

[48]  S. Greenland,et al.  Simulation study of confounder-selection strategies. , 1993, American journal of epidemiology.

[49]  Kurt L. Johnson,et al.  Differential item functioning impact in a modified version of the Roland–Morris Disability Questionnaire , 2007, Quality of Life Research.