Comparison of proficiency in an anesthesiology course across distinct medical student cohorts: Psychometric approaches to test equating

Background: Examinations are necessary for assessing student proficiency in medical education, but comparing achievement across cohorts that sat different tests is challenging. We applied psychometric test-equating methods to compare student proficiency in two different examinations for a clinical anesthesiology course.

Methods: The two examinations were administered in 2011 and 2012. Each contained 50 multiple-choice items, and nine items were common to both. This common-item design was used for test equating. Two psychometric test-equating approaches, chained linear equating and item response theory, were used to compare proficiency in anesthesiology across the two student cohorts. Raw scores from the 2012 test were linearly transformed to the 2011 scale using the chained method. Rasch analysis was then applied to calibrate examinee ability and item difficulty in the two examinations on a common scale.

Results: Both the linear equating method and Rasch analysis indicated that students who took the 2011 examination performed better than those who took the 2012 examination (both p < 0.001). In the Rasch analysis, student ability ranged from −0.53 to 4.16, and item difficulty ranged from −5.25 to 6.32 across all items. Mean item difficulty did not differ significantly between the common items and the remaining items of the two examinations.

Conclusion: Both the chained linear equating method and Rasch analysis can be readily applied to practical test-equating problems in medical education, but Rasch analysis was more versatile for test parameter estimation and item bank development for clinical curricula.
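The chained linear step described in the Methods can be sketched as follows. This is a minimal illustration of the general technique under a common-item design, not the authors' implementation: the function names and the toy score lists are hypothetical, and population standard deviations are assumed. A new-form score is first linked to the common-item (anchor) scale using the new cohort, and that anchor value is then linked to the old-form scale using the old cohort.

```python
from statistics import mean, pstdev


def linear_link(score, from_scores, to_scores):
    """Map a score from one scale to another by matching mean and SD."""
    slope = pstdev(to_scores) / pstdev(from_scores)
    return mean(to_scores) + slope * (score - mean(from_scores))


def chained_linear_equate(x, new_total, new_anchor, old_total, old_anchor):
    """Place a new-form raw score x on the old-form scale via the anchor."""
    # Step 1: new form -> anchor scale, estimated in the new cohort.
    v = linear_link(x, new_total, new_anchor)
    # Step 2: anchor scale -> old form, estimated in the old cohort.
    return linear_link(v, old_anchor, old_total)


# Hypothetical toy data (not from the study): total and anchor scores
# for five examinees in each cohort.
new_total = [22, 27, 32, 37, 42]   # e.g. the later examination
new_anchor = [3, 4, 5, 6, 7]
old_total = [25, 30, 35, 40, 45]   # e.g. the earlier examination
old_anchor = [4, 5, 6, 7, 8]

for raw in (32, 42):
    equated = chained_linear_equate(
        raw, new_total, new_anchor, old_total, old_anchor
    )
    print(f"new-form raw {raw} -> old-form scale {equated:.2f}")
```

In this sketch the two cohorts are never compared directly on total scores; the anchor items carry all the information about how the cohorts and the forms differ, which is why the choice and number of common items matters.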
