Paper information - Evaluating the quality of medical multiple‐choice items created with automated processes


Computerised assessment raises formidable challenges because it requires large numbers of test items. Automatic item generation (AIG) can help address this test development problem because it yields large numbers of new items both quickly and efficiently. To date, however, the quality of the items produced using a generative approach has not been evaluated. The purpose of this study was to determine whether automatic processes yield items that meet standards of quality that are appropriate for medical testing. Quality was evaluated firstly by subjecting items created using both AIG and traditional processes to rating by a four‐member expert medical panel using indicators of multiple‐choice item quality, and secondly by asking the panellists to identify which items were developed using AIG in a blind review.

Mark J. Gierl | Hollis Lai
