Hybrid Computerized Adaptive Testing: From Group Sequential Design to Fully Sequential Design.

Computerized adaptive testing (CAT) and multistage testing (MST) have become two of the most popular modes in large-scale computer-based sequential testing.  Though most designs of CAT and MST exhibit strength and weakness in recent large-scale implementations, there is no simple answer to the question of which design is better because different modes may fit different practical situations. This article proposes a hybrid adaptive framework to combine both CAT and MST, inspired by an analysis of the history of CAT and MST. The proposed procedure is a design which transitions from a group sequential design to a fully sequential design. This allows for the robustness of MST in early stages, but also shares the advantages of CAT in later stages with fine tuning of the ability estimator once its neighborhood has been identified. Simulation results showed that hybrid designs following our proposed principles provided comparable or even better estimation accuracy and efficiency than standard CAT and MST designs, especially for examinees at the two ends of the ability range.

[1]  Bert F. Green A Comment on Early Student Blunders on Computer-Based Adaptive Tests , 2011 .

[2]  Z. Ying,et al.  a-Stratified Multistage Computerized Adaptive Testing with b Blocking , 2001 .

[3]  R. D. Bock,et al.  Adaptive EAP Estimation of Ability in a Microcomputer Environment , 1982 .

[4]  Hua-Hua Chang,et al.  A Global Information Approach to Computerized Adaptive Testing , 1996 .

[5]  Donald A. Rock,et al.  REPRODUCTION OF TOTAL TEST SCORE THROUGH THE USE OF SEQUENTIAL PROGRAMMED TESTS1 , 1968 .

[6]  Liane Nicole Patsula,et al.  A comparison of computerized adaptive testing and multi-stage testing. , 1999 .

[7]  Cynthia G. Parshall,et al.  New Algorithms for Item Selection and Exposure Control with Computerized Adaptive Testing. , 1995 .

[8]  Hua-Hua Chang,et al.  To Weight or Not to Weight? Balancing Influence of Initial Items in Adaptive Testing , 2007 .

[9]  R. Luecht Computer-Assisted Test Assembly Using Optimization Heuristics , 1998 .

[10]  Howard Wainer,et al.  Item Clusters and Computerized Adaptive Testing: A Case for Testlets , 1987 .

[11]  Wim J. van der Linden,et al.  Multidimensional Adaptive Testing with Optimal Design Criteria for Item Selection , 2008, Psychometrika.

[12]  Yi Zheng,et al.  On-the-Fly Assembled Multistage Adaptive Testing , 2015, Applied psychological measurement.

[13]  Kyung T. Han,et al.  An Efficiency Balanced Information Criterion for Item Selection in Computerized Adaptive Testing. , 2012 .

[14]  Dmitry I. Belov,et al.  A Monte Carlo Approach to the Design, Assembly, and Evaluation of Multistage Adaptive Tests , 2008 .

[15]  Richard M. Luecht,et al.  Some Practical Examples of Computer‐Adaptive Sequential Testing , 1998 .

[16]  Z. Ying,et al.  a-Stratified Multistage Computerized Adaptive Testing , 1999 .

[17]  Hua-Hua Chang,et al.  The maximum priority index method for severely constrained item selection in computerized adaptive testing. , 2009, The British journal of mathematical and statistical psychology.

[18]  Roger J. Owen A BAYESIAN APPROACH TO TAILORED TESTING , 1969 .

[19]  Dmitry I. Belov,et al.  Monte Carlo Test Assembly for Item Pool Analysis and Extension , 2005 .

[20]  Willem J. van der Linden,et al.  Bayesian item selection criteria for adaptive testing , 1998 .

[21]  Melvin R. Novick,et al.  Some latent train models and their use in inferring an examinee's ability , 1966 .

[22]  Robert J. Mislevy,et al.  A Consumer's Guide to LOGIST and BILOG , 1987 .

[23]  Ronald K. Hambleton,et al.  Optimal and Nonoptimal Computer-Based Test Designs for Making Pass–Fail Decisions , 2006 .

[24]  Stephen G. Sireci,et al.  A Review of Models for Computer-Based Testing , 2012 .

[25]  Ronald K. Hambleton,et al.  Comparison of the Psychometric Properties of Several Computer-Based Test Designs for Credentialing Exams With Multiple Purposes , 2002 .

[26]  F. Samejima Estimation of latent ability using a response pattern of graded scores , 1968 .

[27]  Kelly L. Rulison,et al.  I've Fallen and I Can't Get Up: Can High-Ability Students Recover From Early Mistakes in CAT? , 2009, Applied psychological measurement.

[28]  M. Stocking,et al.  A Model and Heuristic For Solving Very Large Item Selection Problems , 1993 .

[29]  Hua-Hua Chang,et al.  Psychometrics Behind Computerized Adaptive Testing , 2015, Psychometrika.

[30]  Frederic M. Lord Robbins-Monro Procedures for Tailored Testing , 1971 .

[31]  Wim J. van der Linden,et al.  A Model for Optimal Constrained Adaptive Testing , 1998 .

[32]  W. J. J. Veerkamp,et al.  Some New Item Selection Criteria for Adaptive Testing , 1994 .

[33]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[34]  R. Owen,et al.  A Bayesian Sequential Procedure for Quantal Response in the Context of Adaptive Mental Testing , 1975 .