Operational Characteristics of Adaptive Testing Procedures Using the Graded Response Model

The purpose of the present research was to develop general guidelines to assist practitioners in setting up operational computerized adaptive testing (CAT) systems based on the graded response model. Simulated data were used to investigate the effects of systematic manipulation of various aspects of the CAT procedures for the model. The effects of three major variables were examined: item pool size, the stepsize used along the trait continuum until maximum likelihood estimation could be calculated, and the stopping rule employed. The findings suggest three guidelines for graded response CAT procedures: (1) item pools with as few as 30 items may be adequate for CAT; (2) the variable-stepsize method is more useful than the fixed-stepsize methods; and (3) the minimum-standard-error stopping rule will yield fewer cases of nonconvergence, administer fewer items, and produce higher correlations of CAT θ estimates with full-scale estimates and the known θs than the minimum-information stopping rule. The implications of these findings for psychological assessment are discussed. Index terms: computerized adaptive testing, graded response model, item response theory, polychotomous scoring.
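
For readers unfamiliar with the quantities behind the two stopping rules, the following minimal Python sketch may help. It assumes the logistic form of the graded response model with scaling constant 1; the function names, the item parameters, and the standard-error target of 0.30 are illustrative assumptions and are not taken from the study.

```python
import math

def boundary_curves(theta, a, thresholds):
    """Cumulative curves P*(X >= k | theta), padded with 1 and 0 at the ends."""
    inner = [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    return [1.0] + inner + [0.0]

def grm_category_probs(theta, a, thresholds):
    """Category probabilities: differences of adjacent boundary curves."""
    star = boundary_curves(theta, a, thresholds)
    return [star[k] - star[k + 1] for k in range(len(star) - 1)]

def grm_item_information(theta, a, thresholds):
    """Item information: sum over categories of (dP_k/dtheta)^2 / P_k."""
    star = boundary_curves(theta, a, thresholds)
    info = 0.0
    for k in range(len(star) - 1):
        p = star[k] - star[k + 1]
        # Derivative of a logistic boundary curve is a * P* * (1 - P*).
        d = a * (star[k] * (1 - star[k]) - star[k + 1] * (1 - star[k + 1]))
        if p > 0:
            info += d * d / p
    return info

def standard_error(item_infos):
    """Asymptotic SE of the theta estimate from summed item information."""
    return 1.0 / math.sqrt(sum(item_infos))

def stop_min_se(item_infos, se_target=0.30):
    """Minimum-standard-error rule: stop once SE falls to the target.

    se_target is a hypothetical value chosen for illustration.
    """
    return standard_error(item_infos) <= se_target

# Hypothetical 4-category item (a = 1.5, thresholds -1, 0, 1) at theta = 0.
probs = grm_category_probs(0.0, 1.5, [-1.0, 0.0, 1.0])
info = grm_item_information(0.0, 1.5, [-1.0, 0.0, 1.0])
```

Under this sketch, a minimum-information rule would instead compare the largest `grm_item_information` value among the remaining pool items against a cutoff and stop when no item exceeds it.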
