Estimating the Standard Error of the Maximum Likelihood Ability Estimator in Adaptive Testing Using the Posterior-Weighted Test Information Function

The standard error of the maximum likelihood ability estimator is commonly estimated by evaluating the test information function at an examinee's current maximum likelihood estimate (a point estimate) of ability. Because the test information function evaluated at the point estimate may differ from the test information function evaluated at an examinee's true ability value, the estimated standard error may be biased under certain conditions. This is of particular concern in adaptive testing because the height of the test information function is expected to be higher at the current estimate of ability than at the actual value of ability. This article proposes using the posterior-weighted test information function in computing the standard error of the maximum likelihood ability estimator for adaptive test sessions. A simulation study showed that the proposed approach provides standard error estimates that are less biased and more efficient than those provided by the traditional point estimate approach.

[1]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[2]  Willem J. van der Linden,et al.  Bayesian item selection criteria for adaptive testing , 1998 .

[3]  Frederic M. Lord MAXIMUM LIKELIHOOD AND BAYESIAN PARAMETER ESTIMATION IN ITEM RESPONSE THEORY , 1986 .

[4]  L. Joseph,et al.  Bayesian Statistics: An Introduction , 1989 .

[5]  J. Douglas Faires,et al.  Numerical Analysis , 1981 .

[6]  Tianyou Wang,et al.  Properties of Ability Estimation Methods in Computerized Adaptive Testing , 1998 .

[7]  R. D. Bock,et al.  Adaptive EAP Estimation of Ability in a Microcomputer Environment , 1982 .

[8]  Randall D. Penfield,et al.  Applying a Weighted Maximum Likelihood Latent Trait Estimator to the Generalized Partial Credit Model , 2005 .

[9]  Howard Wainer,et al.  Computerized Adaptive Testing: A Primer , 2000 .

[10]  D. Deregt,et al.  Introduction and History. , 2008, Bulletin of the Medical Library Association.

[11]  W. Alan Nicewander,et al.  Ability estimation for conventional tests , 1993 .

[12]  J. S. Roberts,et al.  A Unidimensional Item Response Model for Unfolding Responses From a Graded Disagree-Agree Response Scale , 1996 .

[13]  Cornelis A.W. Glas,et al.  Computerized adaptive testing : theory and practice , 2000 .

[14]  Cynthia G. Parshall,et al.  Practical Considerations in Computer-Based Testing , 2002 .

[15]  T. A. Warm Weighted likelihood estimation of ability in item response theory , 1989 .

[16]  Shu-Ying Chen,et al.  A comparison of item selection rules including precision, content, and exposure considerations at the early stages of computerized adaptive testing , 1999 .

[17]  Some Formulas for Use with Bayesian Ability Estimates , 1993 .

[18]  B. G. Dodd,et al.  The Effect of Population Distribution and Method of Theta Estimation on Computerized Adaptive Testing (CAT) Using the Rating Scale Model , 1997 .

[19]  Rob R. Meijer,et al.  Trait Level Estimation for Nonfitting Response Vectors , 1997 .

[20]  Mark D. Reckase,et al.  Item Response Theory: Parameter Estimation Techniques , 1998 .

[21]  Barbara G. Dodd,et al.  A Comparison of Maximum Likelihood Estimation and Expected a Posteriori Estimation in CAT Using the Partial Credit Model , 1998 .

[22]  Hua-Hua Chang,et al.  A Global Information Approach to Computerized Adaptive Testing , 1996 .

[23]  Reducing Bias in CAT Trait Estimation: A Comparison of Approaches , 1999 .