Item selection and ability estimation adaptive testing

The last century saw a tremendous progression in the refinement and use of standardized linear tests. The first administered College Board exam occurred in 1901 and the first Scholastic Assessment Test (SAT) was given in 1926. Since then, progressively more sophisticated standardized linear tests have been developed for a multitude of assessment purposes, such as college placement, professional licensure, higher-education admissions, and tracking educational standing or progress. Standardized linear tests are now administered around the world. For example, the Test of English as a Foreign Language (TOEFL) has been delivered in approximately 88 countries.

[1]  Frederic M. Lord THE SELF‐SCORING FLEXILEVEL TEST* , 1970 .

[2]  Angela J. Verschoor,et al.  Optimal Testing With Easy or Difficult Items in Computerized Adaptive Testing , 2006 .

[3]  R. J. De Ayala,et al.  A Comparison of the Partial Credit and Graded Response Models in Computerized Adaptive Testing , 1992 .

[4]  Willem J. van der Linden,et al.  Bayesian item selection criteria for adaptive testing , 1998 .

[5]  Hua-Hua Chang,et al.  A Global Information Approach to Computerized Adaptive Testing , 1996 .

[6]  Hua-Hua Chang,et al.  The asymptotic posterior normality of the latent trait in an IRT model , 1993 .

[7]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[8]  Frederic M. Lord,et al.  THE SELF‐SCORING FLEXILEVEL TEST1 , 1971 .

[9]  Deborah L. Schnipke,et al.  A Comparison of Item Selection Routines in Linear and Adaptive Tests , 1995 .

[10]  R. D. Bock,et al.  Adaptive EAP Estimation of Ability in a Microcomputer Environment , 1982 .

[11]  Frederic M. Lord,et al.  MAXIMUM LIKELIHOOD AND BAYESIAN PARAMETER ESTIMATION IN ITEM RESPONSE THEORY , 1984 .

[12]  T. A. Warm Weighted likelihood estimation of ability in item response theory , 1989 .

[13]  Wim J. van der Linden,et al.  Empirical Initialization of the Trait Estimator in Adaptive Testing , 1999 .

[14]  J. S. Roberts,et al.  Computerized Adaptive Testing with the Generalized Graded Unfolding Model , 2001 .

[15]  H. Gulliksen Theory of mental tests , 1952 .

[16]  Fumiko Samejima The bias function of the maximum likelihood estimate of ability for the dichotomous response level , 1993 .

[17]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[18]  Georg Rasch,et al.  Probabilistic Models for Some Intelligence and Attainment Tests , 1981, The SAGE Encyclopedia of Research Design.

[19]  David M. Williamson,et al.  Calibrating Item Families and Summarizing the Results Using Family Expected Response Functions , 2003 .

[20]  R. H. Klein Entink,et al.  A Multivariate Multilevel Approach to the Modeling of Accuracy and Speed of Test Takers , 2008, Psychometrika.

[21]  H. Holling,et al.  Explaining and Controlling for the Psychometric Properties of Computer-Generated Figural Matrix Items , 2008 .

[22]  Cornelis A.W. Glas,et al.  Cross-validating item parameter estimation in computerized adaptive testing , 2001 .

[23]  B. T. Hemker,et al.  Evaluation of Selection Procedures for Computerized Adaptive Testing with Polytomous Items , 2002 .

[24]  Fumiko Samejima,et al.  A comment on Birnbaum's three-parameter logistic model in the latent trait theory , 1973 .

[25]  Cornelis A.W. Glas,et al.  Modeling Rule-Based Item Generation , 2011 .

[26]  Daniel O. Segall,et al.  Equating the CAT-ASVAB. , 1997 .

[27]  Cornelis A.W. Glas,et al.  Cross-Validating Item Parameter Estimation in Adaptive Testing , 2001 .

[28]  Wim J. van der Linden,et al.  Computerized Adaptive Testing With Item Cloning , 2003 .

[29]  Willem J. van der Linden,et al.  Using Response Times for Item Selection in Adaptive Testing , 2008 .

[30]  Robert J. Mislevy,et al.  Bayes modal estimation in item response models , 1986 .

[31]  R. J. De Ayala The nominal response model in computerized adaptive testing , 1992 .

[32]  Reducing Bias in CAT Trait Estimation: A Comparison of Approaches , 1999 .

[33]  Z. Ying,et al.  a-Stratified Multistage Computerized Adaptive Testing , 1999 .

[34]  J WIM,et al.  A HIERARCHICAL FRAMEWORK FOR MODELING SPEED AND ACCURACY ON TEST ITEMS , 2007 .

[35]  Roger J. Owen A BAYESIAN APPROACH TO TAILORED TESTING , 1969 .

[36]  Melvin R. Novick,et al.  Some latent train models and their use in inferring an examinee's ability , 1966 .

[37]  van der Linden,et al.  A hierarchical framework for modeling speed and accuracy on test items , 2007 .

[38]  Wim J. J. Veerkamp,et al.  A comparison of different item selection criteria for adaptive testing , 1994 .

[39]  Martha L. Stocking,et al.  An Alternative Method for Scoring Adaptive Tests , 1996 .

[40]  David J. Weiss,et al.  Bias and Information of Bayesian Adaptive Testing , 1984 .

[41]  Cornelis A.W. Glas,et al.  Computerized adaptive testing with item clones , 2001 .

[42]  Cornelis A.W. Glas,et al.  Modeling Variability in Item Parameters in Item Response Models. Research Report. , 2001 .

[43]  Tianyou Wang,et al.  Properties of Ability Estimation Methods in Computerized Adaptive Testing , 1998 .

[44]  R. Owen,et al.  A Bayesian Sequential Procedure for Quantal Response in the Context of Adaptive Mental Testing , 1975 .

[45]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[46]  Howard Wainer,et al.  Building Algebra Testlets: A Comparison of Hierarchical and Linear Structures. , 1991 .

[47]  Hua-Hua Chang,et al.  To Weight or Not to Weight? Balancing Influence of Initial Items in Adaptive Testing , 2007 .

[48]  David J. Weiss,et al.  Improving Measurement Quality and Efficiency with Adaptive Testing , 1982 .

[49]  R. Tsutakawa,et al.  The effect of uncertainty of item parameter estimation on ability estimates , 1990 .

[50]  Wim J. van der Linden,et al.  Capitalization on Item Calibration Error in Adaptive Testing , 1998 .

[51]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[52]  Heinz Holling,et al.  Automatic item generation of probability word problems , 2009 .

[53]  Barbara G. Dodd,et al.  A Comparison of Maximum Likelihood Estimation and Expected a Posteriori Estimation in CAT Using the Partial Credit Model , 1998 .

[54]  Cornelis A.W. Glas,et al.  Statistical aspects of adaptive testing , 2006 .

[55]  W. J. J. Veerkamp,et al.  Some New Item Selection Criteria for Adaptive Testing , 1994 .

[56]  Z. Ying,et al.  Nonlinear sequential designs for logistic item response theory models with applications to computerized adaptive tests , 2009, 0906.1859.