Vertical Scaling in Standards-Based Educational Assessment and Accountability Systems January 2007

This paper describes vertical scaling as a method for linking a set of test forms of increasing difficulty, and explores the applicability of these methods for standards-based educational achievement tests under status-based and growth-based accountability priorities. The purpose of the paper is to inform state policy-makers and assessment and accountability specialists about vertical scaling methods, and the potential advantages and limitations of vertical scaling in standards-based educational accountability programs. The possible application of vertical scales to support growth models is discussed, as well as alternatives to vertical scaling that meet accountability system needs.

[1]  L. L. Thurstone,et al.  The absolute zero in intelligence measurement. , 1928 .

[2]  H. Gulliksen Theory of mental tests , 1952 .

[3]  Melvin R. Novick,et al.  Some latent train models and their use in inferring an examinee's ability , 1966 .

[4]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[5]  Georg Rasch,et al.  Probabilistic Models for Some Intelligence and Attainment Tests , 1981, The SAGE Encyclopedia of Research Design.

[6]  R. Hambleton,et al.  Item Response Theory: Principles and Applications , 1984 .

[7]  Edward H. Haertel Report on TRP Analyses of Issues Concerning Within-Age versus Cross-Age Scales for the National Assessment of Educational Progress. , 1991 .

[8]  D. M. Lewis,et al.  Standard setting : A bookmark approach , 1996 .

[9]  H. Huynh On Score Locations of Binary and Partial Credit Items and Their Applications to Item Mapping and Criterion-Referenced Interpretation , 1998 .

[10]  Meryl W. Bertenthal,et al.  Uncommon Measures: Equivalence and Linkage among Educational Tests. , 1999 .

[11]  Yeow Meng Thum,et al.  Setting Performance Standards: Concepts, Methods, and Perspectives , 2003 .

[12]  Thakur B. Karkee,et al.  Separate versus Concurrent Calibration Methods in Vertical Scaling. , 2003 .

[13]  R. Brennan,et al.  Test Equating, Scaling, and Linking: Methods and Practices , 2004 .

[14]  G. Cizek Adapting Testing Technology to Serve Accountability Aims: The Case of Vertically Moderated Standard Setting , 2005 .

[15]  Richard J. Patz,et al.  30 Vertical Scaling: Statistical Models for Measuring Growth and Achievement , 2006 .

[16]  Huynh Huynh,et al.  A Clarification on the Response Probability Criterion RP67 for Standard Settings Based on Bookmark and Item Mapping. , 2006 .

[17]  Lihua Yao,et al.  Methods and Models for Vertical Scaling , 2007 .