Raw-Score Conditional Standard Errors of Measurement in Generalizability Theory

A comprehensive, integrated treatment is provided of both conditional absolute (A-type) standard errors of measurement (SEM) and conditional relative (a-type) SEMs from the perspective of generalizability theory. Results are provided for univariate singlefacet designs, multivariate single-facet designs, and designs with multiple random facets. Some previously derived conditional SEMs are shown to be special cases of results derived here. Average values (over examinees) of certain conditional SEMs are shown to be related to the error variances in coefficient a and stratified a. It is shown that the conditional A-type SEM is the standard error of the mean for the within-person design. As such, it is unaffected by the across-persons design and relatively easy to estimate. By contrast, the conditional 8-type SEM is necessarily influenced by the across-persons design and often quite complicated to estimate, especially for multifacet designs. Almost all estimators are illustrated with data from the Iowa Tests of Basic Skills, the Iowa Tests of Educational Development, the Iowa Writing Assessment, and the QUASAR project. These examples support the conclusion that both types of conditional SEMs tend to be smaller at the extremes of the score scale than in the middle. Further, these examples suggest that a concave-down quadratic function fits the estimates quite well in a wide variety of cases.

[1]  F. Lord Applications of Item Response Theory To Practical Testing Problems , 1980 .

[2]  L. S. Feldt Some Relationships between the Binomial Error Model and Classical Test Theory , 1984 .

[3]  L. S. Feldt,et al.  Approximating Scale Score Standard Error of Measurement From the Raw Score Standard Error , 1998 .

[4]  L. S. Feldt,et al.  Estimation of Measurement Error Variance at Specific Score Levels , 1996 .

[5]  M. W. Richardson,et al.  The theory of the estimation of test reliability , 1937 .

[6]  R. Brennan,et al.  Signal/noise ratios for domain-referenced tests , 1978 .

[7]  Audrey L. Quails-Payne A Comparison of Score Level Estimates of the Standard Error of Measurement , 1992 .

[8]  Robert L. Brennan,et al.  A Variance Components Model for Measurement Procedures Associated with a Table of Specifications , 1982 .

[9]  F. Lord DO TESTS OF THE SAME LENGTH HAVE THE SAME STANDARD ERROR OF MEASUREMENT , 1956 .

[10]  Samuel A. Livingston ESTIMATION OF THE CONDITIONAL STANDARD ERROR OF MEASUREMENT FOR STRATIFIED TESTS , 1982 .

[11]  M. J. Kolen,et al.  Conditional Standard Errors of Measurement for Scale Scores Using IRT , 1996 .

[12]  R. Brennan Elements of generalizability theory , 1983 .

[13]  Robert L. Brennan,et al.  Conditional standard errors of measurement for scale scores using binomial and compund binomial assu , 1992 .

[14]  D. Jarjoura An Estimator of Examinee-Level Measurement Error Variance That Considers Test Form Difficulty Adjustments , 1986 .

[15]  F. Graybill,et al.  Theorems Concerning Eisenhart's Model II , 1961 .

[16]  C. Hoyt Test reliability estimated by analysis of variance , 1941 .

[17]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[18]  Mei Liu,et al.  Generalizability and Validity of a Mathematics Performance Assessment , 1996 .

[19]  R. Linn Educational measurement, 3rd ed. , 1989 .

[20]  Donald B. Rubin,et al.  The Dependability of Behavioral Measurements: Theory of Generalizability for Scores and Profiles. , 1974 .

[21]  J. Keats Estimation of error variances of test scores , 1957 .

[22]  W. Mollenkopf Variation of the standard error of measurement , 1949, Psychometrika.

[23]  M. R. Novick,et al.  Statistical Theories of Mental Test Scores. , 1971 .

[24]  F. Lord Estimating Test Reliability , 1955 .

[25]  L. Cronbach,et al.  Generalizability of stratified-parallel tests , 1965, Psychometrika.

[26]  L. S. Feldt Confidence Intervals for the Proportion of Mastery in Criterion‐Referenced Measurement , 1996 .

[27]  F. Lord A strong true-score theory, with applications. , 1965, Psychometrika.

[28]  Cyril Burt,et al.  Fundamentals of Statistics. , 1948 .