论文信息 - Generalizability and Validity of a Mathematics Performance Assessment

Generalizability and Validity of a Mathematics Performance Assessment

The QUASAR Cognitive Assessment Instrument (QCAI) is designed to measure program outcomes and growth in mathematics. It consists of a relatively large set of open-ended tasks that assess mathematical problem solving, reasoning, and communication at the middle-school grade levels. This study provides some evidence for the generalizability and validity of the assessment. The results from the generalizability studies indicate that the error due to raters is minimal, whereas there is considerable differential student performance across tasks. The dependability of grade level scores for absolute decision making is encouraging; when the number of students is equal to 350, the coefficients are between .80 and .97 depending on the form and grade level. As expected, there tended to be a higher relationship between the QCAI scores and both the problem solving and conceptual subtest scores from a mathematics achievement multiple-choice test than between the QCAI scores and the mathematics computation subtest scores.

[1] R. Shavelson. Performance Assessment in Science , 1991 .

[2] Suzanne Lane,et al. Use of Generalizability Theory for Estimating the Dependability of a Scoring System for Sample Essays , 1989 .

[3] Robert L. Linn,et al. Educational Assessment: Expanded Expectations and Challenges , 1993 .

[4] J. Frederiksen,et al. A Systems Approach to Educational Testing , 1989 .

[5] Stephen B. Dunbar,et al. Complex, Performance-Based Assessment: Expectations and Validation Criteria , 1991 .

[6] Mei Liu,et al. Reliability and validity of a mathematics performance assessment , 1994 .

[7] Samuel Messick,et al. The Interplay of Evidence and Consequences in the Validation of Performance Assessments. Research Report. , 1992 .

[8] Stephen B. Dunbar,et al. Quality Control in the Development and Use of Performance Assessments , 1991 .