Validation of Performance-Based Assessments

Messick’s (1995, 1996) validity framework is used to outline six aspects of construct validation that can guide the validation of performance-based assessments: content, substantive, structural, generalizability, external, and consequential. Each aspect is discussed with a focus on studies that could be conducted within a large-scale educational assessment. Issues that affect construct validation in that context are also discussed, and recommendations for future areas of study are outlined.
