The Critical Role of Anchor Paper Selection in Writing Assessment

Scoring rubrics are routinely used to evaluate the quality of writing samples produced for writing performance assessments, with anchor papers chosen to exemplify the score points defined in the rubric. Although careful selection of anchor papers is considered a best practice for scoring, little research has examined the role of anchor paper selection in writing assessment. This study examined the consequences of selecting different anchor papers to represent a common scoring rubric. A set of writing samples was scored under two conditions: one using anchor papers selected from within a single grade level and one using anchor papers selected from across three grade levels. The observed ratings were analyzed with three- and four-facet Rasch (one-parameter logistic) models. Ratings from the two conditions differed in both magnitude and rank order, and this difference is presumed to reflect the anchor paper conditions rather than a difference in overall severity between the rater groups. The results shed light on potential threats to validity in conventional context-dependent scoring practices and raise issues that have not previously been investigated with respect to the selection of anchor papers, including the interpretation of results at different grade levels, implications for assessing progress over time, and the reliability of anchor paper selection within a scoring context.
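For readers unfamiliar with the facets approach, the general form of a many-facet Rasch rating scale model can be sketched as follows. This is an illustrative formulation rather than the authors' exact specification; the particular facets shown (rater, writing domain) are assumptions for the sake of the example, and a fourth facet (such as the anchor paper condition) would simply add another term on the right-hand side:

log( P_nijk / P_nij(k-1) ) = B_n - C_i - D_j - F_k

where P_nijk is the probability that examinee n receives a rating in category k from rater i on domain j, B_n is the examinee's writing ability, C_i is the severity of rater i, D_j is the difficulty of domain j, and F_k is the threshold governing the step from category k-1 to category k.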
