Performance-Based Assessments

By the time children in the United States reach seventh grade, half declare no interest in science (Office of Science and Technology Policy, 1991). Among girls, this disinterest appears to be particularly pronounced (S. Johnson, 1987; Jones, Mullis, Raizen, Weiss, & Weston, 1992). At the same time, girls’ and boys’ performances on standardized tests of science achievement begin to diverge with girls falling behind boys. This fact is well supported by numerous large-scale studies such as the International Association for the Evaluation of Educational Achievement or IEA (1988), the National Assessment of Educational Progress (NAEP) 1970-1986 (Mullis & Jenkins, 1988), and the British Columbia Science Assessments (Bateson & Parsons-Chatman, 1989). In the science classroom, however, girls perform as well, or better than, boys (Maccoby & Jacklin, 1974). Therefore, standardized tests are thought to under-predict girls’ science achievement (Linn, 1991). Although this gender disparity has been attributed to several factors, there is considerable concern that the difference may be an artifact of the method of measurement (Bateson & Parsons-Chatman, 1989; Bolger & Kellaghan, 1990). That is, there is something about the test itself that puts girls at a disadvantage. Girls’ lower test scores, in turn, are thought to undermine their self-perceptions of competence, leading to their disinterest in science and eventual drop from the science “pipeline” (Oaks, 1990; Rosser et al., 1989). Included in the current reform rhetoric is the need to change the method by which we evaluate students’ achievements. To do so will “open gates of opportunity rather than close them off” (National Commission on Testing and Public Policy, 1990, p. x). The belief seems to be that by replacing traditional assessment methods with new alternative methods such as performance-based assessments, the gender bias in testing may be eliminated (Jenkins & MacDonald, 1989; National Center for Improving Science Education, 1989). This article begins by reviewing what we know about gender differences on traditional tests of science achievement and what is hoped to be gained by changing to performance-based assessments. Then, as an initial look at the effect of new forms of testing on males’ and females’ science achievement, their scores on performance-based assessments are compared. Finally, these findings are discussed in the context of science reform.

[1]  J. Norcini,et al.  Factors influencing reproducibility of tests using standardized patients , 1989 .

[2]  Richard J. Shavelson,et al.  Indicators of Science Achievement: Options for a Powerful Policy Instrument. , 1990 .

[3]  The 1990 Science Report Card. NAEP'S Assessment of Fourth, Eighth, and Twelfth Graders. , 1992 .

[4]  Alison Kelly Sex Stereotypes and School Science: A Three Year Follow-Up. , 1988 .

[5]  Gerald Kulm,et al.  Science Assessment in the Service of Reform. , 1991 .

[6]  G. Erickson,et al.  Females and Science Achievement: Evidence, Explanations, and Implications. , 1984 .

[7]  N. Frederiksen The real test bias: Influences of testing on teaching and learning. , 1984 .

[8]  Abigail M. Harris,et al.  Gender differences in national assessment of educational progress science items: What does i don't know really mean? , 1987 .

[9]  Gail P. Baxter,et al.  Science performance assessments: benchmarks and surrogates , 1994 .

[10]  L. Jacoby,et al.  Knowledge and clinical problem‐solving , 1985, Medical education.

[11]  R. Shavelson,et al.  Research news and Comment: Performance Assessments , 1992 .

[12]  C. Dweck,et al.  An Analysis of Learned Helplessness: Continuous Changes in Performance, Strategy, and Achievement Cognitions Following Failure. , 1978 .

[13]  Jere Brophy,et al.  Teacher-student relationships: causes and consequences , 1974 .

[14]  Herbert J. Walberg,et al.  Improving science education , 1995 .

[15]  D. Monty Neill,et al.  Standardized Testing: Harmful to Educational Health. , 1989 .

[16]  Sandra Johnson Gender differences in science: parallels in interest, experience and performance , 1987 .

[17]  Niall Bolger,et al.  Method of Measurement and Gender Differences in Scholastic Achievement , 1990 .

[18]  Sex‐related differences in science achievement: a possible testing artefact , 1989 .

[19]  G. Hanna Sex Differences in the Mathematics Achievement of Eighth Graders in Ontario. , 1986 .

[20]  S. Minden,et al.  The missing half girls and science education , 1982 .

[21]  L. Wilkinson,et al.  Gender Influences in Classroom Interaction , 1985 .

[22]  C. P. M. Vleuten,et al.  Reliability and feasibility of measuring medical interviewing skills: the revised Maastricht History‐Taking and Advice Checklist , 1991, Medical education.

[23]  W. B. MacDonald,et al.  Science Teaching in the Spirit of Science. , 1989 .

[24]  M. Slakter THE EFFECT OF GUESSING STRATEGY ON OBJECTIVE TEST SCORES1 , 1968 .

[25]  3 – Listening to Adolescents: Gender Differences in Science Classroom Interaction* , 1985 .

[26]  Julianne C. Turner,et al.  A Developmental Perspective on Standardized Achievement Testing , 1991 .

[27]  R. Shavelson,et al.  Sampling Variability of Performance Assessments. , 1993 .

[28]  R. Slavin PET and the Pendulum: Faddism in Education and how to Stop It. , 1989 .

[29]  Sandra Johnson,et al.  The underachievement of girls in physics: Towards explanations , 1984 .

[30]  Jane Butler Kahle,et al.  The myth of equality in science classrooms , 1983 .

[31]  Assessment in elementary school science education , 1989 .

[32]  G. Rowley WHICH EXAMINEES ARE MOST FAVOURED BY THE USE OF MULTIPLE CHOICE TESTS , 1974 .

[33]  G. Stechler,et al.  The development of the self: a psychoanalytic perspective. , 1980, The Psychoanalytic study of the child.

[34]  Ina V. S. Mullis,et al.  The Science Report Card: Elements of Risk and Recovery. Trends and Achievement Based on the 1986 National Assessment. , 1988 .

[35]  R. Shavelson Performance Assessment in Science , 1991 .

[36]  Norman Frederiksen,et al.  THE REAL TEST BIAS , 1981 .

[37]  R. Murphy SEX DIFFERENCES IN OBJECTIVE TEST PERFORMANCE , 1982 .

[38]  R. Shavelson Performance Assessments: Political Rhetoric and Measurement Reality , 1992 .

[39]  B. Weiner Principles for a Theory of Motivation , 1986 .

[40]  Phyllis C. Blumenfeld,et al.  5 – Classroom Experiences and Student Gender: Are There Differences and Do They Matter?* , 1985 .

[41]  Susan R. Goldman,et al.  Evaluation of Procedure-Based Scoring for Hands-On Science Assessment , 1992 .

[42]  Kenneth Tobin,et al.  Windows into Science Classrooms: Problems Associated with Higher-Level Cognitive Learning. , 1990 .

[43]  Jeannie Oakes,et al.  Lost Talent: The Underparticipation of Women, Minorities, and Disabled Persons in Science , 1990 .

[44]  E. Maccoby,et al.  The Psychology of Sex Differences , 1974 .

[45]  Richard J. Shavelson,et al.  Indicators for Monitoring Mathematics and Science Education , 1989 .

[46]  Pinchas Tamir An Inquiry Oriented Laboratory Examination. , 1974 .