The increasing demand for performance assessment of speaking skills in second languages has led to logistic complications, for example, the delivery of tests in overseas locations. One solution to the problem has been to train native speaker interlocutors to carry out a series of oral interactions with the candidate, with assessment from audiorecordings of the test session postponed and conducted cen trally by a small team of trained raters. This technique is currently used in two large-scale occupationally related ESP tests administered internationally on behalf of the Australian government. But these procedures raise questions about the effect of such facets of the assessment situation as interlocutor variables and the quality of the audiotape recording. Recent developments in multifaceted Rasch measure ment have significantly broadened the possibilities for investigation of these issues. The research presented in this article investigates potential problems associated with the above approach to the offshore testing of speaking skills. Data from audiotape-based assessments of 70 offshore candidates from two administrations of the Occupational English Test, an advanced-level ESP test for health pro fessionals, are considered. In addition to multiple ratings of candidate perform ance, each recording is rated for perceptions of the competence of the interlocutor, the rapport established between the candidate and the interlocutor, and the audi bility of the interaction. These aspects of the assessment situation are treated as facets in a series of multifaceted Rasch analyses of the data. The results of the analysis reveal the effects of interlocutor variability and audio tape quality on ratings. The article concludes with an evaluation of the overall feasibility of the procedure, and implications for test administration are considered. The study is also a further demonstration of the application of multifaceted Rasch measurement in performance assessment settings.
[1]
Gillian Wigglesworth.
Exploring bias analysis as a tool for improving rater consistency in assessing oral interaction
,
1993
.
[2]
T. McNamara.
Assessing the second language proficiency of health professionals.
,
1990
.
[3]
S. Ross.
Accommodative questions in oral proficiency interviews
,
1992
.
[4]
J. Linacre,et al.
Many-facet Rasch measurement
,
1994
.
[5]
R. Young,et al.
Conversational Styles in Language Proficiency Interviews
,
1995
.
[6]
Steven J. Ross,et al.
The Discourse of Accommodation in Oral Proficiency Interviews
,
1992,
Studies in Second Language Acquisition.
[7]
T. Lumley,et al.
Specific-purpose language performance tests : task and interaction
,
1996
.
[8]
智原 哲郎,et al.
コミュニケーション能力をいかに測るか : The Australian Second Language Proficiency Ratings (ASLPR) を通しての考察
,
1996
.
[9]
Liz Hamp-Lyons,et al.
EXAMINING EXPERT JUDGMENTS OF TASK DIFFICULTY ON ESSAY TESTS
,
1994
.