This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning language tests, procedures for obtaining greater interand intra-rater reliability on composition tests are noted. An overview is given of recent criticisms of language testing in Japan. Considerations for item writing are examined briefly, and the major item types in language testing are described and discussed, including multiple choice (in reading and listening), matching (reading), a variety of cloze procedures (reading and grammar), scanning (reading), paraphrasing (reading and grammar), information transfer (reading and writing), editing, guided paragraph writing, paragraphs and essays, question and response (listening and speaking), paraphrase (listening), short talks (listening), and using a table (listening). Some simple statistical procedures for determining item difficulty and for discriminating top-scorirg and low-scoring students are also offered. Contains 30 references.
[1]
B. Bloom.
Taxonomy of educational objectives
,
1956
.
[2]
Barry J. Guerrero,et al.
Holistic Evaluation of Writing Samples for Placement in Post-Secondary English Composition Courses.
,
1986
.
[3]
Christine Klein-Braley,et al.
A survey of research on the C-Test1
,
1984
.
[4]
William E. Coffman,et al.
A Comparison of Two Methods of Reading Essay Examinations
,
1968
.
[5]
Miles Myers,et al.
A Procedure for Writing Assessment and Holistic Scoring.
,
1982
.
[6]
Norman E. Gronlund.
Constructing achievement test / Norman E. Gronlund
,
1977
.