Assessing vocabulary size through multiple-choice formats : Issues with guessing and sampling rates

In most tests of vocabulary size, knowledge is assessed through multiple-choice formats. Despite advantages such as ease of scoring, multiple-choice tests (MCT) are accompanied with problems. One of the more central issues has to do with guessing and the presence of other construct-irrelevant strategies that can lead to overestimation of scores. A further challenge when designing vocabulary size tests is that of sampling rate. How many words constitute a representative sample of the underlying population of words that the test is intended to measure? This paper addresses these two issues through a case study based on data from a recent and increasingly used MCT of vocabulary size: the Vocabulary Size Test. Using a criterion-related validity approach, our results show that for multiple-choice items sampled from this test, there is a discrepancy between the test scores and the scores obtained from the criterion measure, and that a higher sampling rate would be needed in order to better represent knowledge of the underlying population of words. We offer two main interpretations of these results, and discuss their implications for the construction and use of vocabulary size tests.

[1]  Lyle F. Bachman 语言测试要略 = Fundamental considerations in language testing , 1990 .

[2]  N. Schmitt Researching Vocabulary: A Vocabulary Research Manual , 2010 .

[3]  André A. Rupp,et al.  How assessing reading comprehension with multiple-choice questions shapes the construct: a cognitive processing perspective , 2006 .

[4]  P. Meara,et al.  Beyond A Clockwork Orange: Acquiring Second Language Vocabulary through Reading. , 1998 .

[5]  John Read,et al.  Assessing Vocabulary by John Read , 2000 .

[6]  D. Beglar A Rasch-based validation of the Vocabulary Size Test , 2010 .

[7]  R. Waring,et al.  AT WHAT RATE DO LEARNERS LEARN AND RETAIN NEW VOCABULARY FROM READING A GRADED READER , 2001 .

[8]  Xian Zhang The I Don't Know Option in the Vocabulary Size Test. , 2013 .

[9]  Norbert Schmitt,et al.  A reassessment of frequency and vocabulary size in L2 vocabulary teaching1 , 2012, Language Teaching.

[10]  Norbert Schmitt,et al.  The Word Associates Format: Validation evidence , 2011 .

[11]  B. Laufer,et al.  Testing Vocabulary Knowledge: Size, Strength, and Computer Adaptiveness. , 2004 .

[12]  Irina Elgort Deliberate Learning and Vocabulary Acquisition in a Second Language , 2011 .

[13]  I.S.P. Nation,et al.  Vocabulary learning and reading , 1978 .

[14]  N. Schmitt,et al.  Developing and exploring the behaviour of two new versions of the Vocabulary Levels Test , 2001 .

[15]  P. Meara,et al.  An alternative to multiple choice vocabulary tests , 1987 .

[16]  M. Wesche,et al.  Assessing Second Language Vocabulary Knowledge: Depth Versus Breadth. , 1996 .

[17]  Paul Meara,et al.  Scores on a yes-no vocabulary test: correction for guessing and response style , 2002 .

[18]  S. Messick Validity of Psychological Assessment: Validation of Inferences from Persons' Responses and Performances as Scientific Inquiry into Score Meaning. Research Report RR-94-45. , 1994 .

[19]  I. Nation How Large a Vocabulary Is Needed for Reading and Listening? , 2006 .

[20]  Norbert Schmitt,et al.  Jumping from the highest graded readers to ungraded novels: Four case studies , 2014 .

[21]  M. Kane Validating the Interpretations and Uses of Test Scores , 2013 .

[22]  Norbert Schmitt,et al.  Scoring Yes–No vocabulary tests: Reaction time vs. nonword approaches , 2012 .

[23]  Irina Elgort,et al.  Effects of L1 definitions and cognate status of test items on the Vocabulary Size Test , 2013 .

[24]  Jie Li Focus on Vocabulary , 2008 .

[25]  J. Charles Alderson,et al.  Diagnosing Foreign Language Proficiency: The Interface between Learning and Assessment , 2005 .

[26]  Jeffrey Stewart,et al.  Do Multiple-Choice Options Inflate Estimates of Vocabulary Size on the VST? , 2014 .

[27]  Paul Nation,et al.  Using dictionaries to estimate vocabulary size: essential, but rarely followed, procedures , 1993 .

[28]  Paul Meara,et al.  RESEARCHING VOCABULARY THROUGH A WORD KNOWLEDGE FRAMEWORK , 1997, Studies in Second Language Acquisition.

[29]  Jeffrey Stewart,et al.  A Multiple-Choice Test of Active Vocabulary Knowledge , 2012 .

[30]  Lars Stenius Stæhr VOCABULARY KNOWLEDGE AND ADVANCED LISTENING COMPREHENSION IN ENGLISH AS A FOREIGN LANGUAGE , 2009, Studies in Second Language Acquisition.

[31]  Jeffrey Stewart,et al.  Estimating Guessing Effects on the Vocabulary Levels Test for Differing Degrees of Word Knowledge , 2011 .

[32]  S. Embretson The new rules of measurement. , 1996 .

[33]  J. Read,et al.  Validating a Test to Measure Depth of Vocabulary Knowledge , 2013 .

[34]  Cyril J. Weir,et al.  Language Testing and Validation , 2005 .