The effect of corpus size in predicting reaction time in a basic word recognition task: Moving on from Kučera and Francis

Word frequency is one of the strongest determiners of reaction time (RT) in word recognition tasks; it is an important theoretical and methodological variable. The Kučera and Francis (1967) word frequency count (derived from the 1-million-word Brown corpus) is used by most investigators concerned with the issue of word frequency. Word frequency estimates from the Brown corpus were compared with those from a 131-million-word corpus (the HAL corpus; conversational text gathered from Usenet) in a standard word naming task with 32 subjects. RT was predicted equally well by both corpora for high-frequency words, but the larger corpus provided better predictors for low- and medium-frequency words. Furthermore, the larger corpus provides estimates for 97,261 lexical items; the smaller corpus, for 50,406 items.

[1]  Curt Burgess,et al.  Modelling Parsing Constraints with High-dimensional Context Space , 1997 .

[2]  J. M. Cattell THE TIME IT TAKES TO SEE AND NAME OBJECTS , 1886 .

[3]  Maryellen C. MacDonald,et al.  The lexical nature of syntactic ambiguity resolution , 1994 .

[4]  G. Yeni-Komshian,et al.  Word frequency and age effects in normally developing children's phonological processing. , 1996, Journal of speech and hearing research.

[5]  Eugene A. Lovelace,et al.  On using norms for low-frequency words , 1988 .

[6]  A. Rudell Frequency of word usage and perceived word difficulty: Ratings of Kučera and Francis words , 1993 .

[7]  J. Pujol,et al.  Impaired estimation of word occurrence frequency in frontal lobe patients , 1997, Neuropsychologia.

[8]  Maryellen C. MacDonald,et al.  Probabilistic constraints and syntactic ambiguity resolution , 1994 .

[9]  Michael Garman,et al.  Psycholinguistics: Accessing the mental lexicon , 1990 .

[10]  John J. L. Morton,et al.  Interaction of information in word recognition. , 1969 .

[11]  H J Dupuy,et al.  The rationale, development, and standardization of a basic word vocabulary test. , 1974, Vital and health statistics. Series 2, Data evaluation and methods research.

[12]  Michael K. Tanenhaus,et al.  Lexical structure and language comprehension , 1989 .

[13]  Allen R. Dobbs,et al.  Frequency Effects in Lexical Decisions: A Test of the Verification Model , 1985 .

[14]  W. Nelson Francis,et al.  FREQUENCY ANALYSIS OF ENGLISH USAGE: LEXICON AND GRAMMAR , 1983 .

[15]  H. Breland Word Frequency and Word Difficulty: A Comparison of Counts in Four Corpora , 1996 .

[16]  Lance J. Rips,et al.  Structure and process in semantic memory: A featural model for semantic decisions. , 1974 .

[17]  Christine Chiarello,et al.  Lateralization of Lexical Processes in the Normal Brain: A Review of Visual Half-field Research , 1988 .

[18]  J. Hyönä,et al.  Eye fixation patterns among dyslexic and normal readers: effects of word length and word frequency. , 1995, Journal of experimental psychology. Learning, memory, and cognition.

[19]  R. Burchfield Frequency Analysis of English Usage: Lexicon and Grammar. By W. Nelson Francis and Henry Kučera with the assistance of Andrew W. Mackie. Boston: Houghton Mifflin. 1982. x + 561 , 1985 .

[20]  Peter Graf,et al.  Completion norms for 40 three-letter word stems , 1987 .

[21]  Susan M. Garnsey,et al.  Semantic Influences On Parsing: Use of Thematic Role Information in Syntactic Ambiguity Resolution , 1994 .

[22]  M. Gernsbacher Resolving 20 years of inconsistent interactions between lexical familiarity and orthography, concreteness, and polysemy. , 1984, Journal of experimental psychology. General.

[23]  S. Clark,et al.  Word frequency and list composition effects in associative recognition and recall , 1994, Memory & cognition.

[24]  Curt Burgess,et al.  A Computational Model of Syntactic Ambiguity as a Lexical Process , 1989 .

[25]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[26]  D. Plaut Relearning after Damage in Connectionist Networks: Toward a Theory of Rehabilitation , 1996, Brain and Language.

[27]  James L. McClelland,et al.  Distributed memory and the representation of general and specific information. , 1985, Journal of experimental psychology. General.

[28]  A. Jacobs,et al.  On the role of competing word units in visual word recognition: The neighborhood frequency effect , 1989, Perception & psychophysics.

[29]  Curt Burgess,et al.  From simple associations to the building blocks of language: Modeling meaning in memory with the HAL model , 1998 .

[30]  Yasushi Hino,et al.  Neighborhood Size and Neighborhood Frequency Effects in Word Recognition , 1995 .

[31]  E. Shoben,et al.  The influence of sentence constraint on the scope of facilitation for upcoming words. , 1985 .