Beyond single syllables: Large-scale modeling of reading aloud with the Connectionist Dual Process (CDP++) model

Most words in English have more than one syllable, yet the most influential computational models of reading aloud are restricted to monosyllabic words. Here we present CDP++, a new version of the Connectionist Dual Process model (Perry, Ziegler, & Zorzi, 2007). CDP++ simulates the reading aloud of mono- and disyllabic words and nonwords, and it learns to assign stress in exactly the same way as it learns to associate graphemes with phonemes. CDP++ reproduces the monosyllabic benchmark effects that its predecessor could, and is therefore fully backward compatible. It also accounts for a number of novel effects specific to disyllabic words, including the effects of stress regularity and syllable number. In terms of database performance, CDP++ accounts for over 49% of the reaction time variance on items selected from the English Lexicon Project, a very large database of several thousand words. With its lexicon of over 32,000 words, CDP++ is therefore a notable example of the successful scaling-up of a connectionist model to a size that more realistically approximates the human lexical system.
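
To make the "reaction time variance" figure concrete: in item-level evaluations of this kind, the variance accounted for is commonly taken to be the squared Pearson correlation between a model's per-item latencies and the mean human naming latencies for the same items. The sketch below illustrates that computation only. The function name and the toy numbers are hypothetical, and it is an assumption, not something stated in the abstract, that the reported 49% was obtained with exactly this statistic.

```python
from math import sqrt


def variance_explained(model_latencies, human_rts):
    """Return the squared Pearson correlation (r^2) between two item-level series.

    model_latencies: hypothetical per-item model latencies (e.g., processing cycles)
    human_rts:       hypothetical per-item mean human naming latencies (ms)
    """
    n = len(model_latencies)
    if n != len(human_rts) or n < 2:
        raise ValueError("need two equally long series with at least 2 items")

    mean_x = sum(model_latencies) / n
    mean_y = sum(human_rts) / n

    # Sums of cross-products and squared deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(model_latencies, human_rts))
    sxx = sum((x - mean_x) ** 2 for x in model_latencies)
    syy = sum((y - mean_y) ** 2 for y in human_rts)

    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")

    r = sxy / sqrt(sxx * syy)
    return r * r


# Toy data, invented for illustration only (not taken from any database).
toy_model = [1.0, 2.0, 3.0, 4.0]
toy_human = [1.0, -1.0, 1.0, -1.0]
print(round(variance_explained(toy_model, toy_human), 3))
```

On this reading, a value of 0.49 would mean that just under half of the item-to-item variation in human latencies is linearly predictable from the model's latencies.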
