Bilingual and Multilingual Mental Lexicon: A Modeling Study With Linear Discriminative Learning

This study asks whether there is anything special about learning a third language, as compared to learning a second language, simply because it is the third language acquired and independently of its specific properties. We used computational modeling to explore this question for the learning of a small vocabulary of some 400 words, with English as L1, German or Mandarin as L2, and Mandarin or, alternatively, Dutch as L3. For the modeling, we used the mathematical framework of linear discriminative learning, which we extended with the Widrow-Hoff learning rule. This extension makes it possible to model incremental learning of the mappings between form and meaning when words' meanings are represented by vectors of real numbers (embeddings) rather than by abstract symbolic units. A series of simulation experiments covering single-language learning, bilingual learning, and finally trilingual learning clarified that, within the framework of discrimination learning, within-language homophones give rise to frailty in comprehension. In production, this frailty in turn gives rise to semantic errors in L1 and to language intrusions in L2 and L3. Our model correctly predicts that production lags behind comprehension in learning. It also clarified that, within the boundaries of discrimination learning, the properties of the L3 crucially determine whether L3 learning appears to involve a language that is 'dormant' with respect to L1 and L2. Surprisingly, qualitatively different patterns of L3 acquisition, and of its interactions with L1 and L2, can arise in our simulations without any changes in the mathematics driving learning. Our simulations also show that when words' forms incorporate not only segmental but also suprasegmental information, the nature of the errors that arise in production changes. In the general discussion, we reflect on the implications of our findings for the question of what is special about multilingualism.
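
To make the incremental learning step concrete, the following is a minimal sketch of a Widrow-Hoff (delta rule) update for a linear mapping from form vectors to semantic vectors. It is not the study's implementation: the function names, the three-dimensional binary form vectors, the two-dimensional meaning vectors, the learning rate, and the number of epochs are all assumptions chosen for illustration.

```python
# Minimal Widrow-Hoff (delta rule) sketch in pure Python.
# All names, toy vectors, and parameter values are illustrative assumptions,
# not taken from the study.

def predict(W, cue):
    """Map a form (cue) vector to a predicted semantic vector (cue times W)."""
    n_dims = len(W[0])
    return [sum(c * row[j] for c, row in zip(cue, W)) for j in range(n_dims)]


def widrow_hoff_update(W, cue, target, rate):
    """One incremental step: W <- W + rate * cue^T (target - cue W)."""
    error = [t - p for t, p in zip(target, predict(W, cue))]
    return [[w + rate * c * e for w, e in zip(row, error)]
            for c, row in zip(cue, W)]


def train(events, n_cues, n_dims, rate=0.1, epochs=500):
    """Start from a zero matrix and apply the update once per learning event."""
    W = [[0.0] * n_dims for _ in range(n_cues)]
    for _ in range(epochs):
        for cue, target in events:
            W = widrow_hoff_update(W, cue, target, rate)
    return W


# Two toy words with different forms and different meanings.
DISTINCT = [([1, 1, 0], [1.0, 0.0]),
            ([0, 1, 1], [0.0, 1.0])]

# Two toy homophones: the same form paired with two different meanings.
HOMOPHONES = [([1, 1, 0], [1.0, 0.0]),
              ([1, 1, 0], [0.0, 1.0])]

if __name__ == "__main__":
    W = train(DISTINCT, n_cues=3, n_dims=2)
    print("distinct forms:", predict(W, [1, 1, 0]), predict(W, [0, 1, 1]))
    W_h = train(HOMOPHONES, n_cues=3, n_dims=2)
    print("homophones:", predict(W_h, [1, 1, 0]))
```

In this toy setting, words with distinct forms are mapped onto their own meanings, whereas the shared form of the two homophones yields a predicted vector that falls between the two meanings. This is one simple way to see how homophony can weaken comprehension under error-driven learning.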
