A Connectionist Single-Mechanism Account of Rule-Like Behavior in Infancy

Morten H. Christiansen (morten@siu.edu)
Christopher M. Conway (conway@siu.edu)
Department of Psychology; Southern Illinois University
Carbondale, IL 62901-6502 USA

Suzanne Curtin (curtin@gizmo.usc.edu)
Department of Linguistics; University of Southern California
Los Angeles, CA 90089-1693 USA

Abstract

One of the most controversial issues in cognitive science pertains to whether rules are necessary to explain complex behavior. Nowhere has the debate over rules been more heated than within the field of language acquisition. Most researchers agree on the need for statistical learning mechanisms in language acquisition, but disagree on whether rule-learning components are also needed. Marcus, Vijayan, Rao, & Vishton (1999) have provided evidence of rule-like behavior which they claim can only be explained by a dual-mechanism account. In this paper, we show that a connectionist single-mechanism approach provides a more parsimonious account of rule-like behavior in infancy than the dual-mechanism approach. Specifically, we present simulation results from an existing connectionist model of infant speech segmentation, fitting the behavioral data under naturalistic circumstances without invoking rules. We further investigate diverging predictions from the single- and dual-mechanism accounts through additional simulations and artificial language learning experiments. The results support a connectionist single-mechanism account, while undermining the dual-mechanism account.

Introduction

The nature of the learning mechanisms that infants bring to the task of language acquisition is a major focus of research in cognitive science. With the rise of connectionism, much of the scientific debate surrounding this research has focused on whether rules are necessary to explain language acquisition.
All parties in the debate acknowledge that statistical learning mechanisms form a necessary part of the language acquisition process (e.g., Christiansen & Curtin, 1999; Marcus, Vijayan, Rao, & Vishton, 1999; Pinker, 1991). However, there is much disagreement over whether a statistical learning mechanism is sufficient to account for complex rule-like behavior, or whether additional rule-learning mechanisms are needed. In the past this debate has primarily taken place within specific areas of language acquisition, such as inflectional morphology (e.g., Pinker, 1991; Plunkett & Marchman, 1993) and visual word recognition (e.g., Coltheart, Curtis, Atkins & Haller, 1993; Seidenberg & McClelland, 1989). More recently, Marcus et al. (1999) have presented results from experiments with 7-month-olds, apparently showing that infants acquire abstract algebraic rules after two minutes of exposure to habituation stimuli. The algebraic rules are construed as representing an open-ended relationship between variables for which one can substitute arbitrary values, “such as ‘the first item X is the same as the third item Y,’ or more generally, that ‘item I is the same as item J’” (Marcus et al., 1999, p. 79). Marcus et al. further claim that a connectionist single-mechanism approach based on statistical learning is unable to fit their experimental data. In this paper, we build on earlier work (Christiansen & Curtin, 1999) and present a detailed connectionist model of these infant data, and provide new experimental data that support a statistically based single-mechanism approach while undermining the dual-mechanism account.

In the remainder of this paper, we first show that knowledge acquired in the service of learning to segment the speech stream can be recruited to carry out the kind of classification task used in the experiments by Marcus et al. For this purpose we took an existing model of early infant speech segmentation (Christiansen, Allen & Seidenberg, 1998) and used it to simulate the results obtained by Marcus et al. The simulations demonstrate that no rules are needed to account for the data; rather, statistical knowledge related to word segmentation can explain the rule-like behavior of the infants in the Marcus et al. study. We then explore the issue of timing in stimulus presentation and present additional simulations from which empirical predictions are derived that diverge from those of the rule-based account. These predictions are tested in experiments with adults. Experiment 1 replicated the results from Marcus et al. using adult subjects. Experiment 2 confirmed the predictions from our single-mechanism approach, whereas the dual-mechanism approach cannot account for these results without adding extra machinery to complement the statistical and rule-based components. Together, the simulations and the experiments thus suggest that a single-mechanism model provides the most parsimonious account of the empirical data presented here and in Marcus et al., thus obviating the need for a separate rule-based component.

Simulation 1: Rule-Like Behavior without Rules

Marcus et al. (1999) used an artificial language learning paradigm to test their claim that the infant has two mechanisms for learning language, one that uses statistical information and another that uses algebraic rules. They conducted three experiments that tested infants’ ability to generalize to items not presented in the familiarization phase of the experiment. We focus here on their third experiment because it was controlled for possible confounds found in the first two experiments: differences in phonetic features (Experiment 1) and reduplication¹ (Experiment 2). Marcus et al. claim that

¹ Though the control for reduplication was not entirely complete (see Elman, 1999).
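
To make concrete what an algebraic rule over variables amounts to (for example, "the first item X is the same as the third item Y", as quoted in the Introduction), the following minimal Python sketch checks whether a sequence of items instantiates an abstract pattern such as ABA. It is purely illustrative: it is not the model used in our simulations, nor a procedure taken from Marcus et al., and the function name `matches_pattern` and the example syllables are hypothetical.

```python
def matches_pattern(items, pattern):
    """Return True if `items` instantiates `pattern` (e.g. "ABA").

    Each letter in `pattern` is a variable. The same letter must bind to
    the same item everywhere, and different letters must bind to
    different items. Illustrative assumption, not the authors' method.
    """
    if len(items) != len(pattern):
        return False
    binding = {}  # maps each variable letter to the item it is bound to
    for variable, item in zip(pattern, items):
        if variable in binding:
            # Variable already bound: the item must match the binding.
            if binding[variable] != item:
                return False
        else:
            # New variable: it may not reuse an item bound to another variable.
            if item in binding.values():
                return False
            binding[variable] = item
    return True


# Hypothetical syllable triples; the rule is blind to the identity of the
# syllables and applies equally to items never encountered before.
print(matches_pattern(["ga", "ti", "ga"], "ABA"))
print(matches_pattern(["wo", "fe", "fe"], "ABA"))
```

Because the check refers only to variable bindings and never to particular syllables, it generalizes to arbitrary new items. This open-endedness is the property at stake in the debate over whether such behavior requires a separate rule-learning mechanism.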

[1] Paul W. B. Atkins, et al. Models of reading aloud: Dual-route and parallel-distributed-processing approaches, 1993.

[2] LouAnn Gerken, et al. Signal to syntax: Bootstrapping from speech to grammar in early acquisition, 1996.

[3] Peter M. Vishton, et al. Rule learning by seven-month-old infants, 1999, Science.

[4] Jeffrey L. Elman, et al. Finding Structure in Time, 1990, Cogn. Sci.

[5] James L. McClelland, et al. A distributed, developmental model of word recognition and naming, 1989, Psychological Review.

[6] Mark S. Seidenberg. Visual Word Recognition: An Overview, 1995.

[7] Lokendra Shastri, et al. A Spatiotemporal Connectionist Model of Algebraic Rule-learning, 1999.

[8] R. N. Aslin, et al. Statistical Learning by 8-Month-Old Infants, 1996, Science.

[9] Matthew Flatt, et al. PsyScope: An interactive graphic system for designing and controlling experiments in the psychology laboratory using Macintosh computers, 1993.

[10] S. Pinker, et al. Rules of language, 1991, Science.

[11] Morten H. Christiansen, et al. Learning to Segment Speech Using Multiple Cues: A Connectionist Model, 1998.

[12] E. Newport, et al. Word segmentation: The role of distributional cues, 1996.

[13] Thomas R. Shultz, et al. Rule learning by Habituation can be Simulated in Neural Networks, 1999, Proceedings of the Twenty First Annual Conference of the Cognitive Science Society.

[14] P. D. Eimas, et al. Speech, language, and communication, 1997.

[15] V. Marchman, et al. From rote learning to system building: Acquiring verb morphology in children and connectionist nets, 1993, Cognition.

[16] Z. Dienes, et al. Rule learning by seven-month-old infants and neural networks, 1999.