Strong systematicity through sensorimotor conceptual grounding: an unsupervised, developmental approach to connectionist sentence processing

Connectionist language modelling typically has difficulty with syntactic systematicity, or the ability to generalise language learning to untrained sentences. This work develops an unsupervised connectionist model of infant grammar learning. Following the semantic boostrapping hypothesis, the network distils word category using a developmentally plausible infant-scale database of grounded sensorimotor conceptual representations, as well as a biologically plausible semantic co-occurrence activation function. The network then uses this knowledge to acquire an early benchmark clausal grammar using correlational learning, and further acquires separate conceptual and grammatical category representations. The network displays strongly systematic behaviour indicative of the general acquisition of the combinatorial systematicity present in the grounded infant-scale language stream, outperforms previous contemporary models that contain primarily noun and verb word categories, and successfully generalises broadly to novel untrained sensorimotor grounded sentences composed of unfamiliar nouns and verbs. Limitations as well as implications to later grammar learning are discussed.

[1]  Nello Cristianini,et al.  Are We There Yet? , 2002, Science.

[2]  R. Brown,et al.  A First Language , 1973 .

[3]  Steven Pinker,et al.  Generalisation of regular and irregular morphological patterns , 1993 .

[4]  E. Bates,et al.  2 ON THE INSEPARABILITY OF GRAMMAR AND THE LEXICON : EVIDENCE FROM ACQUISITION , APHASIA AND REAL-TIME PROCESSING , 1997 .

[5]  Geoffrey E. Hinton,et al.  Distributed Representations , 1986, The Philosophy of Artificial Intelligence.

[6]  L. Barsalou,et al.  Whither structured representation? , 1999, Behavioral and Brain Sciences.

[7]  Jeffrey L. Elman,et al.  Language as a dynamical system , 1996 .

[8]  Ronald J. Brachman,et al.  ON THE EPISTEMOLOGICAL STATUS OF SEMANTIC NETWORKS , 1979 .

[9]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[10]  R. C. Tees Review of The organization of behavior: A neuropsychological theory. , 2003 .

[11]  Nello Cristianini Are We There Yet? , 2009, ECML/PKDD.

[12]  L. Barsalou,et al.  Perceptual simulation in property verification , 2004, Memory & cognition.

[13]  K. Nelson,et al.  Structure and strategy in learning to talk. , 1973 .

[14]  Allan Collins,et al.  A spreading-activation theory of semantic processing , 1975 .

[15]  Frank van der Velde,et al.  A neural architecture for grounded cognition: Representation, structure, dynamics and learning , 2008, IJCNN.

[16]  James L. McClelland,et al.  On learning the past-tenses of English verbs: implicit rules or parallel distributed processing , 1986 .

[17]  F. Velde,et al.  Compositional connectionist structures based on in situ grounded representations , 2011, Connect. Sci..

[18]  J. Steven Reznick,et al.  Short-form versions of the MacArthur Communicative Development Inventories , 2000, Applied Psycholinguistics.

[19]  Roger C. Schank,et al.  Dynamic memory - a theory of reminding and learning in computers and people , 1983 .

[20]  Henrik Jacobsson,et al.  Sentence-processing in echo state networks: a qualitative analysis by finite state machine extraction , 2010, Connect. Sci..

[21]  P. Bloom How children learn the meanings of words , 2000 .

[22]  John G. Taylor,et al.  The temporal Kohönen map , 1993, Neural Networks.

[23]  S. Williamson,et al.  Tonotopic organization of human auditory association cortex , 1994, Brain Research.

[24]  Steven Pinker,et al.  Language learnability and language development , 1985 .

[25]  Risto Miikkulainen,et al.  Contour integration and segmentation with self-organized lateral connections , 2004, Biological Cybernetics.

[26]  Mark S. Seidenberg,et al.  On the nature and scope of featural representations of word meaning. , 1997, Journal of experimental psychology. General.

[27]  Herbert Jaeger,et al.  Adaptive Nonlinear System Identification with Echo State Networks , 2002, NIPS.

[28]  Thomas Voegtlin,et al.  Recursive self-organizing maps , 2002, Neural Networks.

[29]  Stefan L. Frank,et al.  Generalization and systematicity in echo state networks , 2008 .

[30]  Marc Strickert,et al.  Neural Gas for Sequences , 2003 .

[31]  A. Clark Supersizing the Mind , 2008 .

[32]  T. Heskes Energy functions for self-organizing maps , 1999 .

[33]  Peter Alexander Jansen A Self-Organizing Computational Neural Network Architecture with Applications to Sensorimotor Grounded Linguistic Grammar Acquisition , 2010 .

[34]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[35]  Mirko Farina Supersizing the Mind: Embodiment, Action and Cognitive Extension. , 2010 .

[36]  R. A. Brooks,et al.  Intelligence without Representation , 1991, Artif. Intell..

[37]  John R. Searle,et al.  Minds, brains, and programs , 1980, Behavioral and Brain Sciences.

[38]  Stevan Harnad,et al.  Symbol grounding problem , 1990, Scholarpedia.

[39]  Colin Humphries,et al.  Tonotopic organization of human auditory cortex , 2010, NeuroImage.

[40]  C SchankRoger,et al.  Dynamic Memory: A Theory of Reminding and Learning in Computers and People , 1983 .

[41]  William A. Woods,et al.  What's in a Link: Foundations for Semantic Networks , 1975 .

[42]  Risto Miikkulainen,et al.  SARDNET: A Self-Organizing Feature Map for Sequences , 1994, NIPS.

[43]  Ah Chung Tsoi,et al.  A self-organizing map for adaptive processing of structured data , 2003, IEEE Trans. Neural Networks.

[44]  Adam Prügel-Bennett,et al.  Analysis of synfire chains , 1995 .

[45]  Diane Pecher,et al.  Sensorimotor simulations underlie conceptual representations: Modality-specific effects of prior activation , 2004, Psychonomic bulletin & review.

[46]  P. D. Eimas,et al.  Studies on the formation of perceptually based basic-level categories in young infants. , 1994, Child development.

[47]  E. Bates,et al.  New Directions in Research on Language Development , 1993 .

[48]  M. Ross Quillian,et al.  Retrieval time from semantic memory , 1969 .

[49]  G. Dell,et al.  Becoming syntactic. , 2006, Psychological review.

[50]  J. Fodor,et al.  Connectionism and cognitive architecture: A critical analysis , 1988, Cognition.

[51]  Steve R. Howell,et al.  A Model of Grounded Language Acquisition: Sensorimotor Features Improve Lexical and Grammatical Learning. , 2005 .

[52]  G. Marcus Can connectionism save constructivism? , 1998, Cognition.

[53]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[54]  J. Elman Distributed Representations, Simple Recurrent Networks, And Grammatical Structure , 1991 .

[55]  Ronald J. Brachman,et al.  What's in a Concept: Structural Foundations for Semantic Networks , 1977, Int. J. Man Mach. Stud..

[56]  Robert F. Hadley Systematicity in Connectionist Language Learning , 1994 .

[57]  Frank van der Velde,et al.  Lack of combinatorial productivity in language processing with simple recurrent networks , 2004, Connect. Sci..

[58]  O. L. Z. Book Review: The Organization of Behaviour: A Neuropsychological Theory , 1950 .

[59]  T. Kohonen,et al.  Self-organizing semantic maps , 1989, Biological Cybernetics.

[60]  Igor Farkas,et al.  Syntactic systematicity in sentence processing with a recurrent self-organizing network , 2008, Neurocomputing.

[61]  J. Fodor,et al.  Connectionism and the problem of systematicity: Why Smolensky's solution doesn't work , 1990, Cognition.

[62]  R. Schvaneveldt,et al.  Facilitation in recognizing pairs of words: evidence of a dependence between retrieval operations. , 1971, Journal of experimental psychology.

[63]  Stefan L. Frank,et al.  Learn more by training less: systematicity in sentence processing by recurrent networks , 2006, Connect. Sci..

[64]  Michael C. Mozer,et al.  Neural net architectures for temporal sequence processing , 2007 .

[65]  Daniel C. Richardson,et al.  Spatial representations activated during real-time comprehension of verbs , 2003, Cogn. Sci..

[66]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[67]  Philemon Brakel,et al.  Strong systematicity in sentence processing by simple recurrent networks , 2009 .

[68]  Nick Chater,et al.  Toward a connectionist model of recursion in human linguistic performance , 1999, Cogn. Sci..