A Spatiotemporal Connectionist Model of Algebraic Rule-learning

Recent experiments by Marcus, Vijaya, Rao, and Vishton suggest that infants are capable of extracting and using abstract algebraic rules such as \the rst item X is the same as the third item Y". Such an algebraic rule represents a relationship between placeholders or variables for which one can substitute arbitrary values. As Marcus et al. point out, while most neural network models excel at capturing statistical patterns and regularities in data, they have diiculty in extracting algebraic rules that generalize to new items. We describe a connectionist network architecture that can readily acquire algebraic rules. The extracted rules are not tied to features of words used during habituation, and generalize to new words. Furthermore, the network acquires rules from a small number of examples, without using negative evidence, and without any pretraining. A signiicant aspect of the proposed model is that it identiies a suucient set of architectural and representational conditions that transform the problem of learning algebraic rules to the much simpler problem of learning to detect coincidences within a spatiotemporal pattern. Two key representational conditions are (i) the existence of nodes that encode serial position within a sequence and (ii) the use of temporal synchrony for expressing bindings between a positional role node and the item that occupies this position in a given sequence. This work suggests that even abstract algebraic rules can be grounded in concrete and basic notions such as spatial and temporal location, and coincidence.

[1]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[2]  A. Georgopoulos,et al.  Motor cortical encoding of serial order in a context-recall task. , 1999, Science.

[3]  M Negishi,et al.  Do infants learn grammar with algebra or statistics? , 1999, Science.

[4]  Raymond L. Watrous GRADSIM: A Connectionist Network Simulator Using Gradient Optimization Techniques , 1988 .

[5]  Marius Usher,et al.  Visual synchrony affects binding and segmentation in perception , 1998, Nature.

[6]  R. Christopher deCharms,et al.  Primary cortical representation of sounds by the coordination of action-potential timing , 1996, Nature.

[7]  Jerome A. Feldman,et al.  Neural Representation of Conceptual Knowledge. , 1986 .

[8]  L. Shastri,et al.  From simple associations to systematic reasoning: A connectionist representation of rules, variables and dynamic bindings using temporal synchrony , 1993, Behavioral and Brain Sciences.

[9]  W Singer,et al.  Visual feature integration and the temporal correlation hypothesis. , 1995, Annual review of neuroscience.

[10]  Lokendra Shastri,et al.  Exploiting Temporal Binding to Learn Relational Rules within a Connectionist Network , 1997 .

[11]  John E. Hummel,et al.  Distributed representations of structure: A theory of analogical access and mapping. , 1997 .

[12]  Peter M. Vishton,et al.  Rule learning by seven-month-old infants. , 1999, Science.

[13]  Christoph von der Malsburg,et al.  The Correlation Theory of Brain Function , 1994 .