What connectionist models learn: Learning and representation in connectionist networks

Connectionist models provide a promising alternative to the traditional computational approach that has dominated cognitive science and artificial intelligence for several decades, although the nature of connectionist models and their relation to symbol processing remain controversial. Connectionist models can be characterized by three general computational features: distinct layers of interconnected units, recursive rules for updating the strengths of the connections during learning, and “simple” homogeneous computing elements. Using just these three features, one can construct surprisingly elegant and powerful models of memory, perception, motor control, categorization, and reasoning. What makes the connectionist approach unique is not its variety of representational possibilities (including “distributed representations”), its departure from explicit rule-based models, or even its preoccupation with the brain metaphor. Rather, it is that connectionist models can be used to explore systematically the complex interaction between learning and representation, as we try to demonstrate through the analysis of several large networks.


[60]  J. Hyvärinen,et al.  Function of the parietal associative area 7 as revealed from cellular discharges in alert monkeys. , 1974, Brain : a journal of neurology.

[61]  S. M. Carroll,et al.  Construction of neural nets using the radon transform , 1989, International 1989 Joint Conference on Neural Networks.

[62]  V. Mountcastle,et al.  Visual input to the visuomotor mechanisms of the monkey's parietal lobe. , 1977, Science.

[63]  G. Edelman Neural Darwinism: The Theory Of Neuronal Group Selection , 1989 .

[64]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[65]  Stephen A. Ritz,et al.  Distinctive features, categorical perception, and probability learning: some applications of a neural model , 1977 .

[66]  D. Rumelhart,et al.  Philosophy and Connectionist Theory , 1991 .

[67]  S. Pinker,et al.  On language and connectionism: Analysis of a parallel distributed processing model of language acquisition , 1988, Cognition.

[68]  D. Lightfoot The child's trigger experience: Degree-0 learnability , 1989, Behavioral and Brain Sciences.

[69]  T. Bever,et al.  The relation between linguistic structure and associative theories of language learning—A constructive critique of some connectionist learning models , 1988, Cognition.

[70]  Robert B. Allen,et al.  Connectionist Language Users , 1990 .

[71]  G. J. Tomko,et al.  Neuronal variability: non-stationary responses to identical visual stimuli. , 1974, Brain research.

[72]  W. Köhler The Mentality of Apes. , 2018, Nature.

[73]  David J. Burr,et al.  Experiments on neural net recognition of spoken and written text , 1988, IEEE Trans. Acoust. Speech Signal Process..

[74]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[75]  Christoph von der Malsburg,et al.  Pattern recognition by labeled graph matching , 1988, Neural Networks.

[76]  Gèunther Palm,et al.  Neural Assemblies: An Alternative Approach to Artificial Intelligence , 1982 .

[77]  R. Andersen,et al.  Callosal and prefrontal associational projecting cell populations in area 7A of the macaque monkey: A study using retrogradely transported fluorescent dyes , 1985, The Journal of comparative neurology.

[78]  V. Braitenberg Reading the structure of brains , 1990 .

[79]  Pat Langley,et al.  A computational theory of motor learning , 1987, Comput. Intell..

[80]  Teuvo Kohonen,et al.  Associative memory. A system-theoretical approach , 1977 .

[81]  Noam Chomsky,et al.  A Review of B. F. Skinner's Verbal Behavior , 1980 .

[82]  Idan Segev,et al.  Methods in Neuronal Modeling , 1988 .

[83]  Kehoe Ej Connectionist models of conditioning: A tutorial. , 1989 .

[84]  James L. McClelland,et al.  Parallel Distributed Processing: Explorations in the Microstructure of Cognition : Psychological and Biological Models , 1986 .

[85]  W. Levelt Formal grammars in linguistics and psycholinguistics : Vol.III, Psycholinguistic applications , 1974 .

[86]  Richard A. Andersen,et al.  Value, variable, and coarse coding by posterior parietal neurons , 1986, Behavioral and Brain Sciences.

[87]  Terrence J. Sejnowski,et al.  NETtalk: a parallel network that learns to read aloud , 1988 .

[88]  Allen Newell,et al.  Computer science as empirical inquiry: symbols and search , 1976, CACM.

[89]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[90]  D. Willshaw,et al.  Theories of associative recall , 1970, Quarterly Reviews of Biophysics.

[91]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[92]  Terrence J. Sejnowski,et al.  Network model of shape-from-shading: neural function arises from both receptive and projective fields , 1988, Nature.

[93]  S. Pinker,et al.  Connections and symbols , 1988 .

[94]  Stierlin Organization of Behavior. A Neuropsychological Theory , 1953 .

[95]  James A. Hendler,et al.  Marker‐passing over Microfeatures: Towards a Hybrid Symbolic/Connectionist Model , 1989 .

[96]  Noam Chomsky Review of B.F. Skinner, Verbal Behavior , 1959 .

[97]  Roger N. Shepard,et al.  How fully should connectionism be activated? Two sources of excitation and one of inhibition , 1988, Behavioral and Brain Sciences.

[98]  Barak A. Pearlmutter Learning state space trajectories in recurrent neural networks : a preliminary report. , 1988 .

[99]  Donald F. Specht,et al.  Generation of Polynomial Discriminant Functions for Pattern Recognition , 1967, IEEE Trans. Electron. Comput..

[100]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[101]  D. Massaro Some criticisms of connectionist models of human performance , 1988 .

[102]  R. Shepard,et al.  A nonmetric variety of linear factor analysis , 1974 .

[103]  Dean Allemang,et al.  Information processing abstractions: The message still counts more than the medium , 1988, Behavioral and Brain Sciences.

[104]  David S. Touretzky,et al.  A Computational Basis for Phonology , 1989, NIPS.

[105]  Paul E. Utgoff,et al.  Machine Learning of Inductive Bias , 1986 .

[106]  John R. Searle,et al.  Minds, brains, and programs , 1980, Behavioral and Brain Sciences.

[107]  L. N. Kanal,et al.  Uncertainty in Artificial Intelligence 5 , 1990 .

[108]  Jerome A. Feldman,et al.  Connectionist Models and Their Properties , 1982, Cogn. Sci..

[109]  Allen Newell,et al.  Towards Chunking as a General Learning Mechanism , 1984, AAAI.

[110]  Ronald J. Williams,et al.  A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[111]  David E. Rumelhart,et al.  Product Units: A Computationally Powerful and Biologically Plausible Extension to Backpropagation Networks , 1989, Neural Computation.

[112]  James L. McClelland,et al.  A distributed, developmental model of word recognition and naming. , 1989, Psychological review.

[113]  J. Stephen Judd,et al.  On the complexity of loading shallow neural networks , 1988, J. Complex..

[114]  R. Pfeifer,et al.  Connectionism in Perspective , 1989 .

[115]  M. Sidman,et al.  Conditional discrimination vs. matching to sample: an expansion of the testing paradigm. , 1982, Journal of the experimental analysis of behavior.

[116]  Pineda,et al.  Generalization of back-propagation to recurrent neural networks. , 1987, Physical review letters.

[117]  Yoshiro Miyata,et al.  The learning and planning of actions , 1988 .

[118]  R Ratcliff,et al.  Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. , 1990, Psychological review.

[119]  J. Gibson The Senses Considered As Perceptual Systems , 1967 .

[120]  B. Bridgeman,et al.  The physiology of attention: participation of cat striate cortex in behavioral choice , 1989, Psychological research.

[121]  H. Roitblat The meaning of representation in animal memory , 1982, Behavioral and Brain Sciences.