A Modular Neural Network Model of Concept Acquisition

Previous neural network models of concept learning were mainly implemented with supervised learning schemes. However, studies of human conceptual memory have shown that concepts may be learned without a teacher who provides the category name to associate with exemplars. A modular neural network architecture that realizes concept acquisition through two functionally distinct operations, categorizing and naming, is proposed as an alternative. An unsupervised algorithm realizes the categorizing module by constructing representations of categories compatible with prototype theory. The naming module associates category names to the output of the categorizing module in a supervised mode. In such a modular architecture, the interface between the modules can be conceived of as an “information relay” that encodes, constrains, and propagates important information. Five experiments were conducted to analyze the relationships among internal conceptual codes and simple conceptual and lexical development. The first two experiments show a prototype effect and illustrate some basic characteristics of the system. The third experiment presents a bottom-up model of the narrowing down of children's early lexical categories that honors mutual exclusivity. The fourth experiment introduces top-down constraints on conceptual coding. The fifth experiment exhibits how hierarchical relationships between concepts are learned by the architecture, and also demonstrates how a spectrum of conceptual expertise may gradually emerge as a consequence of experiencing more with certain categories than with others.

[1]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[2]  David E. Rumelhart,et al.  The architecture of mind: a connectionist approach , 1989 .

[3]  Gregory L. Murphy Meaning and concepts , 1991 .

[4]  Maureen A. Callanan,et al.  How Parents Label Objects for Young Children: The Role of Input in the Acquisition of Category Hierarchies. , 1985 .

[5]  Stephen A. Ritz,et al.  Distinctive features, categorical perception, and probability learning: some applications of a neural model , 1977 .

[6]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[7]  E. R. Siqueland,et al.  The nature and structure of infant form categories , 1983 .

[8]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[9]  Linda B. Smith,et al.  The importance of shape in early lexical learning , 1988 .

[10]  Garrison W. Cottrell,et al.  Image compression by back-propagation: An example of extensional programming , 1988 .

[11]  J.A. Anderson,et al.  Theory of categorization based on distributed memory storage. , 1984 .

[12]  Stephen Grossberg,et al.  Competitive Learning: From Interactive Activation to Adaptive Resonance , 1987, Cogn. Sci..

[13]  Terrence J. Sejnowski,et al.  Analysis of hidden units in a layered network trained to classify sonar targets , 1988, Neural Networks.

[14]  T. Au,et al.  Children's use of information in word learning , 1990, Journal of Child Language.

[15]  Ellen M. Markman,et al.  Principles of Organization in Young Children's Natural Language Hierarchies. , 1982 .

[16]  D. H. Wheeler,et al.  The early growth of logic in the child : classification and seriation , 1965 .

[17]  Teuvo Kohonen,et al.  The 'neural' phonetic typewriter , 1988, Computer.

[18]  R Bäuerle [Vibrotactile information transfer using sequences of binary signals]. , 1974, Kybernetik.

[19]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[20]  Paul C. Quinn,et al.  On Categorization in Early Infancy. , 1986 .

[21]  S. Grossberg How does the brain build a cognitive code , 1988 .

[22]  Geoffrey E. Hinton Connectionist Learning Procedures , 1989, Artif. Intell..

[23]  James L. McClelland,et al.  A distributed, developmental model of word recognition and naming. , 1989, Psychological review.

[24]  W. Freeman Second Commentary: On the proper treatment of connectionism by Paul Smolensky (1988) - Neuromachismo Rekindled , 1989 .

[25]  J. Fodor The Modularity of mind. An essay on faculty psychology , 1986 .

[26]  D. Bullock,et al.  Apprenticeship in Word Use: Social Convergence Processes in Learning Categorically Related Nouns , 1986 .

[27]  C. Mervis,et al.  On the existence of competence errors in early comprehension: a reply to Fremgen & Fay and Chapman & Thomson , 1983, Journal of Child Language.

[28]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[29]  G. Bower,et al.  From conditioning to category learning: an adaptive network model. , 1988 .

[30]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[31]  M. Posner,et al.  Retention of Abstract Ideas. , 1970 .

[32]  Kurt Hornik,et al.  Neural networks and principal component analysis: Learning from examples without local minima , 1989, Neural Networks.

[33]  Dare A. Baldwin,et al.  Priorities in children's expectations about object label reference: form over color. , 1989, Child development.

[34]  E. Rosch,et al.  Family resemblances: Studies in the internal structure of categories , 1975, Cognitive Psychology.

[35]  Pamela Blewitt,et al.  Dog versus collie: Vocabulary in speech to young children. , 1983 .

[36]  L. Barsalou,et al.  Ad hoc categories , 1983, Memory & cognition.

[37]  J. Fodor,et al.  The structure of a semantic theory , 1963 .

[38]  David Zipser,et al.  Feature Discovery by Competive Learning , 1986, Cogn. Sci..

[39]  M. Posner,et al.  On the genesis of abstract ideas. , 1968, Journal of experimental psychology.

[40]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[41]  S. Gelman,et al.  Incorporating new words into the lexicon: preliminary evidence for language hierarchies in two-year-old children. , 1989, Child development.

[42]  E. Markman Perspectives on language and thought: The whole-object, taxonomic, and mutual exclusivity assumptions as initial constraints on word meanings , 1991 .

[43]  D. Homa Prototype abstraction and classification of new instances as a function of number of instances defining the prototype , 1973 .

[44]  James L. McClelland,et al.  Distributed memory and the representation of general and specific information. , 1985, Journal of experimental psychology. General.

[45]  Ellen M. Markman,et al.  Classes and collections: Principles of organization in the learning of hierarchical relations , 1980, Cognition.

[46]  A. H. Kawamoto Distributed Representations of Ambiguous Words and Their Resolution in a Connectionist Network , 1988 .

[47]  T. Au,et al.  The principle of mutual exclusivity in word learning: to honor or not to honor? , 1990, Child development.

[48]  Paul J. Feltovich,et al.  Categorization and Representation of Physics Problems by Experts and Novices , 1981, Cogn. Sci..

[49]  Robin S. Chapman,et al.  Who is ‘Daddy’ revisited: the status of two-year-olds' over-extended words in use and comprehension , 1977, Journal of Child Language.

[50]  J J Hopfield,et al.  Neurons with graded response have collective computational properties like those of two-state neurons. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Ronald L. Rivest,et al.  Training a 3-node neural network is NP-complete , 1988, COLT '88.

[52]  R. Golden The :20Brain-state-in-a-box Neural model is a gradient descent algorithm , 1986 .

[53]  J. Gruendel,et al.  Referential Extension in Early Language Development. , 1977 .

[54]  S. Grossberg How does a brain build a cognitive code , 1980 .

[55]  P. Smolensky On the proper treatment of connectionism , 1988, Behavioral and Brain Sciences.

[56]  D. Hofstadter,et al.  Godel, Escher, Bach: An Eternal Golden Braid , 1979 .

[57]  Gregory L. Murphy,et al.  Psychological concepts in a parallel system , 1986 .

[58]  L. Barsalou Context-independent and context-dependent information in concepts , 1982, Memory & cognition.

[59]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[60]  Edward E. Smith,et al.  Categories and concepts , 1984 .

[61]  Douglas L. Medin,et al.  Context theory of classification learning. , 1978 .

[62]  J. Tanaka,et al.  Object categories and expertise: Is the basic level in the eye of the beholder? , 1991, Cognitive Psychology.

[63]  D. Medin,et al.  The role of theories in conceptual coherence. , 1985, Psychological review.

[64]  E. Markman,et al.  Children's sensitivity to constraints on word meaning: Taxonomic versus thematic relations , 1984, Cognitive Psychology.

[65]  Maureen A. Callanan,et al.  Development of Object Categories and Inclusion Relations: Preschoolers' Hypotheses about Word Meanings. , 1989 .

[66]  Ellen M. Markman,et al.  Categorization and Naming in Children: Problems of Induction , 1989 .

[67]  Edward E. Smith,et al.  Basic-level superiority in picture categorization , 1982 .

[68]  L. Rescorla,et al.  Overextension in early language development , 1980, Journal of Child Language.

[69]  Eve V. Clark,et al.  WHAT'S IN A WORD? ON THE CHILD'S ACQUISITION OF SEMANTICS IN HIS FIRST LANGUAGE , 1973 .

[70]  C. Mervis,et al.  The effect of feedback on young children's inappropriate word usage , 1986, Journal of Child Language.

[71]  J. A. Anderson,et al.  A neural network model of multistable perception. , 1985, Acta psychologica.

[72]  Daniel F. Chambliss,et al.  The relative contributions of common and distinctive information on the abstraction from ill-defined categories , 1975 .