Density Plots of Hidden Value Unit Activations Reveal Interpretable Bands

A particular backpropagation network, called a network of value units, was trained to detect problem type and validity of a set of logic problems. This network differs from standard networks in using a Gaussian activation function. After training was successfully completed, jittered density plots were computed for each hidden unit, and used to represent the distribution of activations produced in each hidden unit by the entire training set. The density plots revealed a marked banding. Further analysis revealed that almost all of these bands could be assigned featural interpretations, and played an important role in explaining how the network classified input patterns. These results are discussed in the context of other techniques for analyzing network structure, and in the context of other parallel distributed processing architectures.

[1]  Beat Kleiner,et al.  Graphical Methods for Data Analysis , 1983 .

[2]  M. McCloskey Networks and Theories: The Place of Connectionism in Cognitive Science , 1991 .

[3]  H. J. Eysenck The logical basis of factor analysis. , 1953 .

[4]  John Moody,et al.  Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.

[5]  D. Robinson Movement control: Implications of neural networks for how we think about brain function , 1992 .

[6]  Michael C. Mozer,et al.  Using Relevance to Reduce Network Size Automatically , 1989 .

[7]  P. Johnson-Laird Mental models , 1989 .

[8]  Stephen José Hanson,et al.  What connectionist models learn: Learning and representation in connectionist networks , 1990, Behavioral and Brain Sciences.

[9]  Geoffrey E. Hinton,et al.  Learning distributed representations of concepts. , 1989 .

[10]  James D. Keeler,et al.  Predicting the Future: Advantages of Semilocal Units , 1991, Neural Computation.

[11]  W. Bechtel,et al.  Connectionism and the Mind , 1991 .

[12]  M. Dawson,et al.  Connectionism, Confusion and Cognitive Science , 1994 .

[13]  Merrie Bergmann,et al.  The Logic Book , 1980 .

[14]  David A. Medler,et al.  Training redundant artificial neural networks: Imposing biology on technology , 1994, Psychological research.

[15]  W. Schneider Connectionism: Is it a paradigm shift for psychology? , 1987 .

[16]  Dana H. Ballard,et al.  Cortical connections and parallel processing: Structure and function , 1986, Behavioral and Brain Sciences.

[17]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[18]  Michael R. W. Dawson,et al.  Modifying the Generalized Delta Rule to Train Networks of Non-monotonic Processors for Pattern Classification , 1992 .

[19]  John M. Chambers,et al.  Graphical Methods for Data Analysis , 1983 .