Opening the Black Box: the Relationship between Neural Networks and Linear Discriminant Functions

Over the last ten years feed‐forward neural networks have become a popular tool for statistical decision making. During this time, they have been applied in many fields, including cytological classification. Neural networks are often treated as a black box, whose inner workings are concealed from the researcher. This is unfortunate, since the inner workings of a neural network can be understood in a manner similar to that of a linear discriminant function, which is the standard tool that researchers use for decision making. This paper discusses feed‐forward neural networks and some methods to improve their performance for classification problems. Their relationship to discriminant functions will be examined for a simple two‐dimensional classification problem.

[1]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[2]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[3]  David Haussler,et al.  What Size Net Gives Valid Generalization? , 1989, Neural Computation.

[4]  David B. Fogel An information criterion for optimal neural network selection , 1991, IEEE Trans. Neural Networks.

[5]  G. Kane Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[6]  Raymond L. Watrous Learning Algorithms for Connectionist Networks: Applied Gradient Methods of Nonlinear Optimization , 1988 .

[7]  Brian D. Ripley,et al.  Statistical aspects of neural networks , 1993 .

[8]  Robert A. Jacobs,et al.  Increased rates of convergence through learning rate adaptation , 1987, Neural Networks.

[9]  William H. Press,et al.  Numerical recipes , 1990 .

[10]  P. Wilding,et al.  The application of backpropagation neural networks to problems in pathology and laboratory medicine. , 1992, Archives of pathology & laboratory medicine.

[11]  Alberto L. Sangiovanni-Vincentelli,et al.  Efficient Parallel Learning Algorithms for Neural Networks , 1988, NIPS.

[12]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[13]  H. Akaike A new look at the statistical model identification , 1974 .

[14]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .