Structural adaptation of parsimonious higher-order neural classifiers

Abstract We exploit the potential of parsimonious higher-order neural classifiers to reduce hardware expense, speed up learning, and achieve robust generalization. Specifically, our neuron model allows the computation of input products of potentially unlimited order. Structural adaptation of the topology is achieved by two alternative algorithms that ultimately allocate resources only to the relevant nonlinear interactions, while keeping the combinatorial explosion of higher-order terms in check. The first algorithm, a deterministic pruning variant, starts with the complete higher-order neuron and performs an iterated process of weight elimination. The second algorithm, a stochastic search, explores the space of sparse topologies: it starts with a randomly allocated set of higher-order terms and modifies the resource allocation while keeping the size of the architecture fixed. Two challenging classification benchmarks demonstrate the performance of the presented approach: the two-spirals separation problem and the left-/right-shift classification problem for binary strings. Our simulation results show that the proposed model may be a powerful tool for a variety of hard classification problems.
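The abstract describes, at a high level, a higher-order unit whose net input is a weighted sum of products over subsets of the inputs, adapted structurally either by magnitude-based weight elimination or by stochastic reallocation of product terms at a fixed architecture size. The sketch below is only an illustration of these ideas under stated assumptions, not the authors' implementation; the class and method names (HigherOrderUnit, prune_weakest, reallocate_weakest) and the simple delta-rule update are assumptions chosen for clarity.

    # Illustrative sketch only -- not the authors' code. A higher-order unit whose
    # net input is a weighted sum of products over index subsets of the input,
    # plus toy versions of the two structural-adaptation steps described in the
    # abstract: deterministic weight elimination (pruning) and stochastic
    # reallocation of product terms while the term count stays fixed.
    import math
    import random


    def sigmoid(z):
        return 1.0 / (1.0 + math.exp(-z))


    class HigherOrderUnit:
        def __init__(self, terms, n_inputs):
            # Each term is a tuple of input indices; the empty tuple () is the bias.
            self.terms = list(terms)
            self.n_inputs = n_inputs
            self.weights = [random.uniform(-0.5, 0.5) for _ in self.terms]

        def _products(self, x):
            # Product of the selected input components for every allocated term.
            prods = []
            for idx in self.terms:
                p = 1.0
                for i in idx:
                    p *= x[i]
                prods.append(p)
            return prods

        def output(self, x):
            net = sum(w * p for w, p in zip(self.weights, self._products(x)))
            return sigmoid(net)

        def train_step(self, x, target, lr=0.1):
            # Plain gradient step on the squared error of a sigmoidal unit.
            prods = self._products(x)
            y = sigmoid(sum(w * p for w, p in zip(self.weights, prods)))
            delta = (target - y) * y * (1.0 - y)
            for k, p in enumerate(prods):
                self.weights[k] += lr * delta * p

        def prune_weakest(self, fraction=0.1):
            # Pruning variant (sketch): discard the smallest-magnitude weights.
            keep = max(1, int(len(self.terms) * (1.0 - fraction)))
            ranked = sorted(range(len(self.terms)),
                            key=lambda k: abs(self.weights[k]),
                            reverse=True)[:keep]
            self.terms = [self.terms[k] for k in ranked]
            self.weights = [self.weights[k] for k in ranked]

        def reallocate_weakest(self, max_order=4):
            # Stochastic variant (sketch): replace the weakest term by a randomly
            # drawn product term of random order, keeping the term count fixed.
            k = min(range(len(self.terms)), key=lambda j: abs(self.weights[j]))
            order = random.randint(1, min(max_order, self.n_inputs))
            self.terms[k] = tuple(sorted(random.sample(range(self.n_inputs), order)))
            self.weights[k] = random.uniform(-0.5, 0.5)

As a usage illustration, one could start from a few dozen randomly drawn product terms over binary inputs, interleave train_step over the training set with occasional reallocate_weakest calls (stochastic variant), or instead start from all terms up to some order and apply prune_weakest between training epochs (pruning variant).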
