Constructive learning of recurrent neural networks: limitations of recurrent cascade correlation and a simple solution

It is often difficult to predict the optimal neural network size for a particular application. Constructive or destructive methods that add or subtract neurons, layers, connections, etc. might offer a solution to this problem. We prove that one method, recurrent cascade correlation, has fundamental representational limitations due to its topology, and therefore limitations in what it can learn: with monotone (e.g., sigmoid) or hard-threshold activation functions it cannot represent certain finite state automata. We give a preliminary approach to overcoming these limitations by devising a simple constructive training method that adds neurons during training while preserving the powerful fully-recurrent structure. We illustrate this approach with simulations that learn many examples of regular grammars that the recurrent cascade correlation method is unable to learn.
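The abstract only names the constructive idea; the following is a minimal sketch, under our own assumptions, of what such a trainer could look like: a small fully-recurrent network trained by backpropagation through time whose state is grown by one neuron whenever the loss stops improving, with all previously learned recurrent weights left trainable (in contrast to RCC, where earlier hidden units are frozen into a cascade). The parity task, the growth criterion, and every hyperparameter below are illustrative choices, not the authors' settings.

# Minimal sketch (not the authors' algorithm): a fully-recurrent net that grows
# one hidden unit when training stalls, keeping all existing weights trainable.
import numpy as np

rng = np.random.default_rng(0)

def make_parity_data(n_seqs=200, max_len=10):
    # Binary strings labeled by the parity of their 1s -- a simple regular language.
    data = []
    for _ in range(n_seqs):
        length = rng.integers(1, max_len + 1)
        bits = rng.integers(0, 2, size=length)
        data.append((bits.reshape(-1, 1).astype(float), float(bits.sum() % 2)))
    return data

class FullyRecurrentNet:
    def __init__(self, n_in, n_hidden):
        self.W = rng.normal(0, 0.5, (n_hidden, n_hidden))  # recurrent weights
        self.U = rng.normal(0, 0.5, (n_hidden, n_in))       # input weights
        self.b = np.zeros(n_hidden)
        self.v = rng.normal(0, 0.5, n_hidden)                # readout weights
        self.c = 0.0

    def forward(self, x_seq):
        hs = [np.zeros(self.W.shape[0])]
        for x in x_seq:
            hs.append(np.tanh(self.W @ hs[-1] + self.U @ x + self.b))
        y = 1.0 / (1.0 + np.exp(-(self.v @ hs[-1] + self.c)))  # accept/reject probability
        return hs, y

    def train_step(self, x_seq, label, lr=0.05):
        hs, y = self.forward(x_seq)
        dlogit = y - label                       # cross-entropy gradient w.r.t. the logit
        gW, gU = np.zeros_like(self.W), np.zeros_like(self.U)
        gb, gv = np.zeros_like(self.b), dlogit * hs[-1]
        dh = dlogit * self.v
        for t in range(len(x_seq), 0, -1):       # backpropagation through time
            da = dh * (1.0 - hs[t] ** 2)         # tanh derivative
            gW += np.outer(da, hs[t - 1])
            gU += np.outer(da, x_seq[t - 1])
            gb += da
            dh = self.W.T @ da
        self.W -= lr * gW; self.U -= lr * gU; self.b -= lr * gb
        self.v -= lr * gv; self.c -= lr * dlogit
        return -(label * np.log(y + 1e-9) + (1 - label) * np.log(1 - y + 1e-9))

    def add_neuron(self):
        # Grow the state by one unit; old weights are kept and remain trainable.
        def grow(mat, rows, cols):
            out = rng.normal(0, 0.1, (rows, cols))
            out[:mat.shape[0], :mat.shape[1]] = mat
            return out
        n = self.W.shape[0]
        self.W = grow(self.W, n + 1, n + 1)
        self.U = grow(self.U, n + 1, self.U.shape[1])
        self.b = np.append(self.b, 0.0)
        self.v = np.append(self.v, rng.normal(0, 0.1))

# Constructive outer loop: train, and grow the network when the loss stops improving.
data = make_parity_data()
net = FullyRecurrentNet(n_in=1, n_hidden=1)
prev = np.inf
for epoch in range(1, 301):
    loss = np.mean([net.train_step(xs, lab) for xs, lab in data])
    if epoch % 25 == 0:
        if prev - loss < 0.01 and net.W.shape[0] < 8:  # stalled: add a fully-connected unit
            net.add_neuron()
        prev = loss
        print(f"epoch {epoch:3d}  units {net.W.shape[0]:2d}  loss {loss:.3f}")

The key contrast with recurrent cascade correlation is in add_neuron: the new unit is wired into the full recurrent weight matrix and every existing weight stays subject to further training, rather than being frozen as the input of a new cascaded unit.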
