Improvement of Learning in Recurrent Networks by Substituting the Sigmoid Activation Function

Several recurrent network architectures have been devised in recent years to deal with sequential tasks. One such model is the Simple Recurrent Network (SRN) proposed by Elman (Elman, 1988). The standard backpropagation rule was employed for learning in the first published works with SRNs, e.g. (Cleeremans et al., 1989). Later, full-gradient learning schemes such as real-time recurrent learning (RTRL) and backpropagation through time (BPTT) were proposed for training fully connected recurrent networks. These algorithms can also be used to train the weights of the recurrent hidden layer in SRNs.
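To make the architecture concrete, the following is a minimal sketch of an Elman-style SRN forward pass with sigmoid hidden units, as typically described. All names, dimensions, and the initialization scheme here are illustrative assumptions, not taken from the cited sources:

```python
import numpy as np

def sigmoid(x):
    """Standard logistic sigmoid activation."""
    return 1.0 / (1.0 + np.exp(-x))

class SimpleRecurrentNetwork:
    """Illustrative Elman-style SRN: the hidden layer receives the current
    input together with a copy of its own activation from the previous
    time step (the 'context' units)."""

    def __init__(self, n_in, n_hidden, n_out, rng=None):
        rng = rng or np.random.default_rng(0)
        # Small random initialization (an assumption for this sketch).
        self.W_in = rng.normal(0.0, 0.1, (n_hidden, n_in))       # input -> hidden
        self.W_rec = rng.normal(0.0, 0.1, (n_hidden, n_hidden))  # context -> hidden
        self.W_out = rng.normal(0.0, 0.1, (n_out, n_hidden))     # hidden -> output
        self.h = np.zeros(n_hidden)                              # context units

    def step(self, x):
        # Hidden state depends on the current input and the previous
        # hidden state held in the context units.
        self.h = sigmoid(self.W_in @ x + self.W_rec @ self.h)
        return sigmoid(self.W_out @ self.h)

# Usage: feed a short one-hot sequence through the network.
net = SimpleRecurrentNetwork(n_in=4, n_hidden=8, n_out=2)
outputs = [net.step(x) for x in np.eye(4)]
```

In this sketch, only the forward pass is shown; the recurrent weights `W_rec` are the ones that full-gradient schemes such as RTRL or BPTT would train by unrolling or accumulating gradients through time.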