论文信息 - Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets - 字舞流文

Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets

Jürgen Schmidhuber | Douglas Eck | Felix A. Gers | Juan Antonio Pérez-Ortiz | J. Schmidhuber | F. Gers | D. Eck | J. A. Pérez-Ortiz | Felix Alexander Gers

[1] G. Miller. Learning to Forget , 2004, Science.

[2] Jürgen Schmidhuber,et al. LSTM recurrent networks learn simple context-free and context-sensitive languages , 2001, IEEE Trans. Neural Networks.

[3] S. Haykin. Kalman Filtering and Neural Networks , 2001 .

[4] Sepp Hochreiter,et al. Learning to Learn Using Gradient Descent , 2001, ICANN.

[5] Bram Bakker,et al. Reinforcement Learning with Long Short-Term Memory , 2001, NIPS.

[6] Yoshua Bengio,et al. Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[7] Janet Wiles,et al. Context-free and context-sensitive dynamics in recurrent neural networks , 2000, Connect. Sci..

[8] Jürgen Schmidhuber,et al. Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[9] Stephan K. Chalup,et al. Hill climbing in recurrent neural networks for learning the a/sup n/b/sup n/c/sup n/ language , 1999, ICONIP'99. ANZIIS'99 & ANNES'99 & ACNN'99. 6th International Conference on Neural Information Processing. Proceedings (Cat. No.99EX378).

[10] Paul Rodríguez,et al. A Recurrent Neural Network that Learns to Count , 1999, Connect. Sci..

[11] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .

[12] Janet Wiles,et al. Recurrent Neural Networks Can Learn to Implement Symbol-Sensitive Counting , 1997, NIPS.

[13] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[14] Barak A. Pearlmutter. Gradient calculations for dynamic recurrent neural networks: a survey , 1995, IEEE Trans. Neural Networks.

[15] R. Shillcock,et al. Proceedings of the Seventeenth Annual Conference of the Cognitive Science Society , 1995 .

[16] Ronald J. Williams,et al. Gradient-based learning algorithms for recurrent networks and their computational complexity , 1995 .

[17] Janet Wiles,et al. Learning to count without a counter: A case study of dynamics and activation landscapes in recurrent networks , 1995 .

[18] G. V. Puskorius,et al. Training controllers for robustness: multi-stream DEKF , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[19] Lee A. Feldkamp,et al. Neurocontrol of nonlinear dynamical systems with Kalman filter trained recurrent networks , 1994, IEEE Trans. Neural Networks.

[20] Jürgen Schmidhuber,et al. A Fixed Size Storage O(n3) Time Complexity Learning Algorithm for Fully Recurrent Continually Running Networks , 1992, Neural Computation.

[21] Frank Fallside,et al. A recurrent error propagation network speech recognition system , 1991 .

[22] Jing Peng,et al. An Efficient Gradient-Based Algorithm for On-Line Training of Recurrent Network Trajectories , 1990, Neural Computation.

[23] Jeffrey L. Elman,et al. Finding Structure in Time , 1990, Cogn. Sci..

[24] David Zipser,et al. Learning Sequential Structure with the Real-Time Recurrent Learning Algorithm , 1991, Int. J. Neural Syst..

[25] Ronald J. Williams,et al. A Learning Algorithm for Continually Running Fully Recurrent Neural Networks , 1989, Neural Computation.

[26] Jürgen Schmidhuber,et al. A Local Learning Algorithm for Dynamic Feedforward and Recurrent Networks , 1989 .

[27] Geoffrey E. Hinton,et al. Experiments on Learning by Back Propagation. , 1986 .