论文信息 - Catastrophic Forgetting in Reinforcement-Learning Environments - 字舞流文

Catastrophic Forgetting in Reinforcement-Learning Environments

ii Acknowledgements iii

[1] Robert M. French,et al. Using Semi-Distributed Representations to Overcome Catastrophic Forgetting in Connectionist Networks , 1991 .

[2] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.

[3] Fred Henrik Hamker,et al. Life-long learning Cell Structures--continuously learning without catastrophic interference , 2001, Neural Networks.

[4] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.

[5] Doina Precup,et al. Combining TD-learning with Cascade-correlation Networks , 2003, ICML.

[6] Marcus Frean,et al. Catastrophic forgetting in simple networks: an analysis of the pseudorehearsal solution. , 1999, Network.

[7] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[8] R. French. Dynamically constraining connectionist networks to produce distributed, orthogonal representations to reduce catastrophic interference , 2019, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.

[9] Risto Miikkulainen,et al. Reinforcement learning in high-diameter, continuous environments , 2007 .

[10] J. Kruschke,et al. ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.

[11] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[12] G. Deco,et al. An Information-Theoretic Approach to Neural Computing , 1997, Perspectives in Neural Computing.

[13] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[14] Long Ji Lin,et al. Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[15] Benjamin W. Wah,et al. Global Optimization for Neural Network Training , 1996, Computer.

[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[17] Tony R. Martinez,et al. The general inefficiency of batch training for gradient descent learning , 2003, Neural Networks.

[18] Anthony V. Robins,et al. Sequential learning in neural networks: A review and a discussion of pseudorehearsal based methods , 2004, Intell. Data Anal..

[19] R. French,et al. Catastrophic Forgetting in Connectionist Networks: Causes, Consequences and Solutions , 1994 .

[20] Bernard Ans. Sequential Learning in Distributed Neural Networks without Catastrophic Forgetting: A Single and Realistic Self-Refreshing Memory Can Do It , 2004 .

[21] Matthew W. Mitchell,et al. Using Markov-k Memory for Problems with Hidden-State , 2003, MLMTA.

[22] Sebastian Thrun,et al. Learning One More Thing , 1994, IJCAI.

[23] Nathan Rountree,et al. Initialising Neural Networks with Prior Knowledge , 2006 .

[24] Gerald Tesauro,et al. TD-Gammon: A Self-Teaching Backgammon Program , 1995 .

[25] Sebastian Thrun,et al. A lifelong learning perspective for mobile robot control , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[26] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.

[27] B. Baddeley,et al. Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[28] Kevin Gurney,et al. Neural networks for perceptual processing: from simulation tools to theories. , 2007, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[29] Hongxing Li,et al. Hardware implementation of the quadruple inverted pendulum with single motor , 2004 .

[30] Anthony V. Robins,et al. Catastrophic Forgetting and the Pseudorehearsal Solution in Hopfield-type Networks , 1998, Connect. Sci..

[31] Anthony V. Robins,et al. Learning and Generalisation in a Stable Network , 1997, ICONIP.

[32] Alan Bundy,et al. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .

[33] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[34] Robert M. French,et al. Self-refreshing memory in artificial neural networks: learning temporal sequences without catastrophic forgetting , 2004, Connect. Sci..

[35] Anthony V. Robins,et al. Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[36] Serban C. Musca,et al. Preventing Catastrophic Interference in Multiple-Sequence Learning Using Coupled Reverberating Elman Networks , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[37] Noel E. Sharkey,et al. An Analysis of Catastrophic Interference , 1995, Connect. Sci..

[38] Pentti Kanerva,et al. Sparse Distributed Memory , 1988 .