Catastrophic Forgetting in Reinforcement-Learning Environments
暂无分享,去创建一个
[1] Robert M. French,et al. Using Semi-Distributed Representations to Overcome Catastrophic Forgetting in Connectionist Networks , 1991 .
[2] Mark B. Ring. Continual learning in reinforcement environments , 1995, GMD-Bericht.
[3] Fred Henrik Hamker,et al. Life-long learning Cell Structures--continuously learning without catastrophic interference , 2001, Neural Networks.
[4] Longxin Lin. Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.
[5] Doina Precup,et al. Combining TD-learning with Cascade-correlation Networks , 2003, ICML.
[6] Marcus Frean,et al. Catastrophic forgetting in simple networks: an analysis of the pseudorehearsal solution. , 1999, Network.
[7] Leslie Pack Kaelbling,et al. Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.
[8] R. French. Dynamically constraining connectionist networks to produce distributed, orthogonal representations to reduce catastrophic interference , 2019, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.
[9] Risto Miikkulainen,et al. Reinforcement learning in high-diameter, continuous environments , 2007 .
[10] J. Kruschke,et al. ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.
[11] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[12] G. Deco,et al. An Information-Theoretic Approach to Neural Computing , 1997, Perspectives in Neural Computing.
[13] Leemon C. Baird,et al. Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.
[14] Long Ji Lin,et al. Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.
[15] Benjamin W. Wah,et al. Global Optimization for Neural Network Training , 1996, Computer.
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[17] Tony R. Martinez,et al. The general inefficiency of batch training for gradient descent learning , 2003, Neural Networks.
[18] Anthony V. Robins,et al. Sequential learning in neural networks: A review and a discussion of pseudorehearsal based methods , 2004, Intell. Data Anal..
[19] R. French,et al. Catastrophic Forgetting in Connectionist Networks: Causes, Consequences and Solutions , 1994 .
[20] Bernard Ans. Sequential Learning in Distributed Neural Networks without Catastrophic Forgetting: A Single and Realistic Self-Refreshing Memory Can Do It , 2004 .
[21] Matthew W. Mitchell,et al. Using Markov-k Memory for Problems with Hidden-State , 2003, MLMTA.
[22] Sebastian Thrun,et al. Learning One More Thing , 1994, IJCAI.
[23] Nathan Rountree,et al. Initialising Neural Networks with Prior Knowledge , 2006 .
[24] Gerald Tesauro,et al. TD-Gammon: A Self-Teaching Backgammon Program , 1995 .
[25] Sebastian Thrun,et al. A lifelong learning perspective for mobile robot control , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).
[26] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.
[27] B. Baddeley,et al. Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).
[28] Kevin Gurney,et al. Neural networks for perceptual processing: from simulation tools to theories. , 2007, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.
[29] Hongxing Li,et al. Hardware implementation of the quadruple inverted pendulum with single motor , 2004 .
[30] Anthony V. Robins,et al. Catastrophic Forgetting and the Pseudorehearsal Solution in Hopfield-type Networks , 1998, Connect. Sci..
[31] Anthony V. Robins,et al. Learning and Generalisation in a Stable Network , 1997, ICONIP.
[32] Alan Bundy,et al. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .
[33] James L. McClelland,et al. Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .
[34] Robert M. French,et al. Self-refreshing memory in artificial neural networks: learning temporal sequences without catastrophic forgetting , 2004, Connect. Sci..
[35] Anthony V. Robins,et al. Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..
[36] Serban C. Musca,et al. Preventing Catastrophic Interference in Multiple-Sequence Learning Using Coupled Reverberating Elman Networks , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.
[37] Noel E. Sharkey,et al. An Analysis of Catastrophic Interference , 1995, Connect. Sci..
[38] Pentti Kanerva,et al. Sparse Distributed Memory , 1988 .