Catastrophic Forgetting in Reinforcement-Learning Environments

ii Acknowledgements iii

[1]  Robert M. French,et al.  Using Semi-Distributed Representations to Overcome Catastrophic Forgetting in Connectionist Networks , 1991 .

[2]  Mark B. Ring Continual learning in reinforcement environments , 1995, GMD-Bericht.

[3]  Fred Henrik Hamker,et al.  Life-long learning Cell Structures--continuously learning without catastrophic interference , 2001, Neural Networks.

[4]  Longxin Lin Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.

[5]  Doina Precup,et al.  Combining TD-learning with Cascade-correlation Networks , 2003, ICML.

[6]  Marcus Frean,et al.  Catastrophic forgetting in simple networks: an analysis of the pseudorehearsal solution. , 1999, Network.

[7]  Leslie Pack Kaelbling,et al.  Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons , 1991, IJCAI.

[8]  R. French Dynamically constraining connectionist networks to produce distributed, orthogonal representations to reduce catastrophic interference , 2019, Proceedings of the Sixteenth Annual Conference of the Cognitive Science Society.

[9]  Risto Miikkulainen,et al.  Reinforcement learning in high-diameter, continuous environments , 2007 .

[10]  J. Kruschke,et al.  ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.

[11]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[12]  G. Deco,et al.  An Information-Theoretic Approach to Neural Computing , 1997, Perspectives in Neural Computing.

[13]  Leemon C. Baird,et al.  Residual Algorithms: Reinforcement Learning with Function Approximation , 1995, ICML.

[14]  Long Ji Lin,et al.  Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.

[15]  Benjamin W. Wah,et al.  Global Optimization for Neural Network Training , 1996, Computer.

[16]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[17]  Tony R. Martinez,et al.  The general inefficiency of batch training for gradient descent learning , 2003, Neural Networks.

[18]  Anthony V. Robins,et al.  Sequential learning in neural networks: A review and a discussion of pseudorehearsal based methods , 2004, Intell. Data Anal..

[19]  R. French,et al.  Catastrophic Forgetting in Connectionist Networks: Causes, Consequences and Solutions , 1994 .

[20]  Bernard Ans Sequential Learning in Distributed Neural Networks without Catastrophic Forgetting: A Single and Realistic Self-Refreshing Memory Can Do It , 2004 .

[21]  Matthew W. Mitchell,et al.  Using Markov-k Memory for Problems with Hidden-State , 2003, MLMTA.

[22]  Sebastian Thrun,et al.  Learning One More Thing , 1994, IJCAI.

[23]  Nathan Rountree,et al.  Initialising Neural Networks with Prior Knowledge , 2006 .

[24]  Gerald Tesauro,et al.  TD-Gammon: A Self-Teaching Backgammon Program , 1995 .

[25]  Sebastian Thrun,et al.  A lifelong learning perspective for mobile robot control , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[26]  Andrew W. Moore,et al.  Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.

[27]  B. Baddeley,et al.  Reinforcement Learning in Continuous Time and Space: Interference and Not Ill Conditioning Is the Main Problem When Using Distributed Function Approximators , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[28]  Kevin Gurney,et al.  Neural networks for perceptual processing: from simulation tools to theories. , 2007, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[29]  Hongxing Li,et al.  Hardware implementation of the quadruple inverted pendulum with single motor , 2004 .

[30]  Anthony V. Robins,et al.  Catastrophic Forgetting and the Pseudorehearsal Solution in Hopfield-type Networks , 1998, Connect. Sci..

[31]  Anthony V. Robins,et al.  Learning and Generalisation in a Stable Network , 1997, ICONIP.

[32]  Alan Bundy,et al.  Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .

[33]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[34]  Robert M. French,et al.  Self-refreshing memory in artificial neural networks: learning temporal sequences without catastrophic forgetting , 2004, Connect. Sci..

[35]  Anthony V. Robins,et al.  Catastrophic Forgetting, Rehearsal and Pseudorehearsal , 1995, Connect. Sci..

[36]  Serban C. Musca,et al.  Preventing Catastrophic Interference in Multiple-Sequence Learning Using Coupled Reverberating Elman Networks , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[37]  Noel E. Sharkey,et al.  An Analysis of Catastrophic Interference , 1995, Connect. Sci..

[38]  Pentti Kanerva,et al.  Sparse Distributed Memory , 1988 .