Adversarial Reinforcement Learning
[1] Arthur L. Samuel, et al. Some Studies in Machine Learning Using the Game of Checkers, 1967, IBM J. Res. Dev.
[2] F. A. Seiler, et al. Numerical Recipes in C: The Art of Scientific Computing, 1989.
[3] C. Atkeson, et al. Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time, 1993, Machine Learning.
[4] Andrew W. Moore, et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function, 1994, NIPS.
[5] Sebastian Thrun, et al. Learning to Play the Game of Chess, 1994, NIPS.
[6] Michael L. Littman, et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning, 1994, ICML.
[7] Gerald Tesauro, et al. Temporal Difference Learning and TD-Gammon, 1995, J. Int. Comput. Games Assoc.
[8] Leemon C. Baird, et al. Residual Algorithms: Reinforcement Learning with Function Approximation, 1995, ICML.
[9] Andrew W. Moore, et al. Reinforcement Learning: A Survey, 1996, J. Artif. Intell. Res.
[10] Andrew W. Moore, et al. Learning Evaluation Functions for Large Acyclic Domains, 1996, ICML.
[11] Andrew McCallum, et al. Reinforcement learning with selective perception and hidden state, 1996.
[12] Thomas G. Dietterich. What is machine learning?, 2020, Archives of Disease in Childhood.
[13] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[14] J. Ross Quinlan, et al. Induction of Decision Trees, 1986, Machine Learning.