Parallel reinforcement learning using multiple reward signals
暂无分享,去创建一个
[1] Justin A. Boyan,et al. Modular Neural Networks for Learning Context-Dependent Game Strategies , 2007 .
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] Reuven Y. Rubinstein,et al. Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.
[4] H. Raiffa,et al. Decisions with Multiple Objectives , 1993 .
[5] Michael I. Jordan,et al. A Modular Connectionist Architecture For Learning Piecewise Control Strategies , 1991, 1991 American Control Conference.
[6] Doina Precup,et al. Eligibility Traces for Off-Policy Policy Evaluation , 2000, ICML.
[7] Hirotaka Nakayama,et al. Theory of Multiobjective Optimization , 1985 .
[8] Bernard Widrow,et al. Adaptive switching circuits , 1988 .
[9] Drew McDermott,et al. Introduction to artificial intelligence , 1986, Addison-Wesley series in computer science.
[10] Sanjoy Dasgupta,et al. Off-Policy Temporal Difference Learning with Function Approximation , 2001, ICML.