暂无分享,去创建一个
[1] Peter L. Bartlett,et al. Infinite-Horizon Policy-Gradient Estimation , 2001, J. Artif. Intell. Res..
[2] Xin Xu,et al. Policy gradient fuzzy reinforcement learning , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).
[3] T. Horiuchi,et al. Fuzzy interpolation-based Q-learning with continuous states and actions , 1996, Proceedings of IEEE 5th International Fuzzy Systems.
[4] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[5] Masaharu Mizumoto,et al. Fuzzy controls under various fuzzy reasoning methods , 1988, Inf. Sci..
[6] Yukinobu Hoshino,et al. A Proposal of Reinforcement Learning with Fuzzy Environment Evaluation Rules and Its Application to Chess , 2001 .
[7] Harukazu Igarashi,et al. Policy Gradient Reinforcement Learning with a Fuzzy Controller for Policy: Decision Making in RoboCup Soccer Small Size League , 2014 .
[8] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[9] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[10] Chi-Hyon Oh,et al. Initialization of Q-values by fuzzy rules for accelerating Q-learning , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).
[11] Masaomi Kimura,et al. Reinforcement Learning in Non-Markov Decision Processes: Statistical Properties of Characteristic Eligibility , 2008 .
[12] Lionel Jouffe,et al. Fuzzy inference system learning by reinforcement methods , 1998, IEEE Trans. Syst. Man Cybern. Part C.
[13] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[14] H. Igarashi,et al. An Algorithm of Policy Gradient Reinforcement Learning with a Fuzzy Controller in Policies , 2013 .
[15] H.R. Berenji,et al. Cooperation and coordination between fuzzy reinforcement learning agents in continuous state partially observable Markov decision processes , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).