Actor-critic-based ink drop spread as an intelligent controller
暂无分享,去创建一个
Hesam Sagha | Iman Esmaili Paeen Afrakoti | Saeed Bagherishouraki | I. Afrakoti | H. Sagha | Saeed Bagherishouraki
[1] Azeddine Draou,et al. A Variable Gain PI Controller Used for Speed Control of a Direct Torque Neuro Fuzzy Controlled Induction Machine Drive , 2007 .
[2] David W. Aha,et al. Instance-Based Learning Algorithms , 1991, Machine Learning.
[3] Nakaji Honda,et al. Simulation of brain learning process through a novel fuzzy hardware approach , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).
[4] Xuesong Wang,et al. A fuzzy Actor-Critic reinforcement learning network , 2007, Inf. Sci..
[5] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[6] G. Feng,et al. A Survey on Analysis and Design of Model-Based Fuzzy Control Systems , 2006, IEEE Transactions on Fuzzy Systems.
[7] Andrew G. Barto,et al. Learning to Act Using Real-Time Dynamic Programming , 1995, Artif. Intell..
[8] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[9] M.A. Wiering,et al. Reinforcement Learning in Continuous Action Spaces , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.
[10] Hamid R. Berenji,et al. Learning and tuning fuzzy logic controllers through reinforcements , 1992, IEEE Trans. Neural Networks.
[11] Masayuki Murakami,et al. A study on the modeling ability of the IDS method: A soft computing technique using pattern-based information processing , 2007, Int. J. Approx. Reason..
[12] M. Schaale,et al. Application of the active learning method to the retrieval of pigment from spectral remote sensing reflectance data , 2009 .
[13] P. Kokotovic,et al. Nonlinear control via approximate input-output linearization: the ball and beam example , 1992 .
[14] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[15] Alfred C. Rufer,et al. JOE: a mobile, inverted pendulum , 2002, IEEE Trans. Ind. Electron..
[16] M.A. Wiering,et al. Two Novel On-policy Reinforcement Learning Algorithms based on TD(λ)-methods , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.