A reinforcement learning control scheme for nonlinear systems with multiple actions

This paper attempts to apply reinforcement learning schemes to the adaptive control of nonlinear systems with multiple continuous control actions. The control task is formulated as a sequential optimization problem. A learning algorithm is developed from the concepts of dynamic programming and stochastic approximation, together with the techniques of random search and parameter estimation. The proposed algorithm is complete and general enough that the controller can be built from various computing models, e.g., neural networks. The efficiency of the algorithm is demonstrated by applying it to nonlinear control problems with multiple control actions.

Chung Chen | Chi-Cheng Jou

[1] Vijaykumar Gullapalli, et al. A stochastic reinforcement learning algorithm for learning real-valued functions, 1990, Neural Networks.

[2] P. Anandan, et al. Pattern-recognizing stochastic learning automata, 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[3] Albert Y. Zomaya. Reinforcement learning for the adaptive control of nonlinear systems, 1994.

[4] Richard S. Sutton, et al. Reinforcement Learning is Direct Adaptive Optimal Control, 1992, 1991 American Control Conference.

[5] Q. H. Wu, et al. Reinforcement learning control of unknown dynamic systems, 1993.