This paper describes a learning control system using a reinforcement technique. The controller is capable of controlling a plant that may be nonlinear and nonstationary. The only a priori information required by the controller is the order of the plant. The approach is to design a controller which partitions the control measurement space into sets called control situations and then learns the best control choice for each control situation. The control measurements are those indicating the state of the plant and environment. The learning is accomplished by reinforcement of the probability of choosing a particular control choice for a given control situation. The system was stimulated on an IBM 1710-GEDA hybrid computer facility. Experimental results obtained from the simulation are presented.
[1]
J. Coales,et al.
An on-off servo mechanism with predicted change-over
,
1956
.
[2]
George S. Sebestyen,et al.
Pattern recognition by an adaptive process of sample set construction
,
1962,
IRE Trans. Inf. Theory.
[3]
M. A. Aizerman.
Automatic control learning systems (in the light of experiments on teaching the systems pattern recognition)
,
1963
.
[4]
M. D. Mesarovic,et al.
Self-Organizing Control Systems
,
1964,
IEEE Transactions on Applications and Industry.
[5]
Bernard Widrow,et al.
Pattern Recognition and Adaptive Control
,
1964,
IEEE Transactions on Applications and Industry.