A Reward Scheme for Production Systems with Overlapping Conflict Sets
暂无分享,去创建一个
[1] Thomas H. Westerdale,et al. An Application of Fisher's Theorem on Natural Selection to Some Re-enforcement Algorithms for Choice Strategies , 1974 .
[2] King-Sun Fu,et al. Formulation of learning automata and automata games , 1969, Inf. Sci..
[3] J. McDermott,et al. Production system conflict resolution strategies , 1977, SGAR.
[4] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..
[5] John H. Holland,et al. Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .
[6] Norio Baba. The absolutely expedient nonlinear reinforcement schemes under the unknown multiteacher environment , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[7] M. L. Tsetlin. On the Behavior of Finite Automata in Random Media , 1961 .
[8] Akihiro Takeuchi,et al. Random environments and automata , 1975, Inf. Sci..
[9] Kumpati S. Narendra,et al. Learning Automata - A Survey , 1974, IEEE Trans. Syst. Man Cybern..
[10] Ian H. Witten,et al. An Adaptive Optimal Controller for Discrete-Time Markov Environments , 1977, Inf. Control..