论文信息 - Learning Functions in k-DNF from Reinforcement - 字舞流文

Learning Functions in k-DNF from Reinforcement

Leslie Pack Kaelbling | L. Kaelbling

[1] Bernard Widrow,et al. Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..

[2] Leslie G. Valiant,et al. A theory of the learnable , 1984, STOC '84.

[3] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .

[4] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[5] Leslie G. Valiant,et al. Learning Disjunction of Conjunctions , 1985, IJCAI.

[6] Charles W. Anderson,et al. Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning) , 1986 .

[7] Bernard Widrow,et al. Adaptive switching circuits , 1988 .

[8] Leslie Pack Kaelbling,et al. A Formal Framework for Learning in Embedded Systems , 1989, ML.