Learning Functions in k-DNF from Reinforcement
暂无分享,去创建一个
[1] Bernard Widrow,et al. Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..
[2] Leslie G. Valiant,et al. A theory of the learnable , 1984, STOC '84.
[3] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .
[4] P. Anandan,et al. Pattern-recognizing stochastic learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.
[5] Leslie G. Valiant,et al. Learning Disjunction of Conjunctions , 1985, IJCAI.
[6] Charles W. Anderson,et al. Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning) , 1986 .
[7] Bernard Widrow,et al. Adaptive switching circuits , 1988 .
[8] Leslie Pack Kaelbling,et al. A Formal Framework for Learning in Embedded Systems , 1989, ML.