Explaining Temporal Differences to Create Useful Concepts for Evaluating States
暂无分享,去创建一个
Paul E. Utgoff | Andrew G. Barto | Sharad Saxena | Richard C. Yee | A. Barto | R. Yee | P. Utgoff | S. Saxena
[1] Arthur L. Samuel,et al. Some Studies in Machine Learning Using the Game of Checkers , 1967, IBM J. Res. Dev..
[2] Richard Waldinger,et al. Achieving several goals simultaneously , 1977 .
[3] Steven A. Vere,et al. Multilevel Counterfactuals for Generalizations of Relational Concepts and Productions , 1980, Artif. Intell..
[4] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[5] Allen Newell,et al. Some Chunks Are Expensive , 1988, ML.
[6] Richard S. Sutton,et al. Sequential Decision Problems and Neural Networks , 1989, NIPS 1989.
[7] A. Barto,et al. Learning and Sequential Decision Making , 1989 .
[8] Milind Tambe,et al. Eliminating Expensive Chunks by Restricting Expressiveness , 1989, IJCAI.
[9] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[10] Steven Minton,et al. Quantitative Results Concerning the Utility of Explanation-based Learning , 1988, Artif. Intell..