Automatic Programming of Behavior-Based Robots Using Reinforcement Learning
暂无分享,去创建一个
[1] Leslie Pack Kaelbling,et al. Learning in embedded systems , 1993 .
[2] Steven D. Whitehead,et al. A Complexity Analysis of Cooperative Mechanisms in Reinforcement Learning , 1991, AAAI.
[3] Long Ji Lin,et al. Programming Robots Using Reinforcement Learning and Teaching , 1991, AAAI.
[4] Lambert E. Wixson,et al. Scaling Reinforcement Learning Techniques via Modularity , 1991, ML.
[5] Satinder P. Singh,et al. Transfer of Learning Across Compositions of Sequentail Tasks , 1991, ML.
[6] Long-Ji Lin,et al. Self-improving reactive agents: case studies of reinforcement learning frameworks , 1991 .
[7] Benjamin Kuipers,et al. Learning hill-climbing functions as a strategy for generating behaviors in a mobile robot , 1991 .
[8] David R. Pierce,et al. Learning a Set of Primitive Actions with an Uninterpreted Sensorimotor Apparatus , 1991, ML.
[9] Gary L. Drescher,et al. Made-up minds - a constructivist approach to artificial intelligence , 1991 .
[10] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.
[11] Rodney A. Brooks,et al. Learning to Coordinate Behaviors , 1990, AAAI.
[12] Jonathan H. Connell,et al. Minimalist mobile robotics - a colony-style architecture for an artificial creature , 1990, Perspectives in artificial intelligence.
[13] Andrew K. C. Wong,et al. Performance Analysis of a Probabilistic Inductive Learning System , 1990, ML.
[14] Claude Sammut,et al. Is Learning Rate a Good Performance Criterion for Learning? , 1990, ML.
[15] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[16] Alan D. Christiansen,et al. Learning reliable manipulation strategies without initial physical models , 1990, Proceedings., IEEE International Conference on Robotics and Automation.
[17] Rodney A. Brooks,et al. The Behavior Language: User''s Guide , 1990 .
[18] Ming Tan,et al. Cost-Sensitive Concept Learning of Sensor Use in Approach ad Recognition , 1989, ML.
[19] Ronald L. Rivest,et al. Inference of finite automata using homing sequences , 1989, STOC '89.
[20] C. Watkins. Learning from delayed rewards , 1989 .
[21] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .
[22] Hans P. Moravec,et al. High resolution maps from wide angle sonar , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.
[23] Richard S. Sutton,et al. Temporal credit assignment in reinforcement learning , 1984 .
[24] Richard S. Sutton,et al. Neuronlike adaptive elements that can solve difficult learning control problems , 1983, IEEE Transactions on Systems, Man, and Cybernetics.
[25] Tom M. Mitchell,et al. Generalization as Search , 2002 .