Robust performance of autonomous robots in unstructured environments.

The problem of navigating in and interacting with an unstructured environment presents challenges to traditional learning and control approaches. However, the nature of emergency response situations requires that autonomous robots’ performance be robust to unmodeled environments and unexpected challenges. One approach to providing this capability is presented here: S-Learning. S-Learning, an experience-based learning algorithm, is implemented in the control of a seven degree-offreedom robotic arm. S-Learning stores sequences of discretized (discrete in time), quantized (discrete in magnitude), and categorical (uninterpreted) sensor data and actuator commands. Handling the data in this way removes explicit models about the environment, robot kinematics, dynamics, and structure. Instead, a bootstrapped model is generated on the fly by observing sequences of sensory and command events. S-Learning is based on a neuro-psychological model of learning and movement control in humans and seeks to mimic the strategies used by the brain to solve this problem.

[1]  Jürgen Schmidhuber,et al.  Reinforcement Learning with Self-Modifying Policies , 1998, Learning to Learn.

[2]  Lorien Y. Pratt,et al.  A Survey of Connectionist Network Reuse Through Transfer , 1998, Learning to Learn.

[3]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[4]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[5]  Sebastian Thrun,et al.  Learning to Learn , 1998, Springer US.

[6]  Jonathan Baxter,et al.  Theoretical Models of Learning to Learn , 1998, Learning to Learn.

[7]  D. Shepard A two-dimensional interpolation function for irregularly-spaced data , 1968, ACM National Conference.

[8]  Nathan Intrator,et al.  Making a Low-Dimensional Representation Suitable for Diverse Tasks , 1998, Learning to Learn.

[9]  Sebastian Thrun,et al.  Learning to Learn: Introduction and Overview , 1998, Learning to Learn.

[10]  Nathan Intrator,et al.  Making a Low-dimensional Representation Suitable for Diverse Tasks , 1996, Connect. Sci..

[11]  E. R. Crossman A THEORY OF THE ACQUISITION OF SPEED-SKILL∗ , 1959 .

[12]  B. Rohrer,et al.  S-Learning: A Biomimetic Algorithm for Learning, Memory, and Control in Robots , 2007, 2007 3rd International IEEE/EMBS Conference on Neural Engineering.

[13]  Joshua B. Tenenbaum,et al.  Bayesian Modeling of Human Concept Learning , 1998, NIPS.