Paper information - Towards Online Learning from Corrective Demonstrations

Towards Online Learning from Corrective Demonstrations

Robots operating in real-world human environments will likely encounter task execution failures. To address this, we would like to allow co-present humans to refine the robot's task model as errors are encountered. Existing approaches to task model modification require reasoning over the entire dataset and model, limiting the rate of corrective updates. We introduce the State-Indexed Task Updates (SITU) algorithm to efficiently incorporate corrective demonstrations into an existing task model by iteratively making local updates that only require reasoning over a small subset of the model. In future work, we will evaluate this approach with a user study.

Andrea Lockerd Thomaz | Scott Niekum | Elaine Schaertl Short | Reymundo Gutierrez
