Industrial robots are fenced for safety reasons because of hard-programmed robot behavior, containing implicit assumptions about the environment. A knowledge base (KB), containing objects and robot skills, can be built to make assumptions explicit [1] and classic AI search-, reasonand planning-algorithms can be deployed. Advances in robotic behavior representation via Behavior Trees (BT), skills with preand post-conditions [2] and Intuitive Programming [3] result in human understandable, explainable and therefore trustworthy task executions facilitating Human-Robot Collaboration. Considering a task to navigate around an obstacle, non-expert robot programmers tend to set a way-point at the startand end-point, not considering the robot calculating a direct trajectory, resulting in a crash. This shows that despite the above-mentioned efforts, humans still imply assumptions, resulting in incomplete, imperfect and incorrect task descriptions, demanding autonomy from the robotic agent. Autonomy can be reached with a learning algorithm exploring the continuous robotic action spaces efficiently. Reinforcement Learning (RL) algorithms utilize (raw) data stored in databases to learn a behavior that solves a specific problem under uncertainties [4]. Off-policy learning can be used to learn from human demonstration examples [5]. Finding solutions in high-dimensions combined with reward-sparsity is computationally expensive. If learning is realized in an end-toend fashion, the found solutions can be non-intuitive, impairing the possibility for Human-Robot Collaboration. Related work, combining robot behavior representation through BT and learning the structure of the BT, can be found in [6]. Furthermore, the leaves of the BT can be learned, which represent, but are not limited to, robot skills [7].
[1]
Danica Kragic,et al.
Deep predictive policy training using reinforcement learning
,
2017,
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[2]
Petter Ögren,et al.
Learning of Behavior Trees for Autonomous Agents
,
2015,
IEEE Transactions on Games.
[3]
Francesco Rovida,et al.
Extended behavior trees for quick definition of flexible robotic tasks
,
2017,
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[4]
Petter Ögren,et al.
Adding Neural Network Controllers to Behavior Trees without Destroying Performance Guarantees
,
2018,
ArXiv.
[5]
Danica Kragic,et al.
A sensorimotor reinforcement learning framework for physical Human-Robot Interaction
,
2016,
2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[6]
Jacek Malec,et al.
Ontology-Based Knowledge Representation for Increased Skill Reusability in Industrial Robots
,
2018,
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).