Constructing policies for supportive behaviors and communicative actions in human-robot teaming

Current state-of-the-art robotic systems deployed in industry work in isolation from humans and do not allow for collaboration. Developing a robot that can work side-by-side with a human presents the advantage of allowing both the robot and the human worker to focus on the task each is best suited for, while assisting one another as needed. For the robot to provide assistive behavior to a human co-worker, it needs to learn what actions it should perform at each time step depending upon the state of the task. Such assistive actions are not intended to simply contribute to the completion of a particular task by instructing the robot to work on subtasks in isolation from the human worker; rather they are meant to help the worker complete the task more efficiently. As such, employing standard policy search or task and motion planning techniques is not sufficient to discover the supportive types of actions my system seeks to offer based on accurate estimations of the current task state. To this end, my research focuses on investigating policy search within hierarchical tasks that allow for two main abilities, namely helping the human co-worker more effectively complete a task and taking communicative actions that reduce state estimation uncertainty by asking the worker direct questions. The policy dictates what action the robot should take at each time step, based on inputs from a motion capture system providing observations about the configuration of the person's hands relative to the objects needed for accomplishing the task, as well as the person's answers to any questions posed by the robot.

[1]  Douglas A. Few,et al.  Improved Human-Robot Teaming through Facilitated Initiative , 2006, ROMAN 2006 - The 15th IEEE International Symposium on Robot and Human Interactive Communication.

[2]  Cynthia Breazeal,et al.  Improved human-robot team performance using Chaski, A human-inspired plan execution system , 2011, 2011 6th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[3]  Jan Peters,et al.  A Survey on Policy Search for Robotics , 2013, Found. Trends Robotics.

[4]  Brian Scassellati,et al.  Effective robot teammate behaviors for supporting sequential manipulation tasks , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[5]  Joelle Pineau,et al.  Policy-contingent abstraction for robust robot control , 2002, UAI.

[6]  Barry D. Nichols Reinforcement learning in continuous state- and action-space , 2014 .

[7]  Hado van Hasselt,et al.  Reinforcement Learning in Continuous State and Action Spaces , 2012, Reinforcement Learning.

[8]  Sridhar Mahadevan,et al.  Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..

[9]  Illah R. Nourbakhsh,et al.  Human-robot teaming for search and rescue , 2005, IEEE Pervasive Computing.

[10]  Brian Scassellati,et al.  A developmentally inspired transfer learning approach for predicting skill durations , 2014, 4th International Conference on Development and Learning and on Epigenetic Robotics.