Learning Human Behaviors for Robot-Assisted Dressing

We investigate robotic assistants for dressing that can anticipate the motion of the person who is being helped. To this end, we use reinforcement learning to create models of human behavior during assistance with dressing. To explore this kind of interaction, we assume that the robot presents an open sleeve of a hospital gown to a person, and that the person moves their arm into the sleeve. The controller that models the person's behavior is given the position of the end of the sleeve and information about contact between the person's hand and the fabric of the gown. We simulate this system with a human torso model that has realistic joint ranges, a simple robot gripper, and a physics-based cloth model for the gown. Through reinforcement learning (specifically the TRPO algorithm) the system creates a model of human behavior that is capable of placing the arm into the sleeve. We aim to model what humans are capable of doing, rather than what they typically do. We demonstrate successfully trained human behaviors for three robot-assisted dressing strategies: 1) the robot gripper holds the sleeve motionless, 2) the gripper moves the sleeve linearly towards the person from the front, and 3) the gripper moves the sleeve linearly from the side.

[1]  Sergey Levine,et al.  High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.

[2]  C. Karen Liu,et al.  What does the person feel? Learning to infer applied forces during robot-assisted dressing , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[3]  Rachid Alami,et al.  A Human Aware Mobile Robot Motion Planner , 2007, IEEE Transactions on Robotics.

[4]  C. Karen Liu,et al.  Optimal feedback control for character animation using an abstract model , 2010, ACM Trans. Graph..

[5]  Takamitsu Matsubara,et al.  Learning motor skills with non-rigid materials by reinforcement learning , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[6]  Wendy A. Rogers,et al.  Identifying the Potential for Robotics to Assist Older Adults in Different Living Environments , 2014, Int. J. Soc. Robotics.

[7]  Tae-Yong Kim,et al.  Unified particle physics for real-time applications , 2014, ACM Trans. Graph..

[8]  M. V. D. Panne,et al.  SIMBICON: simple biped locomotion control , 2007, SIGGRAPH 2007.

[9]  Pieter Abbeel,et al.  Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.

[10]  Yiannis Demiris,et al.  User modelling for personalised dressing assistance by humanoid robots , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[11]  Greg Chance,et al.  A Quantitative Analysis of Dressing Dynamics for Robotic Dressing Assistance , 2017, Front. Robot. AI.

[12]  Manuela M. Veloso,et al.  Personalized Assistance for Dressing Users , 2015, ICSR.

[13]  Terrence E Murphy,et al.  Disability in activities of daily living, depression, and quality of life among older medical ICU survivors: a prospective cohort study , 2011, Health and quality of life outcomes.

[14]  Sergey Levine,et al.  Trust Region Policy Optimization , 2015, ICML.

[15]  C. Karen Liu,et al.  Animating human dressing , 2015, ACM Trans. Graph..

[16]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[17]  C. Karen Liu,et al.  Haptic simulation for robot-assisted dressing , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Siddhartha S. Srinivasa,et al.  Toward seamless human-robot handovers , 2013, Journal of Human-Robot Interaction.

[19]  C. Karen Liu Dextrous manipulation from a grasping pose , 2009, SIGGRAPH 2009.

[20]  J. Wiener,et al.  Measuring the activities of daily living: comparisons across national surveys. , 1990, Journal of gerontology.

[21]  Takamitsu Matsubara,et al.  Reinforcement learning of clothing assistance with a dual-arm robot , 2011, 2011 11th IEEE-RAS International Conference on Humanoid Robots.

[22]  Greg Turk,et al.  Learning to navigate cloth using haptics , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  Alex Graves,et al.  Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[24]  Siddhartha S. Srinivasa,et al.  Human preferences for robot-human hand-over configurations , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  Sylvain Calinon,et al.  Learning adaptive dressing assistance from human demonstration , 2017, Robotics Auton. Syst..