Learning latent actions to control assistive robots

Assistive robot arms enable people with disabilities to conduct everyday tasks on their own. These arms are dexterous and high-dimensional; however, the interfaces people must use to control their robots are low-dimensional. Consider teleoperating a 7-DoF robot arm with a 2-DoF joystick. The robot is helping you eat dinner, and currently you want to cut a piece of tofu. Today’s robots assume a pre-defined mapping between joystick inputs and robot actions: in one mode the joystick controls the robot’s motion in the x–y plane, in another mode the joystick controls the robot’s z–yaw motion, and so on. But this mapping misses out on the task you are trying to perform! Ideally, one joystick axis should control how the robot stabs the tofu, and the other axis should control different cutting motions. Our insight is that we can achieve intuitive, user-friendly control of assistive robots by embedding the robot’s high-dimensional actions into low-dimensional and human-controllable latent actions. We divide this process into three parts. First, we explore models for learning latent actions from offline task demonstrations, and formalize the properties that latent actions should satisfy. Next, we combine learned latent actions with autonomous robot assistance to help the user reach and maintain their high-level goals. Finally, we learn a personalized alignment model between joystick inputs and latent actions. We evaluate our resulting approach in four user studies where non-disabled participants reach marshmallows, cook apple pie, cut tofu, and assemble dessert. We then test our approach with two disabled adults who leverage assistive devices on a daily basis.

[1]  Panagiotis K. Artemiadis,et al.  EMG-Based Control of a Robot Arm Using Low-Dimensional Embeddings , 2010, IEEE Transactions on Robotics.

[2]  Yannick Schroecker,et al.  Imitating Latent Policies from Observation , 2018, ICML.

[3]  Philip S. Thomas,et al.  Learning Action Representations for Reinforcement Learning , 2019, ICML.

[4]  Martin A. Riedmiller,et al.  Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images , 2015, NIPS.

[5]  Anca D. Dragan,et al.  Learning from Physical Human Corrections, One Feature at a Time , 2018, 2018 13th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[6]  Siddhartha S. Srinivasa,et al.  Eye-Hand Behavior in Human-Robot Shared Manipulation , 2018, 2018 13th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[7]  Siddhartha S. Srinivasa,et al.  Robot-Assisted Feeding: Generalizing Skewering Strategies across Food Items on a Realistic Plate , 2019, ArXiv.

[8]  M. Spong,et al.  Robot Modeling and Control , 2005 .

[9]  Siddhartha S. Srinivasa,et al.  Is More Autonomy Always Better?: Exploring Preferences of Users with Mobility Impairments in Robot-assisted Feeding , 2020, HRI.

[10]  B. Argall,et al.  Human-in-the-Loop Optimization of Shared Autonomy in Assistive Robotics , 2017, IEEE Robotics and Automation Letters.

[11]  Thilina Dulantha Lalitharatne,et al.  Meal assistance robots: A review on current status, challenges and future directions , 2015, 2015 IEEE/SICE International Symposium on System Integration (SII).

[12]  Anirudha Majumdar,et al.  Learning Task-Driven Control Policies via Information Bottlenecks , 2020, Robotics: Science and Systems.

[13]  José del R. Millán,et al.  Brain-Controlled Wheelchairs: A Robotic Architecture , 2013, IEEE Robotics & Automation Magazine.

[14]  Brenna D. Argall,et al.  Autonomy in Rehabilitation Robotics: An Intersection , 2018, Annu. Rev. Control. Robotics Auton. Syst..

[15]  Oliver Brock,et al.  State Representation Learning in Robotics: Using Prior Knowledge about Physical Interaction , 2014, Robotics: Science and Systems.

[16]  Siddhartha S. Srinivasa,et al.  Assistive teleoperation of robot arms via automatic time-optimal mode switching , 2016, 2016 11th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[17]  Wendy A. Rogers,et al.  Active Robot-Assisted Feeding with a General-Purpose Mobile Manipulator: Design, Evaluation, and Lessons Learned. , 2019 .

[18]  D. Weber,et al.  The role of assistive robotics in the lives of persons with disability. , 2010, American journal of physical medicine & rehabilitation.

[19]  Brenna Argall,et al.  Probabilistic Human Intent Recognition for Shared Autonomy in Assistive Robotics , 2019, ACM transactions on human-robot interaction.

[20]  Ethan K. Gordon,et al.  Adaptive Robot-Assisted Feeding: An Online Learning Framework for Acquiring Previously Unseen Food Items , 2019, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[21]  Chelsea Finn,et al.  Learning Latent Representations to Influence Multi-Agent Interaction , 2020, CoRL.

[22]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.

[23]  Wendy A. Rogers,et al.  Toward Active Robot-Assisted Feeding with a General-Purpose Mobile Manipulator: Design, Evaluation, and Lessons Learned , 2019, ArXiv.

[24]  Todd Murphey,et al.  Data-driven Koopman operators for model-based shared control of human–machine systems , 2020, Int. J. Robotics Res..

[25]  Siddhartha S. Srinivasa,et al.  Transfer Depends on Acquisition: Analyzing Manipulation Strategies for Robotic Feeding , 2019, 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[26]  Anca D. Dragan,et al.  Where Do You Think You're Going?: Inferring Beliefs about Dynamics from Behavior , 2018, NeurIPS.

[27]  Bilge Mutlu,et al.  Shared control–based bimanual robot manipulation , 2019, Science Robotics.

[28]  Nicholas Roy,et al.  Task-Conditioned Variational Autoencoders for Learning Movement Primitives , 2019, CoRL.

[29]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[30]  Carme Torras,et al.  Personalization Framework for Adaptive Robotic Feeding Assistance , 2016, ICSR.

[31]  A. Norberg,et al.  How people with stroke and healthy older people experience the eating process. , 2000, Journal of clinical nursing.

[32]  Siddhartha S. Srinivasa,et al.  HARMONIC: A multimodal dataset of assistive human–robot collaboration , 2018, Int. J. Robotics Res..

[33]  Dorsa Sadigh,et al.  Controlling Assistive Robots with Learned Latent Actions , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[34]  Joseph J. Lim,et al.  Accelerating Reinforcement Learning with Learned Skill Priors , 2020, CoRL.

[35]  Bilge Mutlu,et al.  A Motion Retargeting Method for Effective Mimicry-Based Teleoperation of Robot Arms , 2017, 2017 12th ACM/IEEE International Conference on Human-Robot Interaction (HRI.

[36]  Wendy A Rogers,et al.  Closing the Capacity-Ability Gap: Using Technology to Support Aging With Disability , 2018, Innovation in aging.

[37]  Dorsa Sadigh,et al.  Learning Visually Guided Latent Actions for Assistive Teleoperation , 2021, L4DC.

[38]  Anca D. Dragan,et al.  Shared Autonomy via Deep Reinforcement Learning , 2018, Robotics: Science and Systems.

[39]  Carl Doersch,et al.  Tutorial on Variational Autoencoders , 2016, ArXiv.

[40]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[41]  Sergey Levine,et al.  Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings , 2018, ICML.

[42]  Siddhartha S. Srinivasa,et al.  A policy-blending formalism for shared control , 2013, Int. J. Robotics Res..

[43]  Christian Cipriani,et al.  Real-time myoelectric control of a multi-fingered hand prosthesis using principal components analysis , 2012, Journal of NeuroEngineering and Rehabilitation.

[44]  Siddhartha S. Srinivasa,et al.  Human-Robot Mutual Adaptation in Shared Autonomy , 2017, 2017 12th ACM/IEEE International Conference on Human-Robot Interaction (HRI.

[45]  Sergey Levine,et al.  Learning Latent Plans from Play , 2019, CoRL.

[46]  Dorsa Sadigh,et al.  Shared Autonomy with Learned Latent Actions , 2020, Robotics: Science and Systems.

[47]  Matei T. Ciocarlie,et al.  Hand Posture Subspaces for Dexterous Robotic Grasping , 2009, Int. J. Robotics Res..

[48]  Siddhartha S. Srinivasa,et al.  Shared autonomy via hindsight optimization for teleoperation and teaming , 2017, Int. J. Robotics Res..

[49]  Jeannette Bohg,et al.  Learning User-Preferred Mappings for Intuitive Robot Control , 2020, ArXiv.

[50]  Martial Hebert,et al.  Autonomy infused teleoperation with application to brain computer interface controlled manipulation , 2017, Autonomous Robots.