Learning from demonstration: Teaching a myoelectric prosthesis with an intact limb via reinforcement learning

Prosthetic arms should restore and extend the capabilities of someone with an amputation. They should move naturally and be able to perform elegant, coordinated movements that approximate those of a biological arm. Despite these objectives, the control of modern-day prostheses is often nonintuitive and taxing. Existing devices and control approaches do not yet give users the ability to effect highly synergistic movements during their daily-life control of a prosthetic device. As a step towards improving the control of prosthetic arms and hands, we introduce an intuitive approach to training a prosthetic control system that helps a user achieve hard-to-engineer control behaviours. Specifically, we present an actor-critic reinforcement learning method that for the first time promises to allow someone with an amputation to use their non-amputated arm to teach their prosthetic arm how to move through a wide range of coordinated motions and grasp patterns. We evaluate our method during the myoelectric control of a multi-joint robot arm by non-amputee users, and demonstrate that by using our approach a user can train their arm to perform simultaneous gestures and movements in all three degrees of freedom in the robot's hand and wrist based only on information sampled from the robot and the user's above-elbow myoelectric signals. Our results indicate that this learning-from-demonstration paradigm may be well suited to use by both patients and clinicians with minimal technical knowledge, as it allows a user to personalize the control of his or her prosthesis without having to know the underlying mechanics of the prosthetic limb. These preliminary results also suggest that our approach may extend in a straightforward way to next-generation prostheses with precise finger and wrist control, such that these devices may someday allow users to perform fluid and intuitive movements like playing the piano, catching a ball, and comfortably shaking hands.

[1]  N. A. Bernshteĭn The co-ordination and regulation of movements , 1967 .

[2]  Judy A. Franklin,et al.  Biped dynamic walking using reinforcement learning , 1997, Robotics Auton. Syst..

[3]  Roberto Merletti,et al.  Control of Powered Upper Limb Prostheses , 2004 .

[4]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[5]  Stefan Schaal,et al.  Natural Actor-Critic , 2003, Neurocomputing.

[6]  Roberto Merletti,et al.  Electromyography. Physiology, engineering and non invasive applications , 2005 .

[7]  Francesco Lacquaniti,et al.  Control of Fast-Reaching Movements by Muscle Synergy Combinations , 2006, The Journal of Neuroscience.

[8]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[9]  Farbod Fahimi,et al.  Online human training of a myoelectric prosthesis controller via actor-critic reinforcement learning , 2011, 2011 IEEE International Conference on Rehabilitation Robotics.

[10]  Stefano Stramigioli,et al.  Myoelectric forearm prostheses: state of the art from a user-centered perspective. , 2011, Journal of rehabilitation research and development.

[11]  Matthew P. Para,et al.  Control System Architecture for the Modular Prosthetic Limb , 2011 .

[12]  Erik Scheme,et al.  Electromyogram pattern recognition for control of powered upper-limb prostheses: state of the art and challenges for clinical use. , 2011, Journal of rehabilitation research and development.

[13]  Patrick M. Pilarski,et al.  Model-Free reinforcement learning with continuous action in practice , 2012, 2012 American Control Conference (ACC).

[14]  Dario Farina,et al.  Myoelectric Control of Artificial Limbs¿Is There a Need to Change Focus? [In the Spotlight] , 2012, IEEE Signal Process. Mag..

[15]  Patrick M. Pilarski,et al.  Adaptive artificial limbs: a real-time approach to prediction and anticipation , 2013, IEEE Robotics & Automation Magazine.

[16]  Christian Cipriani,et al.  Abstract and Proportional Myoelectric Control for Multi-Fingered Hand Prostheses , 2013, Annals of Biomedical Engineering.

[17]  Manfredo Atzori,et al.  Recognition of hand movements in a trans-radial amputated subject by sEMG , 2013, 2013 IEEE 13th International Conference on Rehabilitation Robotics (ICORR).

[18]  Patrick M. Pilarski,et al.  Real-time prediction learning for the simultaneous actuation of multiple prosthetic joints , 2013, 2013 IEEE 13th International Conference on Rehabilitation Robotics (ICORR).

[19]  Barbara Caputo,et al.  Exploiting accelerometers to improve movement classification for prosthetics , 2013, 2013 IEEE 13th International Conference on Rehabilitation Robotics (ICORR).

[20]  H. Harry Asada,et al.  A robot on the shoulder: Coordinated human-wearable robot control using Coloured Petri Nets and Partial Least Squares predictions , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Panagiotis K. Artemiadis,et al.  Proceedings of the first workshop on Peripheral Machine Interfaces: going beyond traditional surface electromyography , 2014, Front. Neurorobot..

[22]  D. Farina,et al.  Linear and Nonlinear Regression Techniques for Simultaneous and Proportional Myoelectric Control , 2014, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[23]  M. R. Dawson,et al.  DEVELOPMENT OF THE BENTO ARM : AN IMPROVED ROBOTIC ARM FOR MYOELECTRIC TRAINING AND RESEARCH , 2014 .

[24]  J. Kelso The Bernstein Perspective: I. The Problems of Degrees of Freedom and Context-Conditioned Variability , 2014 .

[25]  P. Pilarski Prosthetic Devices as Goal-Seeking Agents , 2015 .

[26]  Craig Sherstan,et al.  Application of real-time machine learning to myoelectric prosthesis control: A case series in adaptive switching , 2016, Prosthetics and orthotics international.

[27]  Patrick M. Pilarski,et al.  Face valuing: Training user interfaces with facial expressions and reinforcement learning , 2016, ArXiv.

[28]  Ann L. Edwards,et al.  Adaptive and Autonomous Switching: Shared Control of Powered Prosthetic Arms Using Reinforcement Learning , 2016 .

[29]  Patrick M. Pilarski,et al.  Simultaneous Control and Human Feedback in the Training of a Robotic Agent with Actor-Critic Reinforcement Learning , 2016, ArXiv.

[30]  Dario Farina,et al.  High-Density Electromyography and Motor Skill Learning for Robust Long-Term Control of a 7-DoF Robot Arm , 2016, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[31]  Patrick M. Pilarski,et al.  Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding , 2017, 2017 International Conference on Rehabilitation Robotics (ICORR).

[32]  Patrick M. Pilarski,et al.  Upper and Lower Limb Robotic Prostheses , 2017 .