Human in the Loop of Robot Learning: EEG-Based Reward Signal for Target Identification and Reaching Task

Shared control and shared autonomy play an important role in assistive technologies, allowing the offloading of the cognitive burden required for control from the user to the intelligent robotic device. In this context, electrophysiological measures of error detection, directly measured from a person's brain activity as Error-related Potentials (ErrPs), can be exploited to provide passive adaptation of an external semi-autonomous system to the human. This concept was implemented in an online robot learning task, where user's evaluation of the robot's actions, in terms of detected ErrP, was exploited to update a reward function in a Reinforcement Learning (RL) framework. Results from both simulated and experimental studies show that the introduction of human evaluation in the robot learning loop allows for: (1) the acceleration of optimal policy learning in a target reaching task, (2) the introduction of a further degree of control in robot learning, namely identification of one among multiple targets, according to the user's will. Overall, presented results support the potential of human-robot co-adaptive and co-operative strategies to develop human-centered assistive technologies.

[1]  Ricardo Chavarriaga,et al.  Teaching brain-machine interfaces as an alternative paradigm to neuroprosthetics control , 2015, Scientific Reports.

[2]  Hendrik Van Brussel,et al.  Shared control for intelligent wheelchairs: an implicit estimation of the user intention , 2003 .

[3]  Iñaki Iturrate,et al.  Shared control of a robot using EEG-based feedback signals , 2013, MLIS '13.

[4]  Ricardo Chavarriaga,et al.  Errare machinale est: the use of error-related potentials in brain-machine interfaces , 2014, Front. Neurosci..

[5]  Mandayam A. Srinivasan,et al.  Continuous shared control for stabilizing reaching and grasping with brain-machine interfaces , 2006, IEEE Transactions on Biomedical Engineering.

[6]  José del R. Millán,et al.  The role of shared-control in BCI-based telepresence , 2010, 2010 IEEE International Conference on Systems, Man and Cybernetics.

[7]  José del R. Millán,et al.  You Are Wrong! - Automatic Detection of Interaction Errors from Brain Waves , 2005, IJCAI.

[8]  José del R. Millán,et al.  Noninvasive brain-actuated control of a mobile robot by human EEG , 2004, IEEE Transactions on Biomedical Engineering.

[9]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[10]  Kiyoshi Kotani,et al.  A Novel Unsupervised Adaptive Learning Method for Long-Term Electromyography (EMG) Pattern Recognition , 2017, Sensors.

[11]  Ricardo Chavarriaga,et al.  Latency correction of error potentials between different experiments reduces calibration time for single-trial classification , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[12]  R Chavarriaga,et al.  Learning From EEG Error-Related Potentials in Noninvasive Brain-Computer Interfaces , 2010, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[13]  G. Pfurtscheller,et al.  Brain-Computer Interfaces for Communication and Control. , 2011, Communications of the ACM.

[14]  Joseph DelPreto,et al.  Correcting robot mistakes in real time using EEG signals , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Emmanuel Maby,et al.  Objective and Subjective Evaluation of Online Error Correction during P300-Based Spelling , 2012, Adv. Hum. Comput. Interact..

[16]  Karim A. Tahboub,et al.  A Semi-Autonomous Reactive Control Architecture , 2001, J. Intell. Robotic Syst..

[17]  Luis Montesano,et al.  Single trial recognition of error-related potentials during observation of robot operation , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[18]  Leonardo S. Mattos,et al.  Does tactile feedback enhance single-trial detection of error-related eeg potentials? , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[19]  Iñaki Iturrate,et al.  Robot reinforcement learning using EEG-based reward signals , 2010, 2010 IEEE International Conference on Robotics and Automation.

[20]  Vicenç Gómez,et al.  Adaptive Classification on Brain-Computer Interfaces Using Reinforcement Signals , 2012, Neural Computation.