Online Adaptive Decoding of Motor Imagery Based on Reinforcement Learning

The development of electronic and computer technology makes non-invasive acquisition systems of EEG more universal adoption. Therefore this paper focus on the BCI based on motor imagery, which is a popular studied spontaneous EEG, producing without external stimulus. 6 healthy people participate in the examination, 5 of which are saved while one of which is thrown due to disturbing. Independent component correlation algorithm (ICA) was used to extract motor related component, meanwhile other filters also designed for better signal quality. Taking the motor related components PSD distribution after ICA as features, signal modal decoding based on reinforcement learning is processed in two algorithms. Both of them have good performance. Finally, on-line decoding model is descripted on the basis of the off-line decoding model. The result shows the accuracy of training model has significant difference with random level. Furthermore, the conclusion can be developed into human-computer integration based on EEG signals.

[1]  J.J. Vidal,et al.  Real-time detection of brain events in EEG , 1977, Proceedings of the IEEE.

[2]  Peter Stone,et al.  Reinforcement learning , 2019, Scholarpedia.

[3]  I. Rampil A Primer for EEG Signal Processing in Anesthesia , 1998, Anesthesiology.

[4]  Andrew S. Whitford,et al.  Cortical control of a prosthetic arm for self-feeding , 2008, Nature.

[5]  Clay B. Holroyd,et al.  The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. , 2002, Psychological review.

[6]  Yuanqing Li,et al.  Control of a Wheelchair in an Indoor Environment Based on a Brain–Computer Interface and Automated Navigation , 2016, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[7]  Gerwin Schalk,et al.  A brain–computer interface using electrocorticographic signals in humans , 2004, Journal of neural engineering.

[8]  E. Dewan Occipital Alpha Rhythm Eye Position and Lens Accommodation , 1967, Nature.

[9]  P. Comon,et al.  Ica: a potential tool for bci systems , 2008, IEEE Signal Processing Magazine.

[10]  Thomas V. Wiecki,et al.  fMRI and EEG Predictors of Dynamic Decision Parameters during Human Reinforcement Learning , 2015, The Journal of Neuroscience.

[11]  N. Birbaumer,et al.  BCI2000: a general-purpose brain-computer interface (BCI) system , 2004, IEEE Transactions on Biomedical Engineering.

[12]  M.M. Moore,et al.  Real-world applications for brain-computer interface technology , 2003, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[13]  Terrence J. Sejnowski,et al.  Independent Component Analysis Using an Extended Infomax Algorithm for Mixed Subgaussian and Supergaussian Sources , 1999, Neural Computation.

[14]  A. Angrilli,et al.  The efficacy of EEG neurofeedback aimed at enhancing sensory-motor rhythm theta ratio in healthy subjects , 2012, Experimental Brain Research.

[15]  G. Pfurtscheller,et al.  ERD/ERS patterns reflecting sensorimotor activation and deactivation. , 2006, Progress in brain research.

[16]  Iñaki Iturrate,et al.  Robot reinforcement learning using EEG-based reward signals , 2010, 2010 IEEE International Conference on Robotics and Automation.

[17]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.