Evolutionary algorithm optimization of biological learning parameters in a biomimetic neuroprosthesis

Biomimetic simulation permits neuroscientists to better understand the complex neuronal dynamics of the brain. Embedding a biomimetic simulation in a closed-loop neuroprosthesis, which can read signals from and write signals to the brain, will permit applications for amelioration of motor, psychiatric, and memory-related brain disorders. Biomimetic neuroprostheses require real-time adaptation to changes in the external environment, thus constituting an example of a dynamic data-driven application system. As model fidelity increases, so do the number of parameters and the complexity of finding appropriate parameter configurations. Instead of adapting synaptic weights via machine learning, we employed major biological learning methods: spike-timing dependent plasticity and reinforcement learning. We optimized the learning metaparameters using evolutionary algorithms, which were implemented in parallel and used an island model approach to obtain sufficient speed. We employed these methods to train a cortical spiking model to use macaque brain activity indicating a selected target to drive a virtual musculoskeletal arm, with realistic anatomical and biomechanical properties, to reach to that target. The optimized system was able to reproduce macaque data from a comparable experimental motor task. These techniques can be used to efficiently tune the parameters of multiscale systems, linking realistic neuronal dynamics to behavior, and thus provide a useful tool for neuroscience and neuroprosthetics.
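
The island model approach mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation: the fitness function `toy_fitness` is a hypothetical stand-in for the expensive spiking-network and virtual-arm simulation, the three parameters are placeholders for learning metaparameters, and the islands are evolved sequentially here, whereas the abstract describes a parallel implementation. All function names, population sizes, and migration settings are assumptions chosen for illustration.

```python
import random

def toy_fitness(params):
    # Hypothetical stand-in for the costly simulation: lower is better,
    # with the optimum at 0.5 for every (placeholder) metaparameter.
    return sum((p - 0.5) ** 2 for p in params)

def mutate(params, rng, sigma=0.1):
    # Gaussian mutation, clipped to the assumed parameter range [0, 1].
    return [min(1.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in params]

def evolve_island(population, rng, n_keep):
    # Truncation selection with elitism: keep the n_keep best individuals
    # and refill the island with mutated copies of them.
    parents = sorted(population, key=toy_fitness)[:n_keep]
    n_children = len(population) - n_keep
    children = [mutate(rng.choice(parents), rng) for _ in range(n_children)]
    return parents + children

def migrate(islands, n_migrants):
    # Ring topology: each island receives copies of the best individuals
    # from the previous island, replacing its own worst individuals.
    best = [sorted(isl, key=toy_fitness)[:n_migrants] for isl in islands]
    for i, isl in enumerate(islands):
        incoming = best[(i - 1) % len(islands)]
        isl.sort(key=toy_fitness)
        isl[-n_migrants:] = [list(ind) for ind in incoming]

def run_island_model(n_islands=4, pop_size=10, n_params=3,
                     generations=30, migration_interval=5,
                     n_migrants=2, seed=0):
    rng = random.Random(seed)
    islands = [[[rng.random() for _ in range(n_params)]
                for _ in range(pop_size)]
               for _ in range(n_islands)]
    initial_best = min(toy_fitness(ind) for isl in islands for ind in isl)
    for gen in range(1, generations + 1):
        # In a parallel setting, each island would run on its own worker.
        islands = [evolve_island(isl, rng, n_keep=pop_size // 2)
                   for isl in islands]
        if gen % migration_interval == 0:
            migrate(islands, n_migrants)
    best = min((ind for isl in islands for ind in isl), key=toy_fitness)
    return initial_best, best

if __name__ == "__main__":
    initial_best, best = run_island_model()
    print("initial best fitness:", initial_best)
    print("final best fitness:", toy_fitness(best))
```

Because islands exchange individuals only every few generations, each island can be evaluated independently between migrations, which is what makes the scheme attractive when a single fitness evaluation is slow.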
