Learning to Walk: Bio-Mimetic Hexapod Locomotion via Reinforcement-Based Spiking Central Pattern Generation

Online learning for the legged robot locomotion under performance and energy constraints remains to be a challenge. Methods such as stochastic gradient, deep reinforcement learning (RL) have been explored for bipeds, quadrupeds and hexapods. These techniques are computationally intensive and thus difficult to implement on edge computing platforms. These methods are also inefficient in energy consumption and throughput because of their reliance on complex sensors and pre-processing of data. On the other hand, neuromorphic computing paradigms, such as spiking neural networks (SNN), become increasingly favorable in low power computing on edge intelligence. SNN has exhibited the capability of performing reinforcement learning mechanisms with biomimetic spike time-dependent plasticity (STDP) of synapses. However, training a legged robot to walk in the synchronized gait patterns generated by a central pattern generator (CPG) in an SNN framework has not yet been explored. Such a method can combine the efficiency of SNNs with the synchronized locomotion of CPG based systems – providing breakthrough performance improvement of end-to-end learning in mobile robotics. In this paper, we propose a reinforcement based stochastic learning technique for training a spiking CPG for a hexapod robot which learns to walk using bio-inspired tripod gait without prior knowledge. The whole system is implemented on a lightweight raspberry pi platform with integrated sensors. Our method opens new opportunities for online learning with limited edge computing resources.

[1]  Jan Peters,et al.  Recurrent Spiking Networks Solve Planning Tasks , 2016, Scientific Reports.

[2]  R. Mann,et al.  Kinematic Responses to Changes in Walking Orientation and Gravitational Load in Drosophila melanogaster , 2014, PloS one.

[3]  Gordon Wyeth,et al.  Mapping a Suburb With a Single Camera Using a Biologically Inspired SLAM System , 2008, IEEE Transactions on Robotics.

[4]  Hong Wang,et al.  Loihi: A Neuromorphic Manycore Processor with On-Chip Learning , 2018, IEEE Micro.

[5]  D. F. Hoyt,et al.  Gait and the energetics of locomotion in horses , 1981, Nature.

[6]  Roland Strauss,et al.  Motor-Skill Learning in an Insect Inspired Neuro-Computational Control System , 2017, Front. Neurorobot..

[7]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[8]  Arijit Raychowdhury,et al.  A 55nm time-domain mixed-signal neuromorphic accelerator with stochastic synapses and embedded reinforcement learning for autonomous micro-robots , 2018, 2018 IEEE International Solid - State Circuits Conference - (ISSCC).

[9]  N. A. Bernshteĭn The co-ordination and regulation of movements , 1967 .

[10]  David Haussler,et al.  Neuromorphic Closed-Loop Control of a Flexible Modular Robot by a Simulated Spiking Central Pattern Generator , 2020, 2020 3rd IEEE International Conference on Soft Robotics (RoboSoft).

[11]  Auke Jan Ijspeert,et al.  Central pattern generators for locomotion control in animals and robots: A review , 2008, Neural Networks.

[12]  Juan Martín Carpio Valadez,et al.  Design of Spiking Central Pattern Generators for Multiple Locomotion Gaits in Hexapod Robots by Christiansen Grammar Evolution , 2016, Front. Neurorobot..

[13]  A. Selverston,et al.  Invertebrate central pattern generator circuits , 2010, Philosophical Transactions of the Royal Society B: Biological Sciences.

[14]  Yang Zhang,et al.  A study on control mechanism of above knee robotic prosthesis based on CPG model , 2010, 2010 IEEE International Conference on Robotics and Biomimetics.

[15]  S. Zill,et al.  Invertebrate Neurobiology: Brain Control of Insect Walking , 2010, Current Biology.

[16]  Arijit Raychowdhury,et al.  Bio-inspired Gait Imitation of Hexapod Robot Using Event-Based Vision Sensor and Spiking Neural Network , 2020, 2020 International Joint Conference on Neural Networks (IJCNN).

[17]  Akio Ishiguro,et al.  A Quadruped Robot Exhibiting Spontaneous Gait Transitions from Walking to Trotting to Galloping , 2017, Scientific Reports.

[18]  E. Izhikevich Solving the distal reward problem through linkage of STDP and dopamine signaling , 2007, BMC Neuroscience.

[19]  Jun Ueda,et al.  A 5 mg micro-bristle-bot fabricated by two-photon lithography , 2019, Journal of Micromechanics and Microengineering.

[20]  Arijit Raychowdhury,et al.  A 65-nm 8-to-3-b 1.0–0.36-V 9.1–1.1-TOPS/W Hybrid-Digital-Mixed-Signal Computing Platform for Accelerating Swarm Robotics , 2020, IEEE Journal of Solid-State Circuits.

[21]  Arijit Raychowdhury,et al.  14.1 A 65nm 1.1-to-9.1TOPS/W Hybrid-Digital-Mixed-Signal Computing Platform for Accelerating Model-Based and Model-Free Swarm Robotics , 2019, 2019 IEEE International Solid- State Circuits Conference - (ISSCC).

[22]  Razvan V. Florian,et al.  Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity , 2007, Neural Computation.

[23]  Arijit Raychowdhury,et al.  Learning to Walk: Spike Based Reinforcement Learning for Hexapod Robot Central Pattern Generation , 2020, 2020 2nd IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS).

[24]  César Torres-Huitzil,et al.  A CPG system based on spiking neurons for hexapod robot locomotion , 2015, Neurocomputing.

[25]  Ken Saito,et al.  Four-leg independent mechanism for MEMS microrobot , 2017, Artificial Life and Robotics.

[26]  Andrew S. Cassidy,et al.  A million spiking-neuron integrated circuit with a scalable communication network and interface , 2014, Science.

[27]  Alois Knoll,et al.  Task-Independent Spiking Central Pattern Generator: A Learning-Based Approach , 2020, Neural Processing Letters.

[28]  Daniel Gutierrez-Galan,et al.  NeuroPod: a real-time neuromorphic spiking CPG applied to robotics , 2019, Neurocomputing.

[29]  Arijit Raychowdhury,et al.  A 55-nm, 1.0–0.4V, 1.25-pJ/MAC Time-Domain Mixed-Signal Neuromorphic Accelerator With Stochastic Synapses for Reinforcement Learning in Autonomous Mobile Robots , 2019, IEEE Journal of Solid-State Circuits.

[30]  Ioannis Polykretis,et al.  An Astrocyte-Modulated Neuromorphic Central Pattern Generator for Hexapod Robot Locomotion on Intel’s Loihi , 2020, ICONS.

[31]  R. Mann,et al.  Quantification of gait parameters in freely walking wild type and sensory deprived Drosophila melanogaster , 2013, eLife.

[32]  Roland Strauss,et al.  Nonlinear Circuits and Systems for Neuro-inspired Robot Control , 2018 .

[33]  F. Ponulak ReSuMe-New Supervised Learning Method for Spiking Neural Networks , 2005 .