A Framework to Explore Workload-Specific Performance and Lifetime Trade-offs in Neuromorphic Computing

Neuromorphic hardware with non-volatile memory (NVM) can implement machine learning workload in an energy-efficient manner. Unfortunately, certain NVMs such as phase change memory (PCM) require high voltages for correct operation. These voltages are supplied from an on-chip charge pump. If the charge pump is activated too frequently, its internal CMOS devices do not recover from stress, accelerating their aging and leading to negative bias temperature instability (NBTI) generated defects. Forcefully discharging the stressed charge pump can lower the aging rate of its CMOS devices, but makes the neuromorphic hardware unavailable to perform computations while its charge pump is being discharged. This negatively impacts performance such as latency and accuracy of the machine learning workload being executed. In this letter, we propose a novel framework to exploit workload-specific performance and lifetime trade-offs in neuromorphic computing. Our framework first extracts the precise times at which a charge pump in the hardware is activated to support neural computations within a workload. This timing information is then used with a characterized NBTI reliability model to estimate the charge pump's aging during the workload execution. We use our framework to evaluate workload-specific performance and reliability impacts of using 1) different SNN mapping strategies and 2) different charge pump discharge strategies. We show that our framework can be used by system designers to explore performance and reliability trade-offs early in the design of neuromorphic hardware such that appropriate reliability-oriented design margins can be set.

[1]  Nikil D. Dutt,et al.  CARLsim 4: An Open Source Library for Large Scale, Biologically Detailed Spiking Neural Network Simulation using Heterogeneous Clusters , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[2]  Nikil D. Dutt,et al.  Mapping Spiking Neural Networks to Neuromorphic Hardware , 2019, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[3]  Wolfgang Maass,et al.  Networks of Spiking Neurons: The Third Generation of Neural Network Models , 1996, Electron. Colloquium Comput. Complex..

[4]  Sonja Grün,et al.  Analysis of Parallel Spike Trains , 2010 .

[5]  Nikil D. Dutt,et al.  Unsupervised Heart-rate Estimation in Wearables With Liquid States and A Probabilistic Readout , 2017, Neural Networks.

[6]  Francky Catthoor,et al.  Heartbeat Classification in Wearables Using Multi-layer Perceptron and Time-Frequency Joint Distribution of ECG , 2018, 2018 IEEE/ACM International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE).

[7]  Dimitri Linten,et al.  NBTI-Generated Defects in Nanoscaled Devices: Fast Characterization Methodology and Modeling , 2017, IEEE Transactions on Electron Devices.

[8]  Wofgang Maas,et al.  Networks of spiking neurons: the third generation of neural network models , 1997 .

[9]  Bharadwaj Veeravalli,et al.  Aging-aware hardware-software task partitioning for reliable reconfigurable multiprocessor systems , 2013, 2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES).

[10]  Romain Brette,et al.  The Brian Simulator , 2009, Front. Neurosci..

[11]  Boyu Shen,et al.  Zero Reversion Loss, High-Efficiency Charge Pump for Wide Output Current Load Range , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[12]  Jun Zhou,et al.  A System-Level Simulator for RRAM-Based Neuromorphic Computing Chips , 2019, ACM Trans. Archit. Code Optim..