Composable Probabilistic Inference Networks Using MRAM-based Stochastic Neurons

Magnetoresistive random access memory (MRAM) technologies with thermally unstable nanomagnets are leveraged to develop an intrinsic stochastic neuron as a building block for restricted Boltzmann machines (RBMs) to form deep belief networks (DBNs). The embedded MRAM-based neuron is modeled using precise physics equations. The simulation results exhibit the desired sigmoidal relation between the input voltages and probability of the output state. A probabilistic inference network simulator (PIN-Sim) is developed to realize a circuit-level model of an RBM utilizing resistive crossbar arrays along with differential amplifiers to implement the positive and negative weight values. The PIN-Sim is composed of five main blocks to train a DBN, evaluate its accuracy, and measure its power consumption. The MNIST dataset is leveraged to investigate the energy and accuracy tradeoffs of seven distinct network topologies in SPICE using the 14nm HP-FinFET technology library with the nominal voltage of 0.8V, in which an MRAM-based neuron is used as the activation function. The software and hardware level simulations indicate that a 784× 200× 10 topology can achieve less than 5% error rates with ∼400pJ energy consumption. The error rates can be reduced to 2.5% by using a 784× 500× 500× 500× 10 DBN at the cost of ∼10× higher energy consumption and significant area overhead. Finally, the effects of specific hardware-level parameters on power dissipation and accuracy tradeoffs are identified via the developed PIN-Sim framework.

[1]  Hitoshi Kubota,et al.  Neural-like computing with populations of superparamagnetic basis functions , 2016, Nature Communications.

[2]  J. Appenzeller,et al.  Spin-torque devices with hard axis initialization as Stochastic Binary Neurons , 2018, Scientific Reports.

[3]  Terrence J. Sejnowski,et al.  The Computational Brain , 1996, Artif. Intell..

[4]  Sara A. Majetich,et al.  Current control of time-averaged magnetization in superparamagnetic tunnel junctions , 2017 .

[5]  F DemaraRonald,et al.  Composable Probabilistic Inference Networks Using MRAM-based Stochastic Neurons , 2019 .

[6]  Naoya Onizawa,et al.  VLSI Implementation of Deep Neural Network Using Integral Stochastic Computing , 2015, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[7]  Engin Ipek,et al.  Memristive Boltzmann machine: A hardware accelerator for combinatorial optimization and deep learning , 2017, 2017 Fifth Berkeley Symposium on Energy Efficient Electronic Systems & Steep Transistors Workshop (E3S).

[8]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[9]  S. Yuasa,et al.  Giant room-temperature magnetoresistance in single-crystal Fe/MgO/Fe magnetic tunnel junctions , 2004, Nature materials.

[10]  Brian M. Sutton,et al.  Stochastic p-bits for Invertible Logic , 2016, 1610.00377.

[11]  Supriyo Datta,et al.  Modular Approach to Spintronics , 2015, Scientific Reports.

[12]  A. Panchula,et al.  Giant tunnelling magnetoresistance at room temperature with MgO (100) tunnel barriers , 2004, Nature materials.

[13]  Yoshua Bengio,et al.  BinaryConnect: Training Deep Neural Networks with binary weights during propagations , 2015, NIPS.

[14]  Supriyo Datta,et al.  Experimental demonstration of nanomagnet networks as hardware for Ising computing , 2016, 2016 IEEE International Electron Devices Meeting (IEDM).

[15]  Chung Lam,et al.  Training a Probabilistic Graphical Model With Resistive Switching Electronic Synapses , 2016, IEEE Transactions on Electron Devices.

[16]  A. Fert,et al.  Current-induced magnetization switching in atom-thick tungsten engineered perpendicular magnetic tunnel junctions with large tunnel magnetoresistance , 2017, Nature Communications.

[17]  Supriyo Datta,et al.  Intrinsic optimization using stochastic nanomagnets , 2016, Scientific Reports.

[18]  Kaushik Roy,et al.  Perspective: Stochastic magnetic devices for cognitive computing , 2018, Journal of Applied Physics.

[19]  Robert Hecht-Nielsen,et al.  Theory of the backpropagation neural network , 1989, International 1989 Joint Conference on Neural Networks.

[20]  I A Basheer,et al.  Artificial neural networks: fundamentals, computing, design, and application. , 2000, Journal of microbiological methods.

[21]  Supriyo Datta,et al.  Implementing p-bits With Embedded MTJ , 2017, IEEE Electron Device Letters.

[22]  Hiroshi Imamura,et al.  Spin dice: A scalable truly random number generator based on spintronics , 2014 .

[23]  Paul Chow,et al.  High-Performance Reconfigurable Hardware Architecture for Restricted Boltzmann Machines , 2010, IEEE Transactions on Neural Networks.

[24]  Chris H. Kim,et al.  Low-Energy Deep Belief Networks Using Intrinsic Sigmoidal Spintronic-based Probabilistic Neurons , 2017, ACM Great Lakes Symposium on VLSI.

[25]  Kaushik Roy,et al.  Magnetic Tunnel Junction Enabled Stochastic Spiking Neural Networks: From Non-Telegraphic to Telegraphic Switching Regime , 2017, ArXiv.

[26]  Chris H. Kim,et al.  A Magnetic Tunnel Junction based True Random Number Generator with conditional perturb and real-time output probability tracking , 2014, 2014 IEEE International Electron Devices Meeting.

[27]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[28]  Kaushik Roy,et al.  Hybrid Spintronic-CMOS Spiking Neural Network With On-Chip Learning: Devices, Circuits and Systems , 2015, ArXiv.

[29]  Luan Tran,et al.  45nm low power CMOS logic compatible embedded STT MRAM utilizing a reverse-connection 1T/1MTJ cell , 2009, 2009 IEEE International Electron Devices Meeting (IEDM).

[30]  Hitoshi Kubota,et al.  Noise-enhanced synchronization of stochastic magnetic oscillators , 2014, 1405.4360.

[31]  D. Ralph,et al.  Measurement of the spin-transfer-torque vector in magnetic tunnel junctions , 2007, 0705.4207.

[32]  J. Scott,et al.  HIGH-DIELECTRIC CONSTANT THIN FILMS FOR DYNAMIC RANDOM ACCESS MEMORIES (DRAM) , 1998 .

[33]  Nicola A. Spaldin,et al.  Origin of the dielectric dead layer in nanoscale capacitors , 2006, Nature.

[34]  Boulder,et al.  Large-angle, gigahertz-rate random telegraph switching induced by spin-momentum transfer , 2004, cond-mat/0404109.

[35]  Masatoshi Okutomi,et al.  A Novel Inference of a Restricted Boltzmann Machine , 2014, 2014 22nd International Conference on Pattern Recognition.

[36]  Koichiro Inomata,et al.  Giant tunneling magnetoresistance up to 330% at room temperature in sputter deposited Co2FeAl/MgO/CoFe magnetic tunnel junctions , 2009 .

[37]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[38]  Qing Wu,et al.  Hardware realization of BSB recall function using memristor crossbar arrays , 2012, DAC Design Automation Conference 2012.

[39]  Andrew S. Cassidy,et al.  A million spiking-neuron integrated circuit with a scalable communication network and interface , 2014, Science.

[40]  Witold Pedrycz,et al.  Contrastive divergence for memristor-based restricted Boltzmann machine , 2015, Engineering applications of artificial intelligence.

[41]  Punyashloka Debashis,et al.  Design of Stochastic Nanomagnets for Probabilistic Spin Logic , 2018, IEEE Magnetics Letters.

[42]  Supriyo Datta,et al.  A building block for hardware belief networks , 2016, Scientific Reports.

[43]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[44]  R. Cowburn,et al.  Single-Domain Circular Nanomagnets , 1999 .

[45]  Yu (Kevin) Cao,et al.  What is Predictive Technology Model (PTM)? , 2009, SIGD.

[46]  Kaushik Roy,et al.  Probabilistic Deep Spiking Neural Systems Enabled by Magnetic Tunnel Junction , 2016, IEEE Transactions on Electron Devices.

[47]  Miguel Á. Carreira-Perpiñán,et al.  On Contrastive Divergence Learning , 2005, AISTATS.

[48]  Bo Yuan,et al.  VLSI Architectures for the Restricted Boltzmann Machine , 2017, ACM J. Emerg. Technol. Comput. Syst..

[49]  Wolfgang Maass,et al.  Neural Dynamics as Sampling: A Model for Stochastic Computation in Recurrent Networks of Spiking Neurons , 2011, PLoS Comput. Biol..

[50]  Kaushik Roy,et al.  Magnetic Tunnel Junction Mimics Stochastic Cortical Spiking Neurons , 2015, Scientific Reports.

[51]  D. Stewart,et al.  The missing memristor found , 2008, Nature.

[52]  H. Ohno,et al.  Spintronics based random access memory: a review , 2017 .

[53]  Kunle Olukotun,et al.  A Large-Scale Architecture for Restricted Boltzmann Machines , 2010, 2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines.

[54]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[55]  Geoffrey E. Hinton,et al.  Application of Deep Belief Networks for Natural Language Understanding , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[56]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.