Autonomous Probabilistic Coprocessing With Petaflips per Second

In this article we present a concrete design for a probabilistic (p-) computer based on a network of p-bits, robust classical entities fluctuating between −1 and +1, with probabilities that are controlled through an input constructed from the outputs of other p-bits. The architecture of this probabilistic computer is similar to a stochastic neural network with the p-bit playing the role of a binary stochastic neuron, but with one key difference: there is no sequencer used to enforce an ordering of p-bit updates, as is typically required. Instead, we explore sequencerless designs where all p-bits are allowed to flip autonomously and demonstrate that such designs can allow ultrafast operation unconstrained by available clock speeds without compromising the solution’s fidelity. Based on experimental results from a hardware benchmark of the autonomous design and benchmarked device models, we project that a nanomagnetic implementation can scale to achieve petaflips per second with millions of neurons. A key contribution of this article is the focus on a hardware metric − flips per second − as a problem and substrate-independent figure-of-merit for an emerging class of hardware annealers known as Ising Machines. Much like the shrinking feature sizes of transistors that have continually driven Moore’s Law, we believe that flips per second can be continually improved in later technology generations of a wide class of probabilistic, domain specific hardware.

[1]  Supriyo Datta,et al.  Modular Approach to Spintronics , 2015, Scientific Reports.

[2]  Damien Querlioz,et al.  Spintronic Nanodevices for Bioinspired Computing , 2016, Proceedings of the IEEE.

[3]  S. Majetich,et al.  Superparamagnetic perpendicular magnetic tunnel junctions for true random number generators , 2018 .

[4]  Vicky Choi,et al.  Minor-embedding in adiabatic quantum computation: II. Minor-universal graph design , 2010, Quantum Inf. Process..

[5]  Matthias Troyer,et al.  Computational complexity and fundamental limitations to fermionic quantum Monte Carlo simulations , 2004, Physical review letters.

[6]  John W. Fisher,et al.  Estimating the Partition Function by Discriminance Sampling , 2015, UAI.

[7]  Nikesh S. Dattani,et al.  Quadratization in discrete optimization and quantum mechanics , 2019, ArXiv.

[8]  Richard F. Lyon,et al.  Neural Networks for Machine Learning , 2017 .

[9]  Vicky Choi,et al.  Minor-embedding in adiabatic quantum computation: I. The parameter setting problem , 2008, Quantum Inf. Process..

[10]  Ryan Babbush,et al.  What is the Computational Value of Finite Range Tunneling , 2015, 1512.02206.

[11]  Hitoshi Kubota,et al.  Neural-like computing with populations of superparamagnetic basis functions , 2016, Nature Communications.

[12]  Leonardo Franco,et al.  High precision FPGA implementation of neural network activation functions , 2014, 2014 IEEE Symposium on Intelligent Embedded Systems (IES).

[13]  Masanao Yamaoka,et al.  An Ising Computer Based on Simulated Quantum Annealing by Path Integral Monte Carlo Method , 2017, 2017 IEEE International Conference on Rebooting Computing (ICRC).

[14]  Masanori Hariyama,et al.  OpenCL-based design of an FPGA accelerator for quantum annealing simulation , 2019, The Journal of Supercomputing.

[15]  P. Pfeuty The one-dimensional Ising model with a transverse field , 1970 .

[16]  Philippe Talatchian,et al.  Energy-efficient stochastic computing with superparamagnetic tunnel junctions , 2020, Physical review applied.

[17]  Marcelo A. Montemurro,et al.  FPGA Hardware Acceleration of Monte Carlo Simulations for the Ising Model , 2016, IEEE Transactions on Parallel and Distributed Systems.

[18]  Denis Navarro,et al.  Janus II: A new generation application-driven computer for spin-system simulations , 2013, Comput. Phys. Commun..

[19]  S. Kirkpatrick,et al.  Solvable Model of a Spin-Glass , 1975 .

[20]  Yamaoka Masanao,et al.  7.3 STATICA: A 512-Spin 0.25M-Weight Full-Digital Annealing Processor with a Near-Memory All-Spin-Updates-at-Once Architecture for Combinatorial Optimization with Complete Spin-Spin Interactions , 2020 .

[21]  Andrew Lucas,et al.  Ising formulations of many NP problems , 2013, Front. Physics.

[22]  Supriyo Datta,et al.  Scaled Quantum Circuits Emulated with Room Temperature p-Bits , 2018, 1810.07144.

[23]  Yang Lv,et al.  Experimental Demonstration of Probabilistic Spin Logic by Magnetic Tunnel Junctions , 2019, IEEE Magnetics Letters.

[24]  Parul Parashar,et al.  Neural Networks in Machine Learning , 2014 .

[25]  Yu Wang,et al.  Technological Exploration of RRAM Crossbar Array for Matrix-Vector Multiplication , 2015, Journal of Computer Science and Technology.

[26]  Shimeng Yu,et al.  Harnessing Intrinsic Noise in Memristor Hopfield Neural Networks for Combinatorial Optimization , 2019, ArXiv.

[27]  Ken-ichi Kawarabayashi,et al.  A coherent Ising machine for 2000-node optimization problems , 2016, Science.

[28]  Hiroshi Imamura,et al.  Spin dice: A scalable truly random number generator based on spintronics , 2014 .

[29]  Supriyo Datta,et al.  Low-Barrier Magnet Design for Efficient Hardware Binary Stochastic Neurons , 2019, IEEE Magnetics Letters.

[30]  Bongjin Kim,et al.  31.2 CIM-Spin: A 0.5-to-1.2V Scalable Annealing Processor Using Digital Compute-In-Memory Spin Operators and Register-Based Spins for Combinatorial Optimization Problems , 2020, 2020 IEEE International Solid- State Circuits Conference - (ISSCC).

[31]  M. Troyer,et al.  Quantum versus classical annealing of Ising spin glasses , 2014, Science.

[32]  Ran El-Yaniv,et al.  Binarized Neural Networks , 2016, NIPS.

[33]  Jaijeet S. Roychowdhury,et al.  OIM: Oscillator-based Ising Machines for Solving Combinatorial Optimisation Problems , 2019, UCNC.

[34]  W. Rippard,et al.  Switching Distributions for Perpendicular Spin-Torque Devices Within the Macrospin Approximation , 2012, IEEE Transactions on Magnetics.

[35]  M. Suzuki,et al.  Relationship between d-Dimensional Quantal Spin Systems and (d+1)-Dimensional Ising Systems: Equivalence, Critical Exponents and Systematic Approximants of the Partition Function and Spin Correlations , 1976 .

[36]  Emile H. L. Aarts,et al.  Simulated annealing and Boltzmann machines - a stochastic approach to combinatorial optimization and neural computing , 1990, Wiley-Interscience series in discrete mathematics and optimization.

[37]  Jianping Wang,et al.  A single magnetic-tunnel-junction stochastic computing unit , 2017, 2017 IEEE International Electron Devices Meeting (IEDM).

[38]  Yu Wang,et al.  Memristor-based approximated computation , 2013, International Symposium on Low Power Electronics and Design (ISLPED).

[39]  Andrew S. Cassidy,et al.  A million spiking-neuron integrated circuit with a scalable communication network and interface , 2014, Science.

[40]  R. Car,et al.  Theory of Quantum Annealing of an Ising Spin Glass , 2002, Science.

[41]  Supriyo Bandyopadhyay,et al.  Reliability and Scalability of p-Bits Implemented With Low Energy Barrier Nanomagnets , 2019, IEEE Magnetics Letters.

[42]  Nikhil Rangarajan,et al.  Energy-Efficient Computing With Probabilistic Magnetic Bits—Performance Modeling and Comparison Against Probabilistic CMOS Logic , 2017, IEEE Transactions on Magnetics.

[43]  Wolfgang Porod,et al.  Computing With Networks of Oscillatory Dynamical Systems , 2019, Proceedings of the IEEE.

[44]  Jonathan Rose,et al.  Measuring the Gap Between FPGAs and ASICs , 2006, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[45]  Supriyo Datta,et al.  Integer factorization using stochastic magnetic tunnel junctions , 2019, Nature.

[46]  Catherine D. Schuman,et al.  A Survey of Neuromorphic Computing and Neural Networks in Hardware , 2017, ArXiv.

[47]  Supriyo Datta,et al.  p-Bits for Probabilistic Spin Logic , 2018, Applied Physics Reviews.

[48]  Wolfgang Maass,et al.  Neural Dynamics as Sampling: A Model for Stochastic Computation in Recurrent Networks of Spiking Neurons , 2011, PLoS Comput. Biol..

[49]  Supriyo Datta,et al.  Intrinsic optimization using stochastic nanomagnets , 2016, Scientific Reports.

[50]  Supriyo Datta,et al.  Hardware emulation of stochastic p-bits for invertible logic , 2017, Scientific Reports.

[51]  A. R. Trivedi,et al.  Low Power Restricted Boltzmann Machine Using Mixed-Mode Magneto-Tunneling Junctions , 2019, IEEE Electron Device Letters.

[52]  L. Reichl A modern course in statistical physics , 1980 .

[53]  Abhishek Khanna,et al.  Inherent Weight Normalization in Stochastic Neural Networks , 2019, NeurIPS.

[54]  Radford M. Neal Connectionist Learning of Belief Networks , 1992, Artif. Intell..

[55]  J. Slaughter,et al.  A Fully Functional 64 Mb DDR3 ST-MRAM Built on 90 nm CMOS Technology , 2013, IEEE Transactions on Magnetics.

[56]  M. R. Mahmoodi,et al.  Versatile stochastic dot product circuits based on nonvolatile memories for high performance neurocomputing and neurooptimization , 2019, Nature Communications.

[57]  Hayato Goto,et al.  Combinatorial optimization by simulating adiabatic bifurcations in nonlinear Hamiltonian systems , 2019, Science Advances.

[58]  Joseph S. Friedman,et al.  Low-Energy Truly Random Number Generation with Superparamagnetic Tunnel Junctions for Unconventional Computing , 2017, 1706.05262.

[59]  Hiroyuki Mizuno,et al.  A 20k-Spin Ising Chip to Solve Combinatorial Optimization Problems With CMOS Annealing , 2016, IEEE Journal of Solid-State Circuits.

[60]  Jaijeet S. Roychowdhury,et al.  Oscillator-based Ising Machine , 2017, ArXiv.

[61]  Ken-ichi Kawarabayashi,et al.  Experimental investigation of performance differences between coherent Ising machines and a quantum annealer , 2018, Science Advances.

[62]  Supriyo Datta,et al.  A building block for hardware belief networks , 2016, Scientific Reports.

[63]  Kaushik Roy,et al.  Stochastic Spiking Neural Networks Enabled by Magnetic Tunnel Junctions: From Nontelegraphic to Telegraphic Switching Regimes , 2017 .

[64]  Hong Wang,et al.  Loihi: A Neuromorphic Manycore Processor with On-Chip Learning , 2018, IEEE Micro.

[65]  Supriyo Datta,et al.  Scalable Emulation of Stoquastic Hamiltonians with Room Temperature p-bits , 2019 .

[66]  Supriyo Datta,et al.  Implementing p-bits With Embedded MTJ , 2017, IEEE Electron Device Letters.

[67]  Kazuyuki Aihara,et al.  A fully programmable 100-spin coherent Ising machine with all-to-all connections , 2016, Science.

[68]  P. Upadhyaya,et al.  Modular Compact Modeling of MTJ Devices , 2018, IEEE Transactions on Electron Devices.

[69]  Brian M. Sutton,et al.  Stochastic p-bits for Invertible Logic , 2016, 1610.00377.

[70]  R. Feynman Simulating physics with computers , 1999 .

[71]  S. Datta,et al.  Proposal for an all-spin logic device with built-in memory. , 2010, Nature nanotechnology.

[72]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[73]  Srinivas Devadas,et al.  FPGA PUF using programmable delay lines , 2010, 2010 IEEE International Workshop on Information Forensics and Security.

[74]  Masanao Yamaoka,et al.  FPGA-based Annealing Processor for Ising Model , 2016, 2016 Fourth International Symposium on Computing and Networking (CANDAR).

[75]  Toshiyuki Miyazawa,et al.  Physics-Inspired Optimization for Quadratic Unconstrained Problems Using a Digital Annealer , 2018, Front. Phys..

[76]  Daniel A. Lidar,et al.  Evidence for quantum annealing with more than one hundred qubits , 2013, Nature Physics.

[77]  Masato Motomura,et al.  7.3 STATICA: A 512-Spin 0.25M-Weight Full-Digital Annealing Processor with a Near-Memory All-Spin-Updates-at-Once Architecture for Combinatorial Optimization with Complete Spin-Spin Interactions , 2020, 2020 IEEE International Solid- State Circuits Conference - (ISSCC).

[78]  Sebastiano Vigna,et al.  Scrambled Linear Pseudorandom Number Generators , 2018, ACM Trans. Math. Softw..

[79]  Supriyo Datta,et al.  Composable Probabilistic Inference Networks Using MRAM-based Stochastic Neurons , 2018, ACM J. Emerg. Technol. Comput. Syst..

[80]  Rob A. Rutenbar,et al.  Digital Circuit Design Challenges and Opportunities in the Era of Nanoscale CMOS , 2008, Proceedings of the IEEE.