Exploring a SOT-MRAM Based In-Memory Computing for Data Processing

In this paper, we propose a Spin-Orbit Torque Magnetic Random-Access Memory (SOT-MRAM) array design that can simultaneously work as non-volatile memory and implement a reconfigurable in-memory logic operation without add-on logic circuits. The computed output can be simply read out like a typical MRAM bit-cell through the modified peripheral circuit. Such intrinsic in-memory computation can be used to process data locally and transfer the “cooked” data to the primary processing unit (i.e., CPU or GPU) for complex computations with high precision requirement. It greatly reduces the power-hungry and long-distance data communication, and further leads to extreme parallel computation within memory. In this work, we further propose an in-memory edge extraction algorithm as a case study to demonstrate the efficiency of the in-memory pre-processing methodology. The simulation results show that our edge extraction method reduces data communication as much as 8x for grayscale image, thus greatly reducing system energy consumption. Meanwhile, the F-measure result shows only <inline-formula><tex-math notation="LaTeX">$\sim$</tex-math><alternatives><inline-graphic xlink:href="he-ieq1-2836967.gif"/></alternatives></inline-formula>10 percent degradation compared to conventional edge detection operator, such as Prewitt, Sobel, and Roberts. Moreover, the edges extracted from the memory show comparable good quality with Canny edges in the context of edge-based motion detection and cross-modality object recognition.

[1]  Shaahin Angizi,et al.  Exploring STT-MRAM Based In-Memory Computing Paradigm with Application of Image Edge Extraction , 2017, 2017 IEEE International Conference on Computer Design (ICCD).

[2]  David Blaauw,et al.  Compute Caches , 2017, 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA).

[3]  D. Ralph,et al.  Spin-torque ferromagnetic resonance induced by the spin Hall effect. , 2010, Physical review letters.

[4]  Shaahin Angizi,et al.  IMCS2: Novel Device-to-Architecture Co-Design for Low-Power In-Memory Computing Platform Using Coterminous Spin Switch , 2018, IEEE Transactions on Magnetics.

[5]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Shaahin Angizi,et al.  Leveraging Dual-Mode Magnetic Crossbar for Ultra-low Energy In-memory Data Encryption , 2017, ACM Great Lakes Symposium on VLSI.

[7]  A. Anane,et al.  Nearly total spin polarization in La2/3Sr1/3MnO3 from tunneling experiments , 2003 .

[8]  Shoji Ikeda,et al.  Dependence of Giant Tunnel Magnetoresistance of Sputtered CoFeB/MgO/CoFeB Magnetic Tunnel Junctions on MgO Barrier Thickness and Annealing Temperature , 2005 .

[9]  G Autès,et al.  Strong enhancement of the tunneling magnetoresistance by electron filtering in an Fe/MgO/Fe/GaAs(001) junction. , 2010, Physical review letters.

[10]  Shaahin Angizi,et al.  RIMPA: A New Reconfigurable Dual-Mode In-Memory Processing Architecture with Spin Hall Effect-Driven Domain Wall Motion Device , 2017, 2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI).

[11]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[12]  Yiming Huai,et al.  Spin-Transfer Torque MRAM (STT-MRAM): Challenges and Prospects , 2008 .

[13]  Mehdi Baradaran Tahoori,et al.  Ultra-Fast and High-Reliability SOT-MRAM: From Cache Replacement to Normally-Off Computing , 2016, IEEE Transactions on Multi-Scale Computing Systems.

[14]  Kaushik Roy,et al.  A framework for simulating hybrid MTJ/CMOS circuits: Atoms to system approach , 2012, 2012 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[15]  D. Ralph,et al.  Spin transfer torque devices utilizing the giant spin Hall effect of tungsten , 2012, 1208.1711.

[16]  Cong Xu,et al.  Pinatubo: A processing-in-memory architecture for bulk bitwise operations in emerging non-volatile memories , 2016, 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC).

[17]  Emilio Maggio,et al.  Hybrid particle filter and mean shift tracker with adaptive transition model , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[18]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[19]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[20]  B. Hoefflinger ITRS: The International Technology Roadmap for Semiconductors , 2011 .

[21]  Deliang Fan,et al.  Energy Efficient Reconfigurable Threshold Logic Circuit with Spintronic Devices , 2017, IEEE Transactions on Emerging Topics in Computing.

[22]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[23]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Marjana Lezaic,et al.  Half-metallic ferromagnets for magnetic tunnel junctions by ab initio calculations , 2005 .

[25]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[26]  Fadi Dornaika,et al.  An Edge-Based Approach to Motion Detection , 2006, International Conference on Computational Science.

[27]  Irwin Edward Sobel,et al.  Camera Models and Machine Perception , 1970 .

[28]  D. Ralph,et al.  Spin-Torque Switching with the Giant Spin Hall Effect of Tantalum , 2012, Science.

[29]  Cong Xu,et al.  NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory , 2012, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[30]  Yanfeng Sun,et al.  A novel edge detection approach using a fusion model , 2014, Multimedia Tools and Applications.

[31]  A. Fert,et al.  Giant spin Hall effect induced by skew scattering from bismuth impurities inside thin film CuBi alloys. , 2012, Physical review letters.

[32]  Tao Zhang,et al.  PRIME: A Novel Processing-in-Memory Architecture for Neural Network Computation in ReRAM-Based Main Memory , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).

[33]  Kaushik Roy,et al.  Write-optimized reliable design of STT MRAM , 2012, ISLPED '12.

[34]  Shaahin Angizi,et al.  Low power in-memory computing based on dual-mode SOT-MRAM , 2017, 2017 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED).

[35]  Kaushik Roy,et al.  Spin-Transfer Torque Memories: Devices, Circuits, and Systems , 2016, Proceedings of the IEEE.

[36]  Rama Chellappa,et al.  Visual Domain Adaptation: A survey of recent advances , 2015, IEEE Signal Processing Magazine.

[37]  James Hays,et al.  The sketchy database , 2016, ACM Trans. Graph..

[38]  H. Ohno,et al.  A spin-orbit torque switching scheme with collinear magnetic easy axis and current configuration. , 2016, Nature nanotechnology.

[39]  K. Roy,et al.  Injection-Locked Spin Hall-Induced Coupled-Oscillators for Energy Efficient Associative Computing , 2015, IEEE Transactions on Nanotechnology.

[40]  Supriyo Datta,et al.  Non-volatile spin switch for Boolean and non-Boolean logic , 2012 .

[41]  G. Huang,et al.  An Energy-Efficient Nonvolatile In-Memory Computing Architecture for Extreme Learning Machine by Domain-Wall Nanowire Devices , 2015, IEEE Transactions on Nanotechnology.

[42]  Shaahin Angizi,et al.  HielM: Highly flexible in-memory computing using STT MRAM , 2018, 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC).

[43]  Kaushik Roy,et al.  High Performance and Energy-Efficient On-Chip Cache Using Dual Port (1R/1W) Spin-Orbit Torque MRAM , 2016, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.