论文信息 - SoC FPAA Hardware Implementation of a VMM+WTA Embedded Learning Classifier

SoC FPAA Hardware Implementation of a VMM+WTA Embedded Learning Classifier

This paper focuses on the circuit aspects required for an on-chip, on-line system on chip large-scale field-programmable analog array learning for vector-matrix multiplier (VMM) + winner-take-all (WTA) classifier structure. We start by describing the VMM+WTA classifier structure, and then show techniques required to handle device mismatch. The approach is initially explained using a VMM+WTA as a two-input XOR classifier structure. The approach requires considering the entire mixed-mode system, including the analog classifier data path, control circuitry for weight updates, and digital algorithm for computing digital weight updates and resulting floating-gate programming during the algorithm.

Jennifer Hasler | Sahil Shah

[1] Robert A. Legenstein,et al. Pattern representation and recognition with accelerated analog neuromorphic systems , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[2] Naveen Verma,et al. A Low-Power Processor With Configurable Embedded Machine-Learning Accelerators for High-Order and Adaptive Analysis of Medical-Sensor Signals , 2013, IEEE Journal of Solid-State Circuits.

[3] Sihwan Kim,et al. Integrated Floating-Gate Programming Environment for System-Level ICs , 2016, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[4] Jennifer Hasler,et al. VMM + WTA Embedded Classifiers Learning Algorithm Implementable on SoC FPAA Devices , 2018, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[5] M. A. Jabri,et al. A low power trainable analogue neural network classifier chip , 1993, Proceedings of IEEE Custom Integrated Circuits Conference - CICC '93.

[6] Wolfgang Maass,et al. On the Computational Power of Winner-Take-All , 2000, Neural Computation.

[7] John Lazzaro,et al. Winner-Take-All Networks of O(N) Complexity , 1988, NIPS.

[8] David Blaauw,et al. A Fixed-Point Neural Network Architecture for Speech Applications on Resource Constrained Hardware , 2016, Journal of Signal Processing Systems.

[9] Ethem Alpaydin,et al. Introduction to machine learning , 2004, Adaptive computation and machine learning.

[10] C PlattJ,et al. A Neural Network Classifier for the I1000 OCR Chip. , 1996 .

[11] Sihwan Kim,et al. Scaling Floating-Gate Devices Predicting Behavior for Programmable and Configurable Circuits and Systems , 2016 .

[12] Jennifer Hasler,et al. Modeling, simulation and implementation of circuit elements in an open-source tool set on the FPAA , 2017 .

[13] Gert Cauwenberghs,et al. An analog VLSI recurrent neural network learning a continuous-time trajectory , 1996, IEEE Trans. Neural Networks.

[14] Jennifer Hasler,et al. Vector-Matrix Multiply and Winner-Take-All as an Analog Classifier , 2014, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[15] Jennifer Hasler,et al. Opportunities in physical computing driven by analog realization , 2016, 2016 IEEE International Conference on Rebooting Computing (ICRC).

[16] Simha Sethumadhavan,et al. Energy-Efficient Hybrid Analog/Digital Approximate Computation in Continuous Time , 2016, IEEE Journal of Solid-State Circuits.

[17] Zhengya Zhang,et al. A 6.67mW sparse coding ASIC enabling on-chip learning and inference , 2014, 2014 Symposium on VLSI Circuits Digest of Technical Papers.

[18] Giacomo Indiveri,et al. A reconfigurable on-line learning spiking neuromorphic processor comprising 256 neurons and 128K synapses , 2015, Front. Neurosci..

[19] Gert Cauwenberghs,et al. Neuromorphic learning VLSI systems: a survey , 1998 .

[20] Jennifer Hasler,et al. Reconfigurable analog classifier for knee-joint rehabilitation , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[21] Fabrice Seguin,et al. Ultra-Low-Energy Mixed-Signal IC Implementing Encoded Neural Networks , 2016, IEEE Transactions on Circuits and Systems I: Regular Papers.

[22] K. Ming Leung,et al. Learning Vector Quantization , 2017, Encyclopedia of Machine Learning and Data Mining.

[23] Jennifer Hasler,et al. Low power speech detector on a FPAA , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[24] Marwan A. Jabri,et al. Weight perturbation: an optimal architecture and learning technique for analog VLSI feedforward and recurrent multilayer networks , 1992, IEEE Trans. Neural Networks.

[25] K. Hirotsu,et al. An analog neural network chip with random weight change learning algorithm , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[26] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[27] David W. Graham,et al. Reconfiguration costs in analog sensor interfaces for wireless sensing applications , 2013, 2013 IEEE 56th International Midwest Symposium on Circuits and Systems (MWSCAS).

[28] David V. Anderson,et al. A floating-gate vector-quantizer , 2002, The 2002 45th Midwest Symposium on Circuits and Systems, 2002. MWSCAS-2002..

[29] David V. Anderson,et al. An analog programmable multi-dimensional radial basis function based classifier , 2007, 2007 IFIP International Conference on Very Large Scale Integration.

[30] Venkatesh Srinivasan,et al. A 531 nW/MHz, 128/spl times/32 current-mode programmable analog vector-matrix multiplier with over two decades of linearity , 2004, Proceedings of the IEEE 2004 Custom Integrated Circuits Conference (IEEE Cat. No.04CH37571).

[31] Jennifer Hasler,et al. A proof-of-concept classifier for acoustic signals from the knee joint on a FPAA , 2016, 2016 IEEE SENSORS.

[32] Paul E. Hasler,et al. A Highly Dense, Low Power, Programmable Analog Vector-Matrix Multiplier: The FPAA Implementation , 2011, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[33] Jussi H. Poikonen,et al. A mixed-mode array computing architecture for online dictionary learning , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[34] Sihwan Kim,et al. Calibration of Floating-Gate SoC FPAA System , 2017, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[35] Sihwan Kim,et al. A Programmable and Configurable Mixed-Mode FPAA SoC , 2016, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[36] Tadashi Shibata,et al. A Neuron-MOS-Based VLSI Implementation of Pulse-Coupled Neural Networks for Image Feature Generation , 2010, IEEE Transactions on Circuits and Systems I: Regular Papers.

[37] Steven R. Young,et al. A 1 TOPS/W Analog Deep Machine-Learning Engine With Floating-Gate Storage in 0.13 µm CMOS , 2014, IEEE Journal of Solid-State Circuits.

[38] Hong Yan,et al. Handwritten digit recognition by adaptive-subspace self-organizing map (ASSOM) , 1999, IEEE Trans. Neural Networks.

[39] Jennifer Hasler,et al. Tuning of Multiple Parameters With a BIST System , 2017, IEEE Transactions on Circuits and Systems I: Regular Papers.

[40] David V. Anderson,et al. An Analog Programmable Multidimensional Radial Basis Function Based Classifier , 2007, IEEE Transactions on Circuits and Systems I: Regular Papers.

[41] Jennifer Hasler. Energy Constraints for Building Large-Scale Systems , 2016 .

[42] Nils J. Nilsson,et al. Introduction to Machine Learning , 2020, Machine Learning for iOS Developers.

[43] J. Kwong,et al. An Energy-Efficient Biomedical Signal Processing Platform , 2010, IEEE Journal of Solid-State Circuits.

[44] Paul E. Hasler,et al. An analog floating-gate node for Supervised learning , 2005, IEEE Transactions on Circuits and Systems I: Regular Papers.