A Scalable Multi-TeraOPS Core for AI Training and Inference
暂无分享,去创建一个
Swagath Venkataramani | Vijayalakshmi Srinivasan | George Gristede | Pong-Fei Lu | Sunil Shukla | Kailash Gopalakrishnan | Jungwook Choi | Jinwook Oh | Chia-Yu Chen | Christos Vezyrtzis | Pierce Chuang | Leland Chang | Shih-Hsien Lo | Nianzheng Cao | Michael Klaiber | Naigang Wang | Michael Guillorn | Ching Zhou | Fanchieh Yee | Matthew Ziegler | Ankur Agrawal | Tina Babinsky | Silvia Mueller | Howard Haynie | Thomas Fox | Bruce Fleischer | Gary Maier | Michael Scheuermann | Joel Silberman | Brian Curran | Dongsoo Lee | A. Agrawal | K. Gopalakrishnan | Chia-Yu Chen | Jungwook Choi | Jinwook Oh | J. Silberman | P. Lu | S. Mueller | Naigang Wang | Swagath Venkataramani | Leland Chang | V. Srinivasan | M. Scheuermann | S. Lo | G. Maier | B. Fleischer | Dongsoo Lee | P. Chuang | G. Gristede | T. Fox | Christos Vezyrtzis | M. Klaiber | M. Ziegler | B. Curran | N. Cao | Michael Guillorn | Howard Haynie | Ching Zhou | Sunil Shukla | Tina Babinsky | F. Yee
[1] Swagath Venkataramani,et al. POSTER: Design Space Exploration for Performance Optimization of Deep Neural Networks on Shared Memory Accelerators , 2017, 2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT).
[2] Chester Liu,et al. A 1.40mm2 141mW 898GOPS sparse neuromorphic processor in 40nm CMOS , 2016, 2016 IEEE Symposium on VLSI Circuits (VLSI-Circuits).
[3] David A. Patterson,et al. In-datacenter performance analysis of a tensor processing unit , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).
[4] Song Han,et al. Trained Ternary Quantization , 2016, ICLR.
[5] Leibo Liu,et al. A 1.06-to-5.09 TOPS/W reconfigurable hybrid-neural-network processor for deep learning applications , 2017, 2017 Symposium on VLSI Circuits.
[6] Desoli Mr Giuseppe,et al. 14.1 A 2.9TOPS/W deep convolutional neural network SoC in FD-SOI 28nm for intelligent embedded systems , 2017 .
[7] Tadahiro Kuroda,et al. BRein memory: A 13-layer 4.2 K neuron/0.8 M synapse binary/ternary reconfigurable in-memory deep neural network accelerator in 65 nm CMOS , 2017, 2017 Symposium on VLSI Circuits.