A Reconfigurable Posit Tensor Unit with Variable-Precision Arithmetic and Automatic Data Streaming
暂无分享,去创建一个
Nuno Roma | Pedro Tomás | Nuno Neves | N. Roma | P. Tomás | Nuno Neves
[1] Sharad Malik,et al. Cache miss equations: an analytical representation of cache misses , 1997, ICS '97.
[2] Kunle Olukotun,et al. Spatial: a language and compiler for application accelerators , 2018, PLDI.
[3] David A. Patterson,et al. A new golden age for computer architecture , 2019, Commun. ACM.
[4] David A. Patterson,et al. A domain-specific architecture for deep neural networks , 2018, Commun. ACM.
[5] David A. Patterson,et al. In-datacenter performance analysis of a tensor processing unit , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).
[6] Bin Wu,et al. OpenRAM: An open-source memory compiler , 2016, 2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).
[7] Kunle Olukotun,et al. Plasticine: A reconfigurable architecture for parallel patterns , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).
[8] Karthikeyan Sankaralingam,et al. Stream-dataflow acceleration , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).