A Hardware Accelerator for Computing an Exact Dot Product
Krste Asanović | Jonathan Bachrach | Jack Koenig | David Biancolin
[1] John Wawrzynek, et al. Chisel: Constructing hardware in a Scala embedded language, 2012, DAC Design Automation Conference 2012.
[2] Andrew Waterman, et al. The RISC-V Instruction Set Manual. Volume 1: User-Level ISA, Version 2.0, 2014.
[3] Norman P. Jouppi, et al. CACTI: an enhanced cache access and cycle time model, 1996, IEEE J. Solid State Circuits.
[4] ANSI/IEEE. IEEE Standard for Binary Floating Point Arithmetic, 1985.
[5] Karthikeyan Sankaralingam, et al. Dark Silicon and the End of Multicore Scaling, 2012, IEEE Micro.
[6] Yunsup Lee, et al. The RISC-V Instruction Set Manual, 2014.
[7] Ulrich W. Kulisch, et al. Computer Arithmetic and Validity - Theory, Implementation, and Applications, 2008, de Gruyter studies in mathematics.
[8] George Ho, et al. PAPI: A Portable Interface to Hardware Performance Counters, 1999.
[9] David Defour, et al. ExBLAS: Reproducible and Accurate BLAS Library, 2015.
[10] Vincent Lefèvre, et al. MPFR: A multiple-precision binary floating-point library with correct rounding, 2007, TOMS.
[11] Peter Ahrens, et al. Efficient Reproducible Floating Point Summation and BLAS, 2015.
[12] Adam M. Izraelevitz, et al. The Rocket Chip Generator, 2016.
[13] Florent de Dinechin, et al. Design-space exploration for the Kulisch accumulator, 2017.
[14] Siegfried M. Rump, et al. Accurate Sum and Dot Product, 2005, SIAM J. Sci. Comput.