A Tag Based Random Order Vector Reduction Circuit
[1] David Wilson, et al. The Unified Accumulator Architecture, 2016, ACM Trans. Reconfigurable Technol. Syst.
[2] Jason D. Bakos, et al. A high-performance double precision accumulator, 2009, 2009 International Conference on Field-Programmable Technology.
[3] Viktor K. Prasanna, et al. High-Performance Reduction Circuits Using Deeply Pipelined Operators on FPGAs, 2007, IEEE Transactions on Parallel and Distributed Systems.
[4] Henk J. Sips, et al. An Improved Vector-Reduction Method, 1991, IEEE Trans. Computers.
[5] Marek Cieplucha. High performance FPGA-based implementation of a parallel multiplier-accumulator, 2013, Proceedings of the 20th International Conference Mixed Design of Integrated Circuits and Systems - MIXDES 2013.
[6] Kai Hwang, et al. Vector-Reduction Techniques for Arithmetic Pipelines, 1985, IEEE Transactions on Computers.
[7] Wayne Luk, et al. dfesnippets: An Open-Source Library for Dataflow Acceleration on FPGAs, 2017, ARC.
[8] Peter M. Kogge, et al. The Architecture of Pipelined Computers, 1981.
[9] Kleanthis Psarris, et al. Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction, 2012, IEEE Transactions on Parallel and Distributed Systems.
[10] Dongdong Chen, et al. High performance and energy efficient single-precision and double-precision merged floating-point adder on FPGA, 2018, IET Comput. Digit. Tech.
[11] Mi Lu, et al. Group-Alignment based Accurate Floating-Point Summation on FPGAs, 2006, ERSA.
[12] Peter Zipf, et al. Optimization of Constant Matrix Multiplication with Low Power and High Throughput, 2017, IEEE Transactions on Computers.
[13] Peter Zipf, et al. Constant Matrix Multiplication with Ternary Adders, 2018, 2018 25th IEEE International Conference on Electronics, Circuits and Systems (ICECS).
[14] Anastasios I. Dounis, et al. An Efficient FPGA Implementation of the Big Bang-Big Crunch Optimization Algorithm, 2018, ARC.
[15] A. Alvandpour, et al. A 6.2-GFlops Floating-Point Multiply-Accumulator With Conditional Normalization, 2006, IEEE Journal of Solid-State Circuits.
[16] Ming Wei, et al. A tag based vector reduction circuit, 2015, 2015 IEEE High Performance Extreme Computing Conference (HPEC).
[17] Miaoqing Huang, et al. Modular Design of Fully Pipelined Reduction Circuits on FPGAs, 2013, IEEE Transactions on Parallel and Distributed Systems.
[18] Margaret Martonosi, et al. Accelerating Pipelined Integer and Floating-Point Accumulations in Configurable Hardware with Delayed Addition Techniques, 2000, IEEE Trans. Computers.