Taxonomy of Spatial Parallelism on FPGAs for Massively Parallel Applications
暂无分享,去创建一个
[1] Kenta Kasai,et al. Flexible non-binary LDPC decoding on FPGAs , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Sheng-De Wang,et al. OpenCL computing on FPGA using multiported , 2015, 2015 25th International Conference on Field Programmable Logic and Applications (FPL).
[3] David R. Kaeli,et al. Runtime Support for Adaptive Spatial Partitioning and Inter-Kernel Communication on GPUs , 2014, 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing.
[4] David R. Kaeli,et al. Hardware thread reordering to boost OpenCL throughput on FPGAs , 2016, 2016 IEEE 34th International Conference on Computer Design (ICCD).
[5] Doris Chen,et al. Fractal video compression in OpenCL: An evaluation of CPUs, GPUs, and FPGAs as acceleration platforms , 2013, 2013 18th Asia and South Pacific Design Automation Conference (ASP-DAC).
[6] Ralph Duncan. A survey of parallel computer architectures , 1990, Computer.
[7] Mehdi Baradaran Tahoori,et al. Energy Efficient Scientific Computing on FPGAs using OpenCL , 2017, FPGA.
[8] Satoshi Matsuoka,et al. Evaluating and Optimizing OpenCL Kernels for High Performance Computing with FPGAs , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.
[9] David R. Kaeli,et al. Exploring the Efficiency of the OpenCL Pipe Semantic on an FPGA , 2016, SIGARCH Comput. Archit. News.
[10] Wu-chun Feng,et al. On the performance and energy efficiency of FPGAs and GPUs for polyphase channelization , 2014, 2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig14).
[11] Gregory D. Peterson,et al. Performance Comparison of Cholesky Decomposition on GPUs and FPGAs , 2011 .
[12] Kevin Skadron,et al. Rodinia: A benchmark suite for heterogeneous computing , 2009, 2009 IEEE International Symposium on Workload Characterization (IISWC).
[13] Bingsheng He,et al. Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture , 2013, Proc. VLDB Endow..
[14] Vijay Janapa Reddi,et al. PIN: a binary instrumentation tool for computer architecture research and education , 2004, WCAE '04.
[15] Martin Margala,et al. High level programming of FPGAs for HPC and data centric applications , 2014, 2014 IEEE High Performance Extreme Computing Conference (HPEC).
[16] David B. Skillicorn. A taxonomy for computer architectures , 1988, Computer.
[17] Michael J. Flynn,et al. Some Computer Organizations and Their Effectiveness , 1972, IEEE Transactions on Computers.
[18] Jungwon Kim,et al. OpenACC to FPGA: A Framework for Directive-Based High-Performance Reconfigurable Computing , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS).
[19] David R. Kaeli,et al. HQL: A Scalable Synchronization Mechanism for GPUs , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.
[20] Bingsheng He,et al. In-Cache Query Co-Processing on Coupled CPU-GPU Architectures , 2014, Proc. VLDB Endow..