Implementation of Integrated CPU-GPU for Efficient Uniform Memory Access Method and Verification System
暂无分享,去创建一个
In this paper, we propose a system for efficient use of shared memory between CPU and GPU. The system, called Fusion Architecture, assures consistency of the shared memory and minimizes cache misses that frequently occurs on Heterogeneous System Architecture or Unified Virtual Memory based systems. It also maximizes the performance for memory intensive jobs by efficient allocation of GPU cores. To test between architectures on various scenarios, we introduce the Fusion Architecture Analyzer, which compares OpenMP, OpenCL, CUDA, and the proposed architecture in terms of memory overhead and process time. As a result, Proposed fusion architectures show that the Fusion Architecture runs benchmarks 55% faster and reduces memory overheads by 220% in average.
[1] Christoph Hagleitner,et al. Giving Text Analytics a Boost , 2014, IEEE Micro.
[2] Jong-Myon Kim,et al. Implementation and Performance Evaluation of the Faddev-Leverrier Algorithm using GPGPU , 2013 .
[3] John R. Heath,et al. Coherency Hub Design for Multisocket Sun Servers with CoolThreads Technology , 2009, IEEE Micro.
[4] Joong-Hwee Cho,et al. Efficient Implementation of Candidate Region Extractor for Pedestrian Detection System with Stereo Camera based on GP-GPU , 2013 .