An Evaluation of Unified Memory Technology on NVIDIA GPUs
Simon See | Guanghao Jin | Xuewen Cui | Wenqiang Li
[1] Kevin Skadron, et al. Scalable parallel programming, 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).
[2] David I. August, et al. Automatic CPU-GPU communication management and optimization, 2011, PLDI '11.
[3] Raphael Landaverde, et al. An investigation of Unified Memory Access performance in CUDA, 2014, 2014 IEEE High Performance Extreme Computing Conference (HPEC).
[4] Fadi J. Kurdahi, et al. MorphoSys: An Integrated Reconfigurable System for Data-Parallel and Computation-Intensive Applications, 2000, IEEE Trans. Computers.
[5] Wen-mei W. Hwu, et al. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA, 2008, PPoPP.
[6] David R. Kaeli, et al. Exploring the multiple-GPU design space, 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.
[7] Andrew S. Tanenbaum, et al. Operating systems: design and implementation, 1987, Prentice-Hall software series.
[8] David A. Patterson, et al. Computer Architecture: A Quantitative Approach, 1969.
[9] John E. Stone, et al. An asymmetric distributed shared memory model for heterogeneous parallel systems, 2010, ASPLOS XV.
[10] Wen-mei W. Hwu, et al. Parboil: A Revised Benchmark Suite for Scientific and Commercial Throughput Computing, 2012.
[11] Andrew S. Tanenbaum, et al. Operating systems - design and implementation, 3rd Edition, 2005.
[12] Steven S. Lumetta, et al. CUBA: an architecture for efficient CPU/co-processor data communication, 2008, ICS '08.