Optimizing a shared virtual memory system for a heterogeneous CPU-accelerator platform
暂无分享,去创建一个
Ying Gao | Hu Chen | Shoumeng Yan | Xiaocheng Zhou | Gansha Wu | Bratin Saha | Sai Luo
[1] Edward T. Grochowski,et al. Larrabee: A many-Core x86 architecture for visual computing , 2008, 2008 IEEE Hot Chips 20 Symposium (HCS).
[2] P. K. Dubey,et al. Recognition, Mining and Synthesis Moves Comp uters to the Era of Tera , 2005 .
[3] Ricardo Bianchini,et al. Hiding communication latency and coherence overhead in software DSMs , 1996, ASPLOS VII.
[4] Jens H. Krüger,et al. GPGPU: general purpose computation on graphics hardware , 2004, SIGGRAPH '04.
[5] Ricardo Bianchini,et al. Data prefetching for software DSMs , 1998, ICS '98.
[6] Alan L. Cox,et al. An integrated compile-time/run-time software distributed shared memory system , 1996, ASPLOS VII.
[7] John E. Stone,et al. An asymmetric distributed shared memory model for heterogeneous parallel systems , 2010, ASPLOS XV.
[8] Alan L. Cox,et al. Practical, transparent operating system support for superpages , 2002, OPSR.
[9] Kai Li,et al. Heterogeneous Distributed Shared Memory , 1992, IEEE Trans. Parallel Distributed Syst..
[10] Avi Mendelson,et al. Programming model for a heterogeneous x86 platform , 2009, PLDI '09.
[11] Vivek Sarkar,et al. X10: concurrent programming for modern architectures , 2007, PPOPP.
[12] R. Govindarajan,et al. CAS-DSM: A Compiler Assisted Software Distributed Shared Memory , 2004, International Journal of Parallel Programming.
[13] Ying Gao,et al. Terascale chip multiprocessor memory hierarchy and programming model , 2009, 2009 International Conference on High Performance Computing (HiPC).