Locality Optimizations for Parallel Computing Using Data Access Information
暂无分享,去创建一个
[1] Robert J. Fowler,et al. Improving Processor and Cache Locality in Fine-Grain Parallel Computations using Object-Affinity Scheduling and Continuation Passing , 1992 .
[2] Anoop Gupta,et al. Tolerating Latency Through Software-Controlled Prefetching in Shared-Memory Multiprocessors , 1991, J. Parallel Distributed Comput..
[3] Anoop Gupta,et al. Data locality and load balancing in COOL , 1993, PPOPP '93.
[4] Anoop Gupta,et al. SPLASH: Stanford parallel applications for shared-memory , 1992, CARN.
[5] Daniel E. Lenoski,et al. The design and analysis of DASH: a scalable directory-based multiprocessor , 1992 .
[6] Edward Eric Rothberg,et al. Exploiting the memory hierarchy in sequential and parallel sparse Cholesky factorization , 1992 .
[7] Reinaldo J. Michelena,et al. Tomographic string inversion , 1990 .
[8] Monica S. Lam,et al. Jade: a high-level, machine-independent language for parallel programming , 1993, Computer.
[9] Monica S. Lam,et al. Heterogeneous parallel programming in Jade , 1992, Proceedings Supercomputing '92.
[10] Martin Rinard,et al. The design, implementation and evaluation of Jade: a portable, implicitly parallel programming language , 1994 .
[11] Monica S. Lam,et al. Semantic foundations of Jade , 1992, POPL '92.
[12] Anoop Gupta,et al. Design and evaluation of a compiler algorithm for prefetching , 1992, ASPLOS V.