The performance advantages of integrating block data transfer in cache-coherent multiprocessors
[1] Todd C. Mowry,et al. Tolerating latency through software-controlled data prefetching , 1994 .
[2] Anant Agarwal,et al. Integrating message-passing and shared-memory: early experience , 1993, SIGP.
[3] Alan L. Cox,et al. An Evaluation of Software Distributed Shared Memory for Next-Generation Processors and Networks , 1993, ISCA '93.
[4] Anoop Gupta,et al. SPLASH: Stanford parallel applications for shared-memory , 1992, CARN.
[5] Guy E. Blelloch,et al. A comparison of sorting algorithms for the connection machine CM-2 , 1991, SPAA '91.
[6] Alan L. Cox,et al. Evaluation of release consistent software distributed shared memory on emerging network technology , 1993, ISCA '93.
[7] John L. Hennessy,et al. The Performance Advantages of Integrating Message Passing in Cache-Coherent Multiprocessors , 1993 .
[8] Anoop Gupta,et al. Working sets, cache sizes, and node granularity issues for large-scale multiprocessors , 1993, ISCA '93.
[9] Anoop Gupta,et al. The Stanford FLASH multiprocessor , 1994, ISCA '94.
[10] Anoop Gupta,et al. The directory-based cache coherence protocol for the DASH multiprocessor , 1990, Proceedings of the 17th Annual International Symposium on Computer Architecture.
[11] David H. Bailey,et al. FFTs in external or hierarchical memory , 1989, Proceedings of the 1989 ACM/IEEE Conference on Supercomputing (Supercomputing '89).
[12] Anoop Gupta,et al. The directory-based cache coherence protocol for the DASH multiprocessor , 1990, ISCA '90.
[13] Anoop Gupta,et al. An efficient block-oriented approach to parallel sparse Cholesky factorization , 1993, Supercomputing '93.
[14] James R. Larus,et al. Tempest and typhoon: user-level shared memory , 1994, ISCA '94.
[15] James R. Larus,et al. The Wisconsin Wind Tunnel: virtual prototyping of parallel computers , 1993, SIGMETRICS '93.
[16] Stephen R. Goldschmidt,et al. Simulation of multiprocessors: accuracy and performance , 1993 .
[17] John L. Hennessy,et al. The Effects of Latency, Occupancy, and Bandwidth in Distributed Shared Memory Multiprocessors , 1995 .
[18] D. Brandt,et al. Multi-level adaptive solutions to boundary-value problems , 1977, Math. Comput.
[19] Anant Agarwal,et al. APRIL: a processor architecture for multiprocessing , 1990, ISCA '90.
[20] Anoop Gupta,et al. Integration of message passing and shared memory in the Stanford FLASH multiprocessor , 1994, ASPLOS VI.
[21] Brian N. Bershad,et al. The Midway distributed shared memory system , 1993, Digest of Papers. Compcon Spring.
[22] Anoop Gupta,et al. Scaling parallel programs for multiprocessors: methodology and examples , 1993, Computer.
[23] David H. Bailey,et al. The NAS Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl.