An extensible global address space framework with decoupled task and data abstractions
Sriram Krishnamoorthy | Jarek Nieplocha | P. Sadayappan | Atanas Rountev | Ümit V. Çatalyürek
[1] Katherine A. Yelick, et al. Titanium: A High-performance Java Dialect, 1998, Concurr. Pract. Exp.
[2] Ian Foster, et al. Disk resident arrays: an array-oriented I/O library for out-of-core computations, 1996, Proceedings of 6th Symposium on the Frontiers of Massively Parallel Computation (Frontiers '96).
[3] J. Mellor-Crummey, et al. A multi-platform co-array Fortran compiler, 2004, Proceedings of the 13th International Conference on Parallel Architecture and Compilation Techniques (PACT 2004).
[4] Robyn R. Lutz, et al. Generalized portable shmem library for high performance computing, 2003.
[5] Laxmikant V. Kalé, et al. A load balancing strategy for prioritized execution of tasks, 1993, Proceedings of the Seventh International Parallel Processing Symposium.
[6] Joel H. Saltz, et al. A Hypergraph-Based Workload Partitioning Strategy for Parallel Data Aggregation, 2001, PPSC.
[7] Laxmikant V. Kalé, et al. CHARM++: a portable concurrent object oriented system based on C++, 1993, OOPSLA '93.
[8] Jarek Nieplocha, et al. Advances, Applications and Performance of the Global Arrays Shared Memory Programming Toolkit, 2006, Int. J. High Perform. Comput. Appl.
[9] Cevdet Aykanat, et al. Hypergraph-Partitioning-Based Decomposition for Parallel Sparse-Matrix Vector Multiplication, 1999.
[10] Bruce Hendrickson, et al. The Chaco user's guide. Version 1.0, 1993.
[11] Keith H. Randall, et al. Cilk: efficient multithreaded computing, 1998.
[12] Vikram S. Adve, et al. Using integer sets for data-parallel program analysis and optimization, 1998, PLDI.
[13] Robert J. Harrison, et al. Global Arrays: a portable "shared-memory" programming model for distributed memory computers, 1994, Proceedings of Supercomputing '94.
[14] Sriram Krishnamoorthy, et al. Data and Computation Abstractions for Dynamic and Irregular Computations, 2005, HiPC.
[15] John N. Shadid, et al. Official Aztec user's guide: version 2.1, 1999.
[16] Bryan Carpenter, et al. ARMCI: A Portable Remote Memory Copy Library for Distributed Array Libraries and Compiler Run-Time Systems, 1999, IPPS/SPDP Workshops.
[17] Keshav Pingali, et al. Synthesizing transformations for locality enhancement of imperfectly-nested loop nests, 2000.
[18] J I Brauman, et al. Computing in science, 1993, Science.
[19] David E. Bernholdt, et al. A High-Level Approach to Synthesis of High-Performance Codes for Quantum Chemistry, 2002, ACM/IEEE SC 2002 Conference (SC'02).
[20] Robert J. Harrison, et al. Shared Memory Programming in Metacomputing Environments: The Global Array Approach, 1997, The Journal of Supercomputing.
[21] Thomas Lengauer, et al. Combinatorial algorithms for integrated circuit layout, 1990, Applicable theory in computer science.
[22] Keshav Pingali, et al. Synthesizing Transformations for Locality Enhancement of Imperfectly-Nested Loop Nests, 2001, International Journal of Parallel Programming.
[23] Iain S. Duff, et al. Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface, 1997, TOMS.
[24] Keshav Pingali, et al. Tiling Imperfectly-nested Loop Nests, 2000, ACM/IEEE SC 2000 Conference (SC'00).