Minimizing completion time for loop tiling with computation and communication overlapping
暂无分享,去创建一个
Nectarios Koziris | Georgios I. Goumas | Aristidis Sotiropoulos | N. Koziris | G. Goumas | A. Sotiropoulos
[1] Yves Robert,et al. (Pen)-ultimate tiling? , 1994, Integr..
[2] Weijia Shang,et al. Independent Partitioning of Algorithms with Uniform Dependencies , 1992, IEEE Trans. Computers.
[3] Jingling Xue,et al. On Tiling as a Loop Transformation , 1997, Parallel Process. Lett..
[4] Nectarios Koziris,et al. Optimal Scheduling for UET/UET-UCT Generalized n-Dimensional Grid Task Graphs , 1999, J. Parallel Distributed Comput..
[5] Andrew A. Chien,et al. Software overhead in messaging layers: where does the time go? , 1994, ASPLOS VI.
[6] Nectarios Koziris,et al. Evaluation of loop grouping methods based on orthogonal projection spaces , 2000, Proceedings 2000 International Conference on Parallel Processing.
[7] Hermann Hellwagner,et al. SISCI - Implementing a Standard Software Infrastructure on an SCI Cluster , 1997 .
[8] Nectarios Koziris,et al. Chain Grouping: A Method for Partitioning Loops onto Mesh-Connected Processor Arrays , 2000, IEEE Trans. Parallel Distributed Syst..
[9] François Irigoin,et al. Supernode partitioning , 1988, POPL '88.
[10] Weijia Shang,et al. On Supernode Transformation with Minimized Total Running Time , 1998, IEEE Trans. Parallel Distributed Syst..
[11] David A. Patterson,et al. Computer Organization & Design: The Hardware/Software Interface , 1993 .
[12] Weijia Shang,et al. Time Optimal Linear Schedules for Algorithms with Uniform Dependencies , 1991, IEEE Trans. Computers.
[13] Erik H. D'Hollander,et al. Partitioning and Labeling of Loops by Unimodular Transformations , 1992, IEEE Trans. Parallel Distributed Syst..
[14] Jingling Xue,et al. Communication-Minimal Tiling of Uniform Dependence Loops , 1996, J. Parallel Distributed Comput..
[15] Hiroshi Tezuka,et al. The design and implementation of zero copy MPI using commodity hardware with a high performance network , 1998, ICS '98.
[16] Thorsten von Eicken,et al. U-Net: a user-level network interface for parallel and distributed computing , 1995, SOSP.
[17] Knut Omang,et al. VIA over SCI - consequences of a zero copy implementation, and comparison with VIA over myrinet , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.
[18] Matthias A. Blumrich. Network interface for protected, user-level communication , 1996 .
[19] Wolfgang Rehm,et al. Memory Management in a Combined VIA/SCI Hardware , 2000, IPDPS Workshops.