A pipelined execution of tiled nested loops on SMPs with computation and communication overlapping
暂无分享,去创建一个
[1] Yves Robert,et al. (Pen)-ultimate tiling? , 1994, Integr..
[2] Nectarios Koziris,et al. Evaluation of loop grouping methods based on orthogonal projection spaces , 2000, Proceedings 2000 International Conference on Parallel Processing.
[3] Jang-Ping Sheu,et al. Partitioning and Mapping Nested Loops on Multiprocessor Systems , 1991, IEEE Trans. Parallel Distributed Syst..
[4] Jingling Xue,et al. On Tiling as a Loop Transformation , 1997, Parallel Process. Lett..
[5] François Irigoin,et al. Supernode partitioning , 1988, POPL '88.
[6] Weijia Shang,et al. On supernode transformation with minimized total running time , 1996, Proceedings of International Conference on Application Specific Systems, Architectures and Processors: ASAP '96.
[7] Nectarios Koziris,et al. Optimal Scheduling for UET/UET-UCT Generalized n-Dimensional Grid Task Graphs , 1999, J. Parallel Distributed Comput..
[8] Chung-Ta King,et al. Pipelined Data Parallel Algorithms-II: Design , 1990, IEEE Trans. Parallel Distributed Syst..
[9] J. Ramanujam,et al. Tiling Multidimensional Itertion Spaces for Multicomputers , 1992, J. Parallel Distributed Comput..
[10] Donald J. Patterson,et al. Computer organization and design: the hardware-software interface (appendix a , 1993 .
[11] Nectarios Koziris,et al. Enhancing the performance of tiled loop execution onto clusters using memory mapped network interfaces and pipelined schedules , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.
[12] David A. Patterson,et al. Computer Organization & Design: The Hardware/Software Interface , 1993 .
[13] Nectarios Koziris,et al. Minimizing completion time for loop tiling with computation and communication overlapping , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.
[14] T. KingC.,et al. Pipelined Data Parallel Algorithms-I , 1990 .
[15] Nectarios Koziris,et al. Chain Grouping: A Method for Partitioning Loops onto Mesh-Connected Processor Arrays , 2000, IEEE Trans. Parallel Distributed Syst..
[16] Jingling Xue. Communication-Minimal Tiling of Uniform Dependence Loops , 1997, J. Parallel Distributed Comput..