Code Generation for Distributed-Memory Architectures
Jie Zhao | Rongcai Zhao | Jinchen Xu
[1] Sang-Yong Han,et al. MPI backend for an automatic parallelizing compiler , 1999, Proceedings Fourth International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN'99).
[2] Vivek Sarkar,et al. X10: an object-oriented approach to non-uniform cluster computing , 2005, OOPSLA '05.
[3] Monica S. Lam,et al. Maximizing parallelism and minimizing synchronization with affine transforms , 1997, POPL '97.
[4] Corinne Ancourt,et al. A Linear Algebra Framework for Static HPF Code Distribution , 2007 .
[5] Edith Schonberg,et al. An HPF Compiler for the IBM SP2 , 1995, Proceedings of the IEEE/ACM SC95 Conference.
[6] Alan L. Cox,et al. An integrated compile-time/run-time software distributed shared memory system , 1996, ASPLOS VII.
[7] Sandeep K. S. Gupta,et al. On Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines , 1993, 1993 International Conference on Parallel Processing - ICPP'93.
[8] Katherine Yelick,et al. UPC Language Specifications V1.1.1 , 2003 .
[9] Monica S. Lam,et al. Communication optimization and code generation for distributed memory machines , 1993, PLDI '93.
[10] Prithviraj Banerjee,et al. Advanced compilation techniques in the PARADIGM compiler for distributed-memory multicomputers , 1995, ICS '95.
[11] Martin Griebl,et al. Issues of the Automatic Generation of HPF Loop Programs , 2000, LCPC.
[12] Charles Koelbel,et al. Compiling Global Name-Space Parallel Loops for Distributed Execution , 1991, IEEE Trans. Parallel Distributed Syst.
[13] Marina C. Chen,et al. Compiling Communication-Efficient Programs for Massively Parallel Machines , 1991, IEEE Trans. Parallel Distributed Syst.
[14] Chau-Wen Tseng. An optimizing Fortran D compiler for MIMD distributed-memory machines , 1993 .
[15] Manish Gupta,et al. A methodology for high-level synthesis of communication on multicomputers , 1992, ICS '92.
[16] Larry Meadows,et al. Compiling High Performance Fortran , 1995, PPSC.
[17] Michael F. P. O'Boyle,et al. Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping , 2009, PLDI '09.
[18] Rudolf Eigenmann,et al. A hybrid approach of OpenMP for clusters , 2012, PPoPP '12.
[19] Clayton S. Ferner. The Paraguin Compiler-Message-passing Code Generation Using SUIF , 2002 .
[20] Chau-Wen Tseng,et al. An Overview of the SUIF Compiler for Scalable Parallel Machines , 1995, PPSC.
[21] Katherine Yelick,et al. Titanium: a high-performance Java dialect , 1998 .
[22] Ken Kennedy,et al. Compiling Fortran D for MIMD distributed-memory machines , 1992, CACM.
[23] David H. Bailey,et al. The NAS Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl.
[24] Martin Griebl,et al. Automatic code generation for distributed memory architectures in the polytope model , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.
[25] Robert W. Numrich,et al. Co-array Fortran for parallel programming , 1998, FORF.
[26] Thomas R. Gross,et al. Generating Communication for Array Statement: Design, Implementation, and Evaluation , 1994, J. Parallel Distributed Comput.
[27] R. F. Van der Wijngaart. Efficient implementation of a 3-dimensional ADI method on the iPSC/860 , 1993 .
[28] Uday Bondhugula. Compiling affine loop nests for distributed-memory parallel architectures , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).
[29] Clayton S. Ferner. Revisiting communication code generation algorithms for message-passing systems , 2006, Int. J. Parallel Emergent Distributed Syst.
[30] K. Kennedy,et al. Preliminary experiences with the Fortran D compiler , 1993, Supercomputing '93.
[31] Vikram S. Adve,et al. Using integer sets for data-parallel program analysis and optimization , 1998, PLDI.
[32] Rudolf Eigenmann,et al. Cetus - An Extensible Compiler Infrastructure for Source-to-Source Transformation , 2003, LCPC.
[33] Daoxu Chen,et al. JAPS: an automatic parallelizing system based on JAVA , 1999 .
[34] Alexander Schrijver,et al. Theory of linear and integer programming , 1986, Wiley-Interscience series in discrete mathematics and optimization.
[35] Jonathan Harris,et al. Compiling High Performance Fortran for Distributed-memory Systems , 1995, Digit. Tech. J.
[36] Monica S. Lam,et al. Array-data flow analysis and its use in array privatization , 1993, POPL '93.
[37] Henk J. Sips,et al. An Implementation Framework for HPF Distributed Arrays on Message-Passing Parallel Computer Systems , 1996, IEEE Trans. Parallel Distributed Syst.