Paper information - Termination detection in parallel loop nests with while loops

Abstract: One central problem in the execution of parallel nested loops with non-affine bounds is the precise scanning (i.e., enumeration) of the points in their iteration space and the detection of their termination. Scanning schemes have been proposed for both shared-memory and distributed-memory implementations. However, these schemes work only for perfectly nested while loops. We propose a scheme which also works for not perfectly nested while loops on shared memory. This scheme has been incorporated in our loop parallelizer LooPo.
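To make the terminology concrete, the following is a minimal sequential sketch of a nest of two while loops that is not perfectly nested: a statement sits between the outer and the inner loop, and neither bound is known before execution. It is only an illustration of what "scanning" the iteration space means. It is not the scheme proposed in the paper, nor code from LooPo, and the function name `scan_imperfect_nest` and the two condition callbacks are hypothetical.

```python
def scan_imperfect_nest(outer_cond, inner_cond):
    """Sequentially enumerate the iteration points of a nest of two while loops.

    Returns (outer_points, inner_points): the indices at which the statement
    in the outer body ran, and the (i, j) pairs at which the inner body ran.
    Both names and callbacks are illustrative assumptions.
    """
    outer_points, inner_points = [], []
    i = 0
    while outer_cond(i):            # outer bound is only known at run time
        outer_points.append(i)      # statement between the loops: nest is not perfect
        j = 0
        while inner_cond(i, j):     # inner bound may depend on i and on data
            inner_points.append((i, j))
            j += 1
        i += 1
    return outer_points, inner_points


# Hypothetical conditions: three outer iterations, row i has i inner iterations.
outer, inner = scan_imperfect_nest(lambda i: i < 3, lambda i, j: j < i)
print(outer)
print(inner)
```

In the sequential version, termination is simply the point at which a loop condition first evaluates to false. A parallel execution has to discover the same set of points, and the fact that no further point exists, without relying on that sequential order, which is the problem the abstract describes.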
