Termination detection in parallel loop nests with while loops

Abstract One central problem in the execution of parallel nested loops with non-affine bounds is the precise scanning (i.e., enumeration) of the points in their iteration space and the detection of their termination. Scanning schemes have been proposed for both shared-memory and distributed-memory implementations. However, these schemes work only for perfectly nested while loops. We propose a scheme which also works for not perfectly nested while loops on shared memory. This scheme has been incorporated in our loop parallelizer LooPo.

[1]  Frédéric Vivien,et al.  Revisiting the Decomposition of Karp, Miller and Winograd , 1995, Parallel Process. Lett..

[2]  Volker Weispfenning,et al.  Simulation and Optimization by Quantifier Elimination , 1997, J. Symb. Comput..

[3]  Martin Griebl,et al.  Classifying Loops for Space-Time Mapping , 1996, Euro-Par, Vol. I.

[4]  Martin Griebl,et al.  On the parallelization of loop nests containing while loops , 1995, Proceedings the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis.

[5]  Leslie Lamport,et al.  The parallel execution of DO loops , 1974, CACM.

[6]  Michael Wolfe,et al.  High performance compilers for parallel computing , 1995 .

[7]  Martin Griebl,et al.  The Loop Parallelizer LooPo-Announcement , 1996, LCPC.

[8]  Paul Feautrier,et al.  Some efficient solutions to the affine scheduling problem. I. One-dimensional time , 1992, International Journal of Parallel Programming.

[9]  Jingling Xue Communication-Minimal Tiling of Uniform Dependence Loops , 1997, J. Parallel Distributed Comput..

[10]  Jean-Francois Collard,et al.  Automatic parallelization ofwhile-loops using speculative execution , 1995, International Journal of Parallel Programming.

[11]  P. Feautrier Array expansion , 1988 .

[12]  Martin Griebl,et al.  Code generation in the polytope model , 1998, Proceedings. 1998 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.98EX192).

[13]  Alfred V. Aho,et al.  Compilers: Principles, Techniques, and Tools , 1986, Addison-Wesley series in computer science / World student series edition.

[14]  Wayne R. Dyksen,et al.  Pipelined iterative methods for shared memory machines , 1989, Parallel Comput..

[15]  Jürgen Teich,et al.  Partitioning of processor arrays: a piecewise regular approach , 1993, Integr..

[16]  Corinne Ancourt,et al.  Scanning polyhedra with DO loops , 1991, PPOPP '91.

[17]  Paul Feautrier,et al.  Automatic Parallelization in the Polytope Model , 1996, The Data Parallel Programming Model.

[18]  Lawrence Rauchwerger,et al.  Parallelizing while loops for multiprocessor systems , 1995, Proceedings of 9th International Parallel Processing Symposium.

[19]  Steven W. K. Tjiang,et al.  SUIF: an infrastructure for research on parallelizing and optimizing compilers , 1994, SIGP.

[20]  Martin Griebl,et al.  On the Space-Time Mapping of While-Loops , 1994, Parallel Process. Lett..

[21]  Christian Lengauer,et al.  Loop Parallelization in the Polytope Model , 1993, CONCUR.

[22]  Paul Feautrier,et al.  Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time , 1992, International Journal of Parallel Programming.

[23]  Martin Griebl,et al.  A communication scheme for the distributed execution of loop nests withwhile loops , 2007, International Journal of Parallel Programming.

[24]  Michael Wolfe,et al.  Optimizing supercompilers for supercomputers , 1989, ICS.

[25]  Paul Feautrier Toward Automatic Distribution , 1994, Parallel Process. Lett..

[26]  Constantine D. Polychronopoulos,et al.  The structure of parafrase-2: an advanced parallelizing compiler for C and FORTRAN , 1990 .

[27]  Martin Griebl,et al.  Generation of Synchronous Code for Automatic Parallelization of while Loops , 1995, Euro-Par.

[28]  Yves Robert,et al.  Mapping affine loop nests: new results , 1995, HPCN Europe.

[29]  Peter S. Pacheco Parallel programming with MPI , 1996 .

[30]  Lawrence Rauchwerger,et al.  Polaris: Improving the Effectiveness of Parallelizing Compilers , 1994, LCPC.

[31]  Yves Robert,et al.  (Pen)-ultimate tiling? , 1994, Integr..