Synchronization-Free Automatic Parallelization for Arbitrarily Nested Affine Loops

This paper presents a new approach for extracting synchronization-free parallelism available in program loop nests. The approach applies to arbitrarily nested parametric loop nests whose loop bounds and data accesses are affine functions of loop indices and symbolic parameters. Parallelization is realized by means of the transitive closure of a dependence graph. Parallelism is exposed by forming kernels of computations, expressed in the OpenMP standard, that are executed independently on multi-core computers. The speed-up of the parallel code produced by the approach is studied on the NAS benchmark suite, and the results of an experimental study carried out on the Intel Xeon Phi many integrated core architecture are discussed.
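To make the target of the approach concrete, the sketch below shows the kind of code it is concerned with: a parametric affine loop nest whose outer iterations form synchronization-free slices that can be run as an OpenMP parallel kernel. The loop bounds, array, and statement body here are illustrative placeholders chosen for this example; they are not taken from the paper or from the NAS benchmarks.

/* Minimal sketch (assumed example): a parametric affine loop nest whose
   iterations over i are mutually independent, so they can be executed as a
   synchronization-free OpenMP kernel. Compile with: cc -fopenmp example.c */
#include <stdio.h>
#include <stdlib.h>
#include <omp.h>

int main(int argc, char **argv) {
    /* Symbolic parameter n: bounds and accesses are affine in n and the indices. */
    int n = (argc > 1) ? atoi(argv[1]) : 1000;
    double *a = malloc((size_t)n * n * sizeof(double));

    /* Each value of i indexes an independent slice: a[i][j] depends only on
       a[i][j-1] within the same slice, so no synchronization between threads
       is required. */
    #pragma omp parallel for
    for (int i = 0; i < n; i++) {
        a[(size_t)i * n] = (double)i;
        for (int j = 1; j < n; j++)
            a[(size_t)i * n + j] = a[(size_t)i * n + j - 1] + 1.0;
    }

    printf("a[n-1][n-1] = %f\n", a[(size_t)(n - 1) * n + (n - 1)]);
    free(a);
    return 0;
}

The approach described in the paper discovers such independent slices automatically, using the transitive closure of the dependence graph, rather than relying on the programmer to identify them.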