Latin cubes and parallel array access

The problem of efficiently storing a d-dimensional array into multiple memory modules of a shared memory machine is an important problem in parallel processing. We consider the problem for the three-dimensional arrays. More specifically, given an array A of size n/spl times/n/spl times/n and a shared memory machine with n memory modules, we show how to store A so that no two elements within any row, any column, any diagonal of a face of A and main sub-arrays of A are stored in the same memory module. The scheme thus achieves no memory conflicts when the processors of the shared memory machine simultaneously access elements within a row, column, sub-array. etc. We also show how to store A efficiently, if diagonals of A are required to be accessed conflict-free in addition to rows and columns. All of the schemes use latin cubes.<<ETX>>

[1]  Mark A. Johnson,et al.  Solving problems on concurrent processors. Vol. 1: General techniques and regular problems , 1988 .

[2]  Ervin Gergely,et al.  A Simple Method for Constructing Doubly Diagonalized Latin Squares , 1974, J. Comb. Theory, Ser. A.

[3]  Rachid Deriche,et al.  3D edge detection using recursive filtering: application to scanner images , 1989, Proceedings CVPR '89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Paul Budnik,et al.  The Organization and Use of Parallel Memories , 1971, IEEE Transactions on Computers.

[5]  Sartaj Sahni,et al.  A Self-Routing Benes Network and Parallel Permutation Algorithms , 1981, IEEE Transactions on Computers.

[6]  Cauligi S. Raghavendra,et al.  On Array Storage for Conflict-Free Memory Access for Parallel Processors , 1988, International Conference on Parallel Processing.

[7]  Viktor K. Prasanna,et al.  Latin Squares for Parallel Array Access , 1993, IEEE Trans. Parallel Distributed Syst..

[8]  Gene H. Golub,et al.  Scientific computing: an introduction with parallel computing , 1993 .

[9]  Viktor K. Prasanna,et al.  Perfect Latin Squares , 1992, Discret. Appl. Math..

[10]  Marshall B. Long,et al.  Acquisition and representation of 2D and 3D data from turbulent flows and flames , 1989, Computer.

[11]  Cauligi S. Raghavendra,et al.  On Methods for Fast and Efficient Parallel Memory Access , 1990, ICPP.

[12]  D. T. Harper,et al.  A dynamic storage scheme for conflict-free vector access , 1989, ISCA '89.