How to share memory in a distributed system

The power of shared-memory in models of parallel computation is studied, and a novel distributed data structure that eliminates the need for shared memory without significantly increasing the run time of the parallel computation is described. More specifically, it is shown how a complete network of processors can deterministically simulate one PRAM step in <italic>O</italic>(log <italic>n</italic>/(log log <italic>n</italic>)<supscrpt>2</supscrpt>) time when both models use <italic>n</italic> processors and the size of the PRAM's shared memory is polynomial in <italic>n</italic>. (The best previously known upper bound was the trivial <italic>O</italic>(<italic>n</italic>)). It is established that this upper bound is nearly optimal, and it is proved that an on-line simulation of <italic>T</italic> PRAM steps by a complete network of processors requires &OHgr;(<italic>T</italic>(log <italic>n/</italic> log log <italic>n</italic>)) time. A simple consequence of the upper bound is that an Ultracomputer (the currently feasible general-purpose parallel machine) can simulate one step of a PRAM (the most convenient parallel model to program) in <italic>O</italic>((log <italic>n</italic>)<supscrpt>2</supscrpt>log log <italic>n</italic>) steps.

[1]  丸山 徹 Convex Analysisの二,三の進展について , 1977 .

[2]  David J. Kuck,et al.  A Survey of Parallel Machine Organization and Programming , 1977, CSUR.

[3]  Robert H. Thomas,et al.  A Majority consensus approach to concurrency control for multiple copy databases , 1979, ACM Trans. Database Syst..

[4]  Zvi Galil,et al.  Explicit Constructions of Linear-Sized Superconcentrators , 1981, J. Comput. Syst. Sci..

[5]  Stephen A. Cook,et al.  Bounds on the time for parallel RAM's to compute simple functions , 1982, STOC '82.

[6]  Uzi Vishkin,et al.  Implementation of Simultaneous Memory Address Access in Models That Forbid It , 1983, J. Algorithms.

[7]  Ralph Grishman,et al.  The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer , 1983, IEEE Transactions on Computers.

[8]  Avi Wigderson,et al.  Superconcentrators, generalizers and generalized connectors with limited depth , 1983, STOC.

[9]  János Komlós,et al.  An 0(n log n) sorting network , 1983, STOC.

[10]  Eli Upfal,et al.  A probabilistic relation between desirable and feasible, models of parallel computation , 1984, STOC '84.

[11]  VishkinUzi,et al.  Randomized and deterministic simulations of PRAMs by parallel machines with restricted granularity of parallel memories , 1984 .

[12]  Uzi Vishkin,et al.  A Parallel-Design Distributed-Implementation (PDDI) General-Purpose Computer , 2011, Theor. Comput. Sci..

[13]  Frank Thomson Leighton,et al.  Tight Bounds on the Complexity of Parallel Sorting , 1984, IEEE Transactions on Computers.