Fast, Efficient Mutual and Self Simulations for Shared Memory and Reconfigurable Mesh

This paper studies relations between the parallel random access machine (PRAM) model and the reconfigurable mesh (RMESH) model by providing mutual simulations between the models. We present an algorithm simulating one step of an (n lg lg n)-processor CRCW PRAM on an n × n RMESH with delay O(lg lg n) with high probability. We use our PRAM simulation to obtain the first efficient self-simulation algorithm of an RMESH with general switches: an algorithm running on an n × n RMESH is simulated on a p × p RMESH with delay O((n/p)^2 + lg n lg lg p) with high probability, which is optimal for all p ≤ n/√(lg n lg lg n). Finally, we consider the simulation of the RMESH on the PRAM. We show that a 2 × n RMESH can be optimally simulated on a CRCW PRAM in Θ(α(n)) time, where α(·) is the slow-growing inverse Ackermann function. In contrast, a PRAM with a polynomial number of processors requires Ω(lg n/lg lg n) expected time to simulate the 3 × n RMESH.
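
The optimality range for the self-simulation can be checked with a short calculation. The sketch below is not taken from the paper. It assumes the delay bound is the sum (n/p)^2 + lg n lg lg p, and it assumes the standard work-counting lower bound Ω((n/p)^2) for simulating n^2 processors on p^2 processors.

```latex
% Assumption: delay D(n,p) = O((n/p)^2 + \lg n \,\lg\lg p), with p \le n.
% Assumption: any simulation of n^2 processors by p^2 processors
%             has delay \Omega((n/p)^2) (work-counting argument).
\begin{align*}
p \le \frac{n}{\sqrt{\lg n \,\lg\lg n}}
  &\;\Longrightarrow\; \left(\frac{n}{p}\right)^{2} \ge \lg n \,\lg\lg n
   \ge \lg n \,\lg\lg p \\
  &\;\Longrightarrow\; D(n,p) = O\!\left(\left(\frac{n}{p}\right)^{2}
     + \lg n \,\lg\lg p\right) = O\!\left(\left(\frac{n}{p}\right)^{2}\right).
\end{align*}
```

Under these assumptions the additive term is dominated throughout the stated range of p, so the upper bound matches the lower bound up to a constant factor.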
