Truly Efficient Parallel Algorithms: 1-optimal Multisearch for an Extension of the BSP Model

In this paper we design and analyse parallel algorithms with the goal of obtaining exact bounds on their speed-ups on real machines. For this purpose we define an extension of Valiant's BSP model, BSP*, that rewards blockwise communication and uses Valiant's notion of c-optimality. Intuitively, a c-optimal parallel algorithm for p processors achieves a speed-up close to p/c. We consider the Multisearch problem: assume a strip in 2D is partitioned into m segments. Given n query points in the strip, the task is to locate, for each query, its segment. For m ≤ n we present a deterministic BSP* algorithm that is 1-optimal if n = Ω(p log^2 p). For m > n we present a randomized BSP* algorithm that is (1 + δ)-optimal for arbitrary δ > 0, m ≤ 2^p and n = Ω(p log^2 p). Both results hold for a wide range of BSP* parameters, and the range becomes larger with growing input sizes m and n. We further report on implementation work in progress. Previous parallel algorithms for Multisearch are far from c-optimal in our model and do not consider blockwise communication.
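To make the problem statement concrete, the following is a minimal sequential sketch of Multisearch, not the BSP* algorithm of the paper. It assumes, purely for illustration, that the strip spans 0 ≤ x ≤ 1 and that the m segments are separated by m - 1 non-crossing boundary lines, each given by its heights at the left and right border and sorted from bottom to top. The names `Boundary`, `below`, `locate` and `multisearch` are hypothetical. Each query is answered by binary search, so the sketch uses O(n log m) comparisons; a sequential running time of this kind is the baseline against which the speed-up of a c-optimal parallel algorithm is measured.

```python
from typing import List, Tuple

# Assumption: a boundary is a line across the strip 0 <= x <= 1,
# stored as (height at x = 0, height at x = 1).
Boundary = Tuple[float, float]
Point = Tuple[float, float]


def below(boundary: Boundary, point: Point) -> bool:
    """Return True if the boundary passes below or through the point."""
    y_left, y_right = boundary
    x, y = point
    return y_left + (y_right - y_left) * x <= y


def locate(boundaries: List[Boundary], point: Point) -> int:
    """Binary search for the index of the segment containing the point.

    Segment i lies between boundaries i - 1 and i; the result is the
    number of boundaries that pass below or through the point.
    """
    lo, hi = 0, len(boundaries)
    while lo < hi:
        mid = (lo + hi) // 2
        if below(boundaries[mid], point):
            lo = mid + 1
        else:
            hi = mid
    return lo


def multisearch(boundaries: List[Boundary], queries: List[Point]) -> List[int]:
    """Locate every query point independently (sequential baseline)."""
    return [locate(boundaries, q) for q in queries]


if __name__ == "__main__":
    # Three boundaries split the strip into m = 4 segments, numbered 0..3.
    boundaries = [(0.0, 0.0), (1.0, 2.0), (3.0, 3.0)]
    queries = [(0.5, 0.5), (0.5, 2.0), (0.5, -1.0), (0.0, 3.5)]
    print(multisearch(boundaries, queries))
```

The queries are independent of one another, which is what makes the problem a natural candidate for parallelisation; the difficulty addressed in the paper lies in distributing the search structure and the queries so that communication is blockwise and balanced.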
